1
|
Parallel gene size and isoform expansion of ancient neuronal genes. Curr Biol 2024; 34:1635-1645.e3. [PMID: 38460513 PMCID: PMC11043017 DOI: 10.1016/j.cub.2024.02.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 12/16/2023] [Accepted: 02/11/2024] [Indexed: 03/11/2024]
Abstract
How nervous systems evolved is a central question in biology. A diversity of synaptic proteins is thought to play a central role in the formation of specific synapses leading to nervous system complexity. The largest animal genes, often spanning hundreds of thousands of base pairs, are known to be enriched for expression in neurons at synapses and are frequently mutated or misregulated in neurological disorders and diseases. Although many of these genes have been studied independently in the context of nervous system evolution and disease, general principles underlying their parallel evolution remain unknown. To investigate this, we directly compared orthologous gene sizes across eukaryotes. By comparing relative gene sizes within organisms, we identified a distinct class of large genes with origins predating the diversification of animals and, in many cases, the emergence of neurons as dedicated cell types. We traced this class of ancient large genes through evolution and found orthologs of the large synaptic genes potentially driving the immense complexity of metazoan nervous systems, including in humans and cephalopods. Moreover, we found that while these genes are evolving under strong purifying selection, as demonstrated by low dN/dS ratios, they have simultaneously grown larger and gained the most isoforms in animals. This work provides a new lens through which to view this distinctive class of large and multi-isoform genes and demonstrates how intrinsic genomic properties, such as gene length, can provide flexibility in molecular evolution and allow groups of genes and their host organisms to evolve toward complexity.
Collapse
|
2
|
Abstract
Here, we describe the "Obelisks," a previously unrecognised class of viroid-like elements that we first identified in human gut metatranscriptomic data. "Obelisks" share several properties: (i) apparently circular RNA ~1kb genome assemblies, (ii) predicted rod-like secondary structures encompassing the entire genome, and (iii) open reading frames coding for a novel protein superfamily, which we call the "Oblins". We find that Obelisks form their own distinct phylogenetic group with no detectable sequence or structural similarity to known biological agents. Further, Obelisks are prevalent in tested human microbiome metatranscriptomes with representatives detected in ~7% of analysed stool metatranscriptomes (29/440) and in ~50% of analysed oral metatranscriptomes (17/32). Obelisk compositions appear to differ between the anatomic sites and are capable of persisting in individuals, with continued presence over >300 days observed in one case. Large scale searches identified 29,959 Obelisks (clustered at 90% nucleotide identity), with examples from all seven continents and in diverse ecological niches. From this search, a subset of Obelisks are identified to code for Obelisk-specific variants of the hammerhead type-III self-cleaving ribozyme. Lastly, we identified one case of a bacterial species (Streptococcus sanguinis) in which a subset of defined laboratory strains harboured a specific Obelisk RNA population. As such, Obelisks comprise a class of diverse RNAs that have colonised, and gone unnoticed in, human, and global microbiomes.
Collapse
|
3
|
DNA Polymerase Diversity Reveals Multiple Incursions of Polintons During Nematode Evolution. Mol Biol Evol 2023; 40:msad274. [PMID: 38069639 DOI: 10.1093/molbev/msad274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/01/2023] [Accepted: 12/04/2023] [Indexed: 12/19/2023] Open
Abstract
Polintons are double-stranded DNA, virus-like self-synthesizing transposons widely found in eukaryotic genomes. Recent metagenomic discoveries of Polinton-like viruses are consistent with the hypothesis that Polintons invade eukaryotic host genomes through infectious viral particles. Nematode genomes contain multiple copies of Polintons and provide an opportunity to explore the natural distribution and evolution of Polintons during this process. We performed an extensive search of Polintons across nematode genomes, identifying multiple full-length Polinton copies in several species. We provide evidence of both ancient Polinton integrations and recent mobility in strains of the same nematode species. In addition to the major nematode Polinton family, we identified a group of Polintons that are overall closely related to the major family but encode a distinct protein-primed DNA polymerase B (pPolB) that is related to homologs from a different group of Polintons present outside of the Nematoda. Phylogenetic analyses on the pPolBs support the evolutionary scenarios in which these extrinsic pPolBs that seem to derive from Polinton families present in oomycetes and molluscs replaced the canonical pPolB in subsets of Polintons found in terrestrial and marine nematodes, respectively, suggesting interphylum horizontal gene transfers. The pPolBs of the terrestrial nematode and oomycete Polintons share a unique feature, an insertion of an HNH nuclease domain, whereas the pPolBs in the marine nematode Polintons share an insertion of a VSR nuclease domain with marine mollusc pPolBs. We hypothesize that horizontal gene transfer occurs among Polintons from widely different but cohabiting hosts.
Collapse
|
4
|
DNA polymerase diversity reveals multiple incursions of Polintons during nematode evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.22.554363. [PMID: 37662302 PMCID: PMC10473752 DOI: 10.1101/2023.08.22.554363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Polintons are dsDNA, virus-like self-synthesizing transposons widely found in eukaryotic genomes. Recent metagenomic discoveries of Polinton-like viruses are consistent with the hypothesis that Polintons invade eukaryotic host genomes through infectious viral particles. Nematode genomes contain multiple copies of Polintons and provide an opportunity to explore the natural distribution and evolution of Polintons during this process. We performed an extensive search of Polintons across nematode genomes, identifying multiple full-length Polinton copies in several species. We provide evidence of both ancient Polinton integrations and recent mobility in strains of the same nematode species. In addition to the major nematode Polinton family, we identified a group of Polintons that are overall closely related to the major family, but encode a distinct protein-primed B family DNA polymerase (pPolB) that is related to homologs from a different group of Polintons present outside of the Nematoda . Phylogenetic analyses on the pPolBs support the evolutionary scenarios in which these extrinsic pPolBs that seem to derive from Polinton families present in oomycetes and molluscs replaced the canonical pPolB in subsets of Polintons found in terrestrial and marine nematodes, respectively, suggesting inter-phylum horizontal gene transfers. The pPolBs of the terrestrial nematode and oomycete Polintons share a unique feature, an insertion of a HNH nuclease domain, whereas the pPolBs in the marine nematode Polintons share an insertion of a VSR nuclease domain with marine mollusc pPolBs. We hypothesize that horizontal gene transfer occurs among Polintons from widely different but cohabiting hosts.
Collapse
|
5
|
Ancient origins of complex neuronal genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.28.534655. [PMID: 37034725 PMCID: PMC10081198 DOI: 10.1101/2023.03.28.534655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
How nervous systems evolved is a central question in biology. An increasing diversity of synaptic proteins is thought to play a central role in the formation of specific synapses leading to nervous system complexity. The largest animal genes, often spanning millions of base pairs, are known to be enriched for expression in neurons at synapses and are frequently mutated or misregulated in neurological disorders and diseases. While many of these genes have been studied independently in the context of nervous system evolution and disease, general principles underlying their parallel evolution remain unknown. To investigate this, we directly compared orthologous gene sizes across eukaryotes. By comparing relative gene sizes within organisms, we identified a distinct class of large genes with origins predating the diversification of animals and in many cases the emergence of dedicated neuronal cell types. We traced this class of ancient large genes through evolution and found orthologs of the large synaptic genes driving the immense complexity of metazoan nervous systems, including in humans and cephalopods. Moreover, we found that while these genes are evolving under strong purifying selection as demonstrated by low dN/dS scores, they have simultaneously grown larger and gained the most isoforms in animals. This work provides a new lens through which to view this distinctive class of large and multi-isoform genes and demonstrates how intrinsic genomic properties, such as gene length, can provide flexibility in molecular evolution and allow groups of genes and their host organisms to evolve toward complexity.
Collapse
|
6
|
Combined direct/indirect detection allows identification of DNA termini in diverse sequencing datasets and supports a multiple-initiation-site model for HIV plus-strand synthesis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.12.544617. [PMID: 37398293 PMCID: PMC10312614 DOI: 10.1101/2023.06.12.544617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Replication of genetic material involves the creation of characteristic termini. Determining these termini is important to refine our understanding of the mechanisms involved in maintaining the genomes of cellular organisms and viruses. Here we describe a computational approach combining direct and indirect readouts to detect termini from next-generation short-read sequencing. While a direct inference of termini can come from mapping the most prominent start positions of captured DNA fragments, this approach is insufficient in cases where the DNA termini are not captured, whether for biological or technical reasons. Thus, a complementary (indirect) approach to terminus detection can be applied, taking advantage of the imbalance in coverage between forward and reverse sequence reads near termini. A resulting metric ("strand bias") can be used to detect termini even where termini are naturally blocked from capture or ends are not captured during library preparation (e.g., in tagmentation-based protocols). Applying this analysis to datasets where known DNA termini are present, such as from linear double-stranded viral genomes, yielded distinct strand bias signals corresponding to these termini. To evaluate the potential to analyze a more complex situation, we applied the analysis to examine DNA termini present early after HIV infection in a cell culture model. We observed both the known termini expected based on standard models of HIV reverse transcription (the U5-right-end and U3-left-end termini) as well as a signal corresponding to a previously described additional initiation site for plus-strand synthesis (cPPT [central polypurine tract]). Interestingly, we also detected putative terminus signals at additional sites. The strongest of these are a set that share several characteristics with the previously characterized plus-strand initiation sites (the cPPT and 3' PPT [polypurine tract] sites): (i) an observed spike in directly captured cDNA ends, an indirect terminus signal evident in localized strand bias, (iii) a preference for location on the plus-strand, (iv) an upstream purine-rich motif, and (v) a decrease in terminus signal at late time points after infection. These characteristics are consistent in duplicate samples in two different genotypes (wild type and integrase-lacking HIV). The observation of distinct internal termini associated with multiple purine-rich regions raises a possibility that multiple internal initiations of plus-strand synthesis might contribute to HIV replication.
Collapse
|
7
|
Restriction Endonuclease-Based Modification-Dependent Enrichment (REMoDE) of DNA for Metagenomic Sequencing. Appl Environ Microbiol 2023; 89:e0167022. [PMID: 36519847 PMCID: PMC9888230 DOI: 10.1128/aem.01670-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 11/29/2022] [Indexed: 12/23/2022] Open
Abstract
Metagenomic sequencing is a swift and powerful tool to ascertain the presence of an organism of interest in a sample. However, sequencing coverage of the organism of interest can be insufficient due to an inundation of reads from irrelevant organisms in the sample. Here, we report a nuclease-based approach to rapidly enrich for DNA from certain organisms, including enterobacteria, based on their differential endogenous modification patterns. We exploit the ability of taxon-specific methylated motifs to resist the action of cognate methylation-sensitive restriction endonucleases that thereby digest unwanted, unmethylated DNA. Subsequently, we use a distributive exonuclease or electrophoretic separation to deplete or exclude the digested fragments, thus enriching for undigested DNA from the organism of interest. As a proof of concept, we apply this method to enrich for the enterobacteria Escherichia coli and Salmonella enterica by 11- to 142-fold from mock metagenomic samples and validate this approach as a versatile means to enrich for genomes of interest in metagenomic samples. IMPORTANCE Pathogens that contaminate the food supply or spread through other means can cause outbreaks that bring devastating repercussions to the health of a populace. Investigations to trace the source of these outbreaks are initiated rapidly but can be drawn out due to the labored methods of pathogen isolation. Metagenomic sequencing can alleviate this hurdle but is often insufficiently sensitive. The approach and implementations detailed here provide a rapid means to enrich for many pathogens involved in foodborne outbreaks, thereby improving the utility of metagenomic sequencing as a tool in outbreak investigations. Additionally, this approach provides a means to broadly enrich for otherwise minute levels of modified DNA, which may escape unnoticed in metagenomic samples.
Collapse
|
8
|
Heterologous reporter expression in the planarian Schmidtea mediterranea through somatic mRNA transfection. CELL REPORTS METHODS 2022; 2:100298. [PMID: 36313809 PMCID: PMC9606109 DOI: 10.1016/j.crmeth.2022.100298] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Revised: 06/11/2022] [Accepted: 08/25/2022] [Indexed: 06/16/2023]
Abstract
Planarians have long been studied for their regenerative abilities. Moving forward, tools for ectopic expression of non-native proteins will be of substantial value. Using a luminescent reporter to overcome the strong autofluorescence of planarian tissues, we demonstrate heterologous protein expression in planarian cells and live animals. Our approach is based on the introduction of mRNA through several nanotechnological and chemical transfection methods. We improve reporter expression by altering untranslated region (UTR) sequences and codon bias, facilitating the measurement of expression kinetics in both isolated cells and whole planarians using luminescence imaging. We also examine protein expression as a function of variations in the UTRs of delivered mRNA, demonstrating a framework to investigate gene regulation at the post-transcriptional level. Together, these advances expand the toolbox for the mechanistic analysis of planarian biology and establish a foundation for the development and expansion of transgenic techniques in this unique model system.
Collapse
|
9
|
An essential role for the piRNA pathway in regulating the ribosomal RNA pool in C. elegans. Dev Cell 2021; 56:2295-2312.e6. [PMID: 34388368 DOI: 10.1016/j.devcel.2021.07.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 06/11/2021] [Accepted: 07/15/2021] [Indexed: 01/08/2023]
Abstract
Piwi-interacting RNAs (piRNAs) are RNA effectors with key roles in maintaining genome integrity and promoting fertility in metazoans. In Caenorhabditis elegans loss of piRNAs leads to a transgenerational sterility phenotype. The plethora of piRNAs and their ability to silence transcripts with imperfect complementarity have raised several (non-exclusive) models for the underlying drivers of sterility. Here, we report the extranuclear and transferable nature of the sterility driver, its suppression via mutations disrupting the endogenous RNAi and poly-uridylation machinery, and copy-number amplification at the ribosomal DNA locus. In piRNA-deficient animals, several small interfering RNA (siRNA) populations become increasingly overabundant in the generations preceding loss of germline function, including ribosomal siRNAs (risiRNAs). A concomitant increase in uridylated sense rRNA fragments suggests that poly-uridylation may potentiate RNAi-mediated gene silencing of rRNAs. We conclude that loss of the piRNA machinery allows for unchecked amplification of siRNA populations, originating from abundant highly structured RNAs, to deleterious levels.
Collapse
|
10
|
Intron and gene size expansion during nervous system evolution. BMC Genomics 2020; 21:360. [PMID: 32410625 PMCID: PMC7222433 DOI: 10.1186/s12864-020-6760-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 04/28/2020] [Indexed: 01/07/2023] Open
Abstract
Background The evolutionary radiation of animals was accompanied by extensive expansion of gene and genome sizes, increased isoform diversity, and complexity of regulation. Results Here we show that the longest genes are enriched for expression in neuronal tissues of diverse vertebrates and of invertebrates. Additionally, we show that neuronal gene size expansion occurred predominantly through net gains in intron size, with a positional bias toward the 5′ end of each gene. Conclusions We find that intron and gene size expansion is a feature of many genes whose expression is enriched in nervous systems. We speculate that unique attributes of neurons may subject neuronal genes to evolutionary forces favoring net size expansion. This process could be associated with tissue-specific constraints on gene function and/or the evolution of increasingly complex gene regulation in nervous systems.
Collapse
|
11
|
Abstract
In numerous instances, tracking the biological significance of a nucleic acid sequence can be augmented through the identification of environmental niches in which the sequence of interest is present. Many metagenomic data sets are now available, with deep sequencing of samples from diverse biological niches. While any individual metagenomic data set can be readily queried using web-based tools, meta-searches through all such data sets are less accessible. In this brief communication, we demonstrate such a meta-metagenomic approach, examining close matches to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in all high-throughput sequencing data sets in the NCBI Sequence Read Archive accessible with the "virome" keyword. In addition to the homology to bat coronaviruses observed in descriptions of the SARS-CoV-2 sequence (F. Wu, S. Zhao, B. Yu, Y. M. Chen, et al., Nature 579:265-269, 2020, https://doi.org/10.1038/s41586-020-2008-3; P. Zhou, X. L. Yang, X. G. Wang, B. Hu, et al., Nature 579:270-273, 2020, https://doi.org/10.1038/s41586-020-2012-7), we note a strong homology to numerous sequence reads in metavirome data sets generated from the lungs of deceased pangolins reported by Liu et al. (P. Liu, W. Chen, and J. P. Chen, Viruses 11:979, 2019, https://doi.org/10.3390/v11110979). While analysis of these reads indicates the presence of a similar viral sequence in pangolin lung, the similarity is not sufficient to either confirm or rule out a role for pangolins as an intermediate host in the recent emergence of SARS-CoV-2. In addition to the implications for SARS-CoV-2 emergence, this study illustrates the utility and limitations of meta-metagenomic search tools in effective and rapid characterization of potentially significant nucleic acid sequences.IMPORTANCE Meta-metagenomic searches allow for high-speed, low-cost identification of potentially significant biological niches for sequences of interest.
Collapse
|
12
|
An Extensive Meta-Metagenomic Search Identifies SARS-CoV-2-Homologous Sequences in Pangolin Lung Viromes. mSphere 2020; 5:e00160-20. [PMID: 32376697 PMCID: PMC7203451 DOI: 10.1128/msphere.00160-20] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/24/2020] [Indexed: 12/21/2022] Open
Abstract
In numerous instances, tracking the biological significance of a nucleic acid sequence can be augmented through the identification of environmental niches in which the sequence of interest is present. Many metagenomic data sets are now available, with deep sequencing of samples from diverse biological niches. While any individual metagenomic data set can be readily queried using web-based tools, meta-searches through all such data sets are less accessible. In this brief communication, we demonstrate such a meta-metagenomic approach, examining close matches to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in all high-throughput sequencing data sets in the NCBI Sequence Read Archive accessible with the "virome" keyword. In addition to the homology to bat coronaviruses observed in descriptions of the SARS-CoV-2 sequence (F. Wu, S. Zhao, B. Yu, Y. M. Chen, et al., Nature 579:265-269, 2020, https://doi.org/10.1038/s41586-020-2008-3; P. Zhou, X. L. Yang, X. G. Wang, B. Hu, et al., Nature 579:270-273, 2020, https://doi.org/10.1038/s41586-020-2012-7), we note a strong homology to numerous sequence reads in metavirome data sets generated from the lungs of deceased pangolins reported by Liu et al. (P. Liu, W. Chen, and J. P. Chen, Viruses 11:979, 2019, https://doi.org/10.3390/v11110979). While analysis of these reads indicates the presence of a similar viral sequence in pangolin lung, the similarity is not sufficient to either confirm or rule out a role for pangolins as an intermediate host in the recent emergence of SARS-CoV-2. In addition to the implications for SARS-CoV-2 emergence, this study illustrates the utility and limitations of meta-metagenomic search tools in effective and rapid characterization of potentially significant nucleic acid sequences.IMPORTANCE Meta-metagenomic searches allow for high-speed, low-cost identification of potentially significant biological niches for sequences of interest.
Collapse
|
13
|
Transcription polymerase-catalyzed emergence of novel RNA replicons. Science 2020; 368:eaay0688. [PMID: 32217750 PMCID: PMC7445081 DOI: 10.1126/science.aay0688] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 03/09/2020] [Indexed: 12/22/2022]
Abstract
Transcription polymerases can exhibit an unusual mode of regenerating certain RNA templates from RNA, yielding systems that can replicate and evolve with RNA as the information carrier. Two classes of pathogenic RNAs (hepatitis delta virus in animals and viroids in plants) are copied by host transcription polymerases. Using in vitro RNA replication by the transcription polymerase of T7 bacteriophage as an experimental model, we identify hundreds of new replicating RNAs, define three mechanistic hallmarks of replication (subterminal de novo initiation, RNA shape-shifting, and interrupted rolling-circle synthesis), and describe emergence from DNA seeds as a mechanism for the origin of novel RNA replicons. These results inform models for the origins and replication of naturally occurring RNA genetic elements and suggest a means by which diverse RNA populations could be propagated as hereditary material in cellular contexts.
Collapse
|
14
|
Aberrant B cell repertoire selection associated with HIV neutralizing antibody breadth. Nat Immunol 2020; 21:199-209. [PMID: 31959979 PMCID: PMC7223457 DOI: 10.1038/s41590-019-0581-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Accepted: 12/12/2019] [Indexed: 11/09/2022]
Abstract
A goal of HIV vaccine development is to elicit antibodies with neutralizing breadth. Broadly neutralizing antibodies (bNAbs) to HIV often have unusual sequences with long heavy-chain complementarity-determining region loops, high somatic mutation rates and polyreactivity. A subset of HIV-infected individuals develops such antibodies, but it is unclear whether this reflects systematic differences in their antibody repertoires or is a consequence of rare stochastic events involving individual clones. We sequenced antibody heavy-chain repertoires in a large cohort of HIV-infected individuals with bNAb responses or no neutralization breadth and uninfected controls, identifying consistent features of bNAb repertoires, encompassing thousands of B cell clones per individual, with correlated T cell phenotypes. These repertoire features were not observed during chronic cytomegalovirus infection in an independent cohort. Our data indicate that the development of numerous B cell lineages with antibody features associated with autoreactivity may be a key aspect in the development of HIV neutralizing antibody breadth.
Collapse
|
15
|
Deconvolution of nucleic-acid length distributions: a gel electrophoresis analysis tool and applications. Nucleic Acids Res 2019; 47:e92. [PMID: 31226202 PMCID: PMC6895257 DOI: 10.1093/nar/gkz534] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 05/10/2019] [Accepted: 06/06/2019] [Indexed: 11/12/2022] Open
Abstract
Next-generation DNA-sequencing (NGS) technologies, which are designed to streamline the acquisition of massive amounts of sequencing data, are nonetheless dependent on various preparative steps to generate DNA fragments of required concentration, purity and average size (molecular weight). Current automated electrophoresis systems for DNA- and RNA-sample quality control, such as Agilent's Bioanalyzer® and TapeStation® products, are costly to acquire and use; they also provide limited information for samples having broad size distributions. Here, we describe a software tool that helps determine the size distribution of DNA fragments in an NGS library, or other DNA sample, based on gel-electrophoretic line profiles. The software, developed as an ImageJ plug-in, allows for straightforward processing of gel images, including lane selection and fitting of univariate functions to intensity distributions. The user selects the option of fitting either discrete profiles in cases where discrete gel bands are visible or continuous profiles, having multiple bands buried under a single broad peak. The method requires only modest imaging capabilities and is a cost-effective, rigorous alternative characterization method to augment existing techniques for library quality control.
Collapse
|
16
|
Abstract
In the course of identifying and cleaving RNA, the RNAi machinery must encounter and contend with the megadalton-sized ribosomes that carry out translation. We investigated this interface by examining the fate of actively translated mRNAs subjected to RNAi in C. elegans Quantifying RNA levels (RNA-seq) and ongoing translation (Ribo-seq), we found there is a greater fold repression of ongoing translation than expected from loss of RNA alone, observing stronger translation repression relative to RNA repression for multiple, independent double-stranded RNA triggers, and for multiple genes. In animals that lack the RNA helicase SKI complex and the ribosome rescue factor PELOTA, ribosomes stall on the 3' edges of mRNAs at and upstream of the RNAi trigger. One model to explain these observations is that ribosomes are actively cleared from mRNAs by SKI and PELO during or following mRNA cleavage. Our results expand prior studies that show a role for the SKI RNA helicase complex in removing RNA targets following RNAi in flies and plants, illuminating the widespread role of the nonstop translation surveillance in RNA silencing during RNAi. Our results are also consistent with proposals that RNAi can attack messages during active translation.
Collapse
|
17
|
Abstract
Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology.
Collapse
|
18
|
Target-dependent nickase activities of the CRISPR-Cas nucleases Cpf1 and Cas9. Nat Microbiol 2019; 4:888-897. [PMID: 30833733 PMCID: PMC6512873 DOI: 10.1038/s41564-019-0382-0] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 01/21/2019] [Indexed: 12/26/2022]
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPR) machineries are prokaryotic immune systems that have been adapted as versatile gene editing and manipulation tools. We found that CRISPR nucleases from two families, Cpf1 (also known as Cas12a) and Cas9, exhibit differential guide RNA (gRNA) sequence requirements for cleavage of the two strands of target DNA in vitro. As a consequence of the differential gRNA requirements, both Cas9 and Cpf1 enzymes can exhibit potent nickase activities on an extensive class of mismatched double-stranded DNA (dsDNA) targets. These properties allow the production of efficient nickases for a chosen dsDNA target sequence, without modification of the nuclease protein, using gRNAs with a variety of patterns of mismatch to the intended DNA target. In parallel to the nicking activities observed with purified Cas9 in vitro, we observed sequence-dependent nicking for both perfectly matched and partially mismatched target sequences in a Saccharomyces cerevisiae system. Our findings have implications for CRISPR spacer acquisition, off-target potential of CRISPR gene editing/manipulation, and tool development using homology-directed nicking.
Collapse
|
19
|
Assessment and Maintenance of Unigametic Germline Inheritance for C. elegans. Dev Cell 2019; 48:827-839.e9. [PMID: 30799227 PMCID: PMC6435406 DOI: 10.1016/j.devcel.2019.01.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 11/06/2018] [Accepted: 01/21/2019] [Indexed: 12/22/2022]
Abstract
The recent work of Besseling and Bringmann (2016) identified a molecular intervention for C. elegans in which premature segregation of maternal and paternal chromosomes in the fertilized oocyte can produce viable animals exhibiting a non-Mendelian inheritance pattern. Overexpression in embryos of a single protein regulating chromosome segregation (GPR-1) provides a germline derived clonally from a single parental gamete. We present a collection of strains and cytological assays to consistently generate and track non-Mendelian inheritance. These tools allow reproducible and high-frequency (>80%) production of non-Mendelian inheritance, the facile and simultaneous homozygosis for all nuclear chromosomes in a single generation, the precise exchange of nuclear and mitochondrial genomes between strains, and the assessments of non-canonical mitosis events. We show the utility of these strains by demonstrating a rapid assessment of cell lineage requirements (AB versus P1) for a set of genes (lin-2, lin-3, lin-12, and lin-31) with roles in C. elegans vulval development.
Collapse
|
20
|
Clonality: Point estimation. Ann Appl Stat 2019. [DOI: 10.1214/18-aoas1197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
21
|
Prospective Biopsy-Based Study of CKD of Unknown Etiology in Sri Lanka. Clin J Am Soc Nephrol 2019; 14:224-232. [PMID: 30659059 PMCID: PMC6390926 DOI: 10.2215/cjn.07430618] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 10/30/2018] [Indexed: 01/11/2023]
Abstract
BACKGROUND AND OBJECTIVES A kidney disease of unknown cause is common in Sri Lanka's lowland (dry) region. Detailed clinical characterizations of patients with biopsy-proven disease are limited, and there is no current consensus on criteria for a noninvasive diagnosis. DESIGN, SETTING, PARTICIPANTS, & MEASUREMENTS We designed a prospective study in a major Sri Lankan hospital servicing endemic areas to ascertain pathologic and clinical characteristics of and assess risk factors for primary tubulointerstitial kidney disease. We used logistic regression to determine whether common clinical characteristics could be used to predict the presence of primary tubulointerstitial kidney disease on kidney biopsy. RESULTS From 600 new patients presenting to a tertiary nephrology clinic over the course of 1 year, 87 underwent kidney biopsy, and 43 (49%) had a biopsy diagnosis of primary tubulointerstitial kidney disease. On detailed biopsy review, 13 (30%) had evidence of moderate to severe active kidney disease, and six (15%) had evidence of moderate to severe chronic tubulointerstitial kidney disease. Patients with tubulointerstitial kidney disease were exclusively born in endemic provinces; 91% spent a majority of their lifespan there. They were more likely men and farmers (risk ratio, 2.0; 95% confidence interval, 1.2 to 2.9), and they were more likely to have used tobacco (risk ratio, 1.7; 95% confidence interval, 1.0 to 2.3) and well water (risk ratio, 1.5; 95% confidence interval, 1.1 to 2.0). Three clinical characteristics-age, urine dipstick for protein, and serum albumin-could predict likelihood of tubulointerstitial kidney disease on biopsy (model sensitivity of 79% and specificity of 84%). Patients referred for kidney biopsy despite comorbid diabetes or hypertension did not experience lower odds of tubulointerstitial kidney disease. CONCLUSIONS A primary tubulointerstitial kidney disease occurs commonly in specific regions of Sri Lanka with characteristic environmental and lifestyle exposures.
Collapse
|
22
|
A Reverse Transcriptase-Cas1 Fusion Protein Contains a Cas6 Domain Required for Both CRISPR RNA Biogenesis and RNA Spacer Acquisition. Mol Cell 2018; 72:700-714.e8. [PMID: 30344094 PMCID: PMC6242336 DOI: 10.1016/j.molcel.2018.09.013] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Revised: 08/10/2018] [Accepted: 09/12/2018] [Indexed: 01/30/2023]
Abstract
Prokaryotic CRISPR-Cas systems provide adaptive immunity by integrating portions of foreign nucleic acids (spacers) into genomic CRISPR arrays. Cas6 proteins then process CRISPR array transcripts into spacer-derived RNAs (CRISPR RNAs; crRNAs) that target Cas nucleases to matching invaders. We find that a Marinomonas mediterranea fusion protein combines three enzymatic domains (Cas6, reverse transcriptase [RT], and Cas1), which function in both crRNA biogenesis and spacer acquisition from RNA and DNA. We report a crystal structure of this divergent Cas6, identify amino acids required for Cas6 activity, show that the Cas6 domain is required for RT activity and RNA spacer acquisition, and demonstrate that CRISPR-repeat binding to Cas6 regulates RT activity. Co-evolution of putative interacting surfaces suggests a specific structural interaction between the Cas6 and RT domains, and phylogenetic analysis reveals repeated, stable association of free-standing Cas6s with CRISPR RTs in multiple microbial lineages, indicating that a functional interaction between these proteins preceded evolution of the fusion.
Collapse
|
23
|
Correction: Type III CRISPR-Cas systems can provide redundancy to counteract viral escape from type I systems. eLife 2018; 7:36853. [PMID: 29616920 PMCID: PMC5884672 DOI: 10.7554/elife.36853] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
24
|
A Small RNA Isolation and Sequencing Protocol and Its Application to Assay CRISPR RNA Biogenesis in Bacteria. Bio Protoc 2018; 8:e2727. [PMID: 29600253 PMCID: PMC5870890 DOI: 10.21769/bioprotoc.2727] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2017] [Revised: 01/23/2018] [Accepted: 01/29/2018] [Indexed: 11/02/2022] Open
Abstract
Next generation high-throughput sequencing has enabled sensitive and unambiguous analysis of RNA populations in cells. Here, we describe a method for isolation and strand-specific sequencing of small RNA pools from bacteria that can be multiplexed to accommodate multiple biological samples in a single experiment. Small RNAs are isolated by polyacrylamide gel electrophoresis and treated with T4 polynucleotide kinase. This allows for 3' adapter ligation to CRISPR RNAs, which don't have pre-existing 3'-OH ends. Pre-adenylated adapters are then ligated using T4 RNA ligase 1 in the absence of ATP and with a high concentration of polyethylene glycol (PEG). The 3' capture step enables precise determination of the 3' ends of diverse RNA molecules. Additionally, a random hexamer in the ligated adapter helps control for potential downstream amplification bias. Following reverse-transcription, the cDNA product is circularized and libraries are prepared by PCR. We show that the amplified library need not be visible by gel electrophoresis for efficient sequencing of the desired product. Using this method, we routinely prepare RNA sequencing libraries from minute amounts of purified small RNA. This protocol is tailored to assay for CRISPR RNA biogenesis in bacteria through sequencing of mature CRISPR RNAs, but can be used to sequence diverse classes of small RNAs. We also provide a fully worked example of our data processing pipeline, with instructions for running the provided scripts.
Collapse
|
25
|
Circulomics: The Structural Genomics of Endogenous and Exogenous Extrachromosomal Circular DNAs. Biophys J 2018. [DOI: 10.1016/j.bpj.2017.11.498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|
26
|
Abstract
Nonsense-mediated mRNA decay is the process by which mRNAs bearing premature stop codons are recognized and cleared from the cell. While considerable information has accumulated regarding recognition of the premature stop codon, less is known about the ensuing mRNA suppression. During the characterization of a second, distinct translational surveillance pathway (nonstop mRNA decay), we trapped intermediates in nonsense mRNA degradation. We present data in support of a model wherein nonsense-mediated decay funnels into the nonstop decay pathway in Caenorhabditis elegans. Specifically, our results point to SKI-exosome decay and pelota-based ribosome removal as key steps facilitating suppression and clearance of prematurely-terminated translation complexes. These results suggest a model in which premature stop codons elicit nucleolytic cleavage, with the nonstop pathway disengaging ribosomes and degrading the resultant RNA fragments to suppress ongoing expression.
Collapse
|
27
|
A novel TRIP11-FLT3 fusion in a patient with a myeloid/lymphoid neoplasm with eosinophilia. Cancer Genet 2017; 216-217:10-15. [DOI: 10.1016/j.cancergen.2017.05.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Revised: 04/14/2017] [Accepted: 05/08/2017] [Indexed: 01/30/2023]
|
28
|
Type III CRISPR-Cas systems can provide redundancy to counteract viral escape from type I systems. eLife 2017; 6:27601. [PMID: 28826484 PMCID: PMC5576922 DOI: 10.7554/elife.27601] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2017] [Accepted: 08/07/2017] [Indexed: 12/26/2022] Open
Abstract
CRISPR-Cas-mediated defense utilizes information stored as spacers in CRISPR arrays to defend against genetic invaders. We define the mode of target interference and role in antiviral defense for two CRISPR-Cas systems in Marinomonas mediterranea. One system (type I-F) targets DNA. A second system (type III-B) is broadly capable of acquiring spacers in either orientation from RNA and DNA, and exhibits transcription-dependent DNA interference. Examining resistance to phages isolated from Mediterranean seagrass meadows, we found that the type III-B machinery co-opts type I-F CRISPR-RNAs. Sequencing and infectivity assessments of related bacterial and phage strains suggests an ‘arms race’ in which phage escape from the type I-F system can be overcome through use of type I-F spacers by a horizontally-acquired type III-B system. We propose that the phage-host arms race can drive selection for horizontal uptake and maintenance of promiscuous type III interference modules that supplement existing host type I CRISPR-Cas systems.
Collapse
|
29
|
High-Throughput Characterization of Cascade type I-E CRISPR Guide Efficacy Reveals Unexpected PAM Diversity and Target Sequence Preferences. Genetics 2017; 206:1727-1738. [PMID: 28634160 PMCID: PMC5560783 DOI: 10.1534/genetics.117.202580] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 05/29/2017] [Indexed: 12/18/2022] Open
Abstract
Interactions between Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) RNAs and CRISPR-associated (Cas) proteins form an RNA-guided adaptive immune system in prokaryotes. The adaptive immune system utilizes segments of the genetic material of invasive foreign elements in the CRISPR locus. The loci are transcribed and processed to produce small CRISPR RNAs (crRNAs), with degradation of invading genetic material directed by a combination of complementarity between RNA and DNA and in some cases recognition of adjacent motifs called PAMs (Protospacer Adjacent Motifs). Here we describe a general, high-throughput procedure to test the efficacy of thousands of targets, applying this to the Escherichia coli type I-E Cascade (CRISPR-associated complex for antiviral defense) system. These studies were followed with reciprocal experiments in which the consequence of CRISPR activity was survival in the presence of a lytic phage. From the combined analysis of the Cascade system, we found that (i) type I-E Cascade PAM recognition is more expansive than previously reported, with at least 22 distinct PAMs, with many of the noncanonical PAMs having CRISPR-interference abilities similar to the canonical PAMs; (ii) PAM positioning appears precise, with no evidence for tolerance to PAM slippage in interference; and (iii) while increased guanine-cytosine (GC) content in the spacer is associated with higher CRISPR-interference efficiency, high GC content (>62.5%) decreases CRISPR-interference efficiency. Our findings provide a comprehensive functional profile of Cascade type I-E interference requirements and a method to assay spacer efficacy that can be applied to other CRISPR-Cas systems.
Collapse
|
30
|
An Abundant Class of Non-coding DNA Can Prevent Stochastic Gene Silencing in the C. elegans Germline. Cell 2016; 166:343-357. [PMID: 27374334 PMCID: PMC4947018 DOI: 10.1016/j.cell.2016.05.072] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Revised: 03/31/2016] [Accepted: 05/18/2016] [Indexed: 11/30/2022]
Abstract
Cells benefit from silencing foreign genetic elements but must simultaneously avoid inactivating endogenous genes. Although chromatin modifications and RNAs contribute to maintenance of silenced states, the establishment of silenced regions will inevitably reflect underlying DNA sequence and/or structure. Here we demonstrate that a pervasive non-coding DNA feature in Caenorhabditis elegans, characterized by 10-basepair periodic An/Tn-clusters (PATCs), can license transgenes for germline expression within repressive chromatin domains. Transgenes containing natural or synthetic PATCs are resistant to position effect variegation and stochastic silencing in the germline. Among endogenous genes, intron length and PATC-character undergo dramatic changes as orthologs move from active to repressive chromatin over evolutionary time, indicating a dynamic character to the An/Tn periodicity. We propose that PATCs form the basis of a cellular immune system, identifying certain endogenous genes in heterochromatic contexts as privileged while foreign DNA can be suppressed with no requirement for a cellular memory of prior exposure.
Collapse
|
31
|
Distinct patterns of Cas9 mismatch tolerance in vitro and in vivo. Nucleic Acids Res 2016; 44:5365-77. [PMID: 27198218 PMCID: PMC4914125 DOI: 10.1093/nar/gkw417] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 04/27/2016] [Accepted: 04/30/2016] [Indexed: 12/26/2022] Open
Abstract
Cas9, a CRISPR-associated RNA-guided nuclease, has been rapidly adopted as a tool for biochemical and genetic manipulation of DNA. Although complexes between Cas9 and guide RNAs (gRNAs) offer remarkable specificity and versatility for genome manipulation, mis-targeted events occur. To extend the understanding of gRNA::target homology requirements, we compared mutational tolerance for a set of Cas9::gRNA complexes in vitro and in vivo (in Saccharomyces cerevisiae). A variety of gRNAs were tested with variant libraries based on four different targets (with varying GC content and sequence features). In each case, we challenged a mixture of matched and mismatched targets, evaluating cleavage activity on a wide variety of potential target sequences in parallel through high-throughput sequencing of the products retained after cleavage. These experiments evidenced notable and consistent differences between in vitro and S. cerevisiae (in vivo) Cas9 cleavage specificity profiles including (i) a greater tolerance for mismatches in vitro and (ii) a greater specificity increase in vivo with truncation of the gRNA homology regions.
Collapse
|
32
|
A streamlined tethered chromosome conformation capture protocol. BMC Genomics 2016; 17:274. [PMID: 27036078 PMCID: PMC4818521 DOI: 10.1186/s12864-016-2596-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 03/16/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Identification of locus-locus contacts at the chromatin level provides a valuable foundation for understanding of nuclear architecture and function and a valuable tool for inferring long-range linkage relationships. As one approach to this, chromatin conformation capture-based techniques allow creation of genome spatial organization maps. While such approaches have been available for some time, methodological advances will be of considerable use in minimizing both time and input material required for successful application. RESULTS Here we report a modified tethered conformation capture protocol that utilizes a series of rapid and efficient molecular manipulations. We applied the method to Caenorhabditis elegans, obtaining chromatin interaction maps that provide a sequence-anchored delineation of salient aspects of Caenorhabditis elegans chromosome structure, demonstrating a high level of consistency in overall chromosome organization between biological samples collected under different conditions. In addition to the application of the method to defining nuclear architecture, we found the resulting chromatin interaction maps to be of sufficient resolution and sensitivity to enable detection of large-scale structural variants such as inversions or translocations. CONCLUSION Our streamlined protocol provides an accelerated, robust, and broadly applicable means of generating chromatin spatial organization maps and detecting genome rearrangements without a need for cellular or chromatin fractionation.
Collapse
|
33
|
Direct CRISPR spacer acquisition from RNA by a natural reverse transcriptase-Cas1 fusion protein. Science 2016; 351:aad4234. [PMID: 26917774 DOI: 10.1126/science.aad4234] [Citation(s) in RCA: 138] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
CRISPR systems mediate adaptive immunity in diverse prokaryotes. CRISPR-associated Cas1 and Cas2 proteins have been shown to enable adaptation to new threats in type I and II CRISPR systems by the acquisition of short segments of DNA (spacers) from invasive elements. In several type III CRISPR systems, Cas1 is naturally fused to a reverse transcriptase (RT). In the marine bacterium Marinomonas mediterranea (MMB-1), we showed that a RT-Cas1 fusion protein enables the acquisition of RNA spacers in vivo in a RT-dependent manner. In vitro, the MMB-1 RT-Cas1 and Cas2 proteins catalyze the ligation of RNA segments into the CRISPR array, which is followed by reverse transcription. These observations outline a host-mediated mechanism for reverse information flow from RNA to DNA.
Collapse
|
34
|
Maturation Pathway from Germline to Broad HIV-1 Neutralizer of a CD4-Mimic Antibody. Cell 2016; 165:449-63. [PMID: 26949186 DOI: 10.1016/j.cell.2016.02.022] [Citation(s) in RCA: 228] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Revised: 12/11/2015] [Accepted: 02/08/2016] [Indexed: 01/16/2023]
Abstract
Antibodies with ontogenies from VH1-2 or VH1-46-germline genes dominate the broadly neutralizing response against the CD4-binding site (CD4bs) on HIV-1. Here, we define with longitudinal sampling from time-of-infection the development of a VH1-46-derived antibody lineage that matured to neutralize 90% of HIV-1 isolates. Structures of lineage antibodies CH235 (week 41 from time-of-infection, 18% breadth), CH235.9 (week 152, 77%), and CH235.12 (week 323, 90%) demonstrated the maturing epitope to focus on the conformationally invariant portion of the CD4bs. Similarities between CH235 lineage and five unrelated CD4bs lineages in epitope focusing, length-of-time to develop breadth, and extraordinary level of somatic hypermutation suggested commonalities in maturation among all CD4bs antibodies. Fortunately, the required CH235-lineage hypermutation appeared substantially guided by the intrinsic mutability of the VH1-46 gene, which closely resembled VH1-2. We integrated our CH235-lineage findings with a second broadly neutralizing lineage and HIV-1 co-evolution to suggest a vaccination strategy for inducing both lineages.
Collapse
|
35
|
Cas9 Variants Expand the Target Repertoire in Caenorhabditis elegans. Genetics 2016; 202:381-8. [PMID: 26680661 PMCID: PMC4788222 DOI: 10.1534/genetics.115.185041] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Accepted: 12/10/2015] [Indexed: 12/26/2022] Open
Abstract
The proliferation of CRISPR/Cas9-based methods in Caenorhabditis elegans has enabled efficient genome editing and precise genomic tethering of Cas9 fusion proteins. Experimental designs using CRISPR/Cas9 are currently limited by the need for a protospacer adjacent motif (PAM) in the target with the sequence NGG. Here we report the characterization of two modified Cas9 proteins in C. elegans that recognize NGA and NGCG PAMs. We found that each variant could stimulate homologous recombination with a donor template at multiple loci and that PAM specificity was comparable to that of wild-type Cas9. To directly compare effectiveness, we used CRISPR/Cas9 genome editing to generate a set of assay strains with a common single-guide RNA (sgRNA) target sequence, but that differ in the juxtaposed PAM (NGG, NGA, or NGCG). In this controlled setting, we determined that the NGA PAM Cas9 variant can be as effective as wild-type Cas9. We similarly edited a genomic target to study the influence of the base following the NGA PAM. Using four strains with four NGAN PAMs differing only at the fourth position and adjacent to the same sgRNA target, we observed that efficient homologous replacement was attainable with any base in the fourth position, with an NGAG PAM being the most effective. In addition to demonstrating the utility of two Cas9 mutants in C. elegans and providing reagents that permit CRISPR/Cas9 experiments with fewer restrictions on potential targets, we established a means to benchmark the efficiency of different Cas9::PAM combinations that avoids variations owing to differences in the sgRNA sequence.
Collapse
|
36
|
Persistence and evolution of allergen-specific IgE repertoires during subcutaneous specific immunotherapy. J Allergy Clin Immunol 2015; 137:1535-44. [PMID: 26559321 DOI: 10.1016/j.jaci.2015.09.027] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2015] [Revised: 07/24/2015] [Accepted: 09/23/2015] [Indexed: 10/22/2022]
Abstract
BACKGROUND Specific immunotherapy (SIT) is the only treatment with proved long-term curative potential in patients with allergic disease. Allergen-specific IgE is the causative agent of allergic disease, and antibodies contribute to SIT, but the effects of SIT on aeroallergen-specific B-cell repertoires are not well understood. OBJECTIVE We sought to characterize the IgE sequences expressed by allergen-specific B cells and track the fate of these B-cell clones during SIT. METHODS We used high-throughput antibody gene sequencing and identification of allergen-specific IgE with combinatorial antibody fragment library technology to analyze immunoglobulin repertoires of blood and the nasal mucosa from aeroallergen-sensitized subjects before and during the first year of subcutaneous SIT. RESULTS Of 52 distinct allergen-specific IgE heavy chains from 8 allergic donors, 37 were also detected by using high-throughput antibody gene sequencing of blood samples, nasal mucosal samples, or both. The allergen-specific clones had increased persistence, higher likelihood of belonging to clones expressing other switched isotypes, and possibly larger clone size than the rest of the IgE repertoire. Clone members in nasal tissue showed close mutational relationships. CONCLUSION In the future, combining functional binding studies, deep antibody repertoire sequencing, and information on clinical outcomes in larger studies might aid assessment of SIT mechanisms and efficacy.
Collapse
|
37
|
Functional relevance of "seed" and "non-seed" sequences in microRNA-mediated promotion of C. elegans developmental progression. RNA (NEW YORK, N.Y.) 2015; 21:1980-1992. [PMID: 26385508 PMCID: PMC4604436 DOI: 10.1261/rna.053793.115] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/26/2012] [Accepted: 08/05/2015] [Indexed: 06/05/2023]
Abstract
The founding heterochronic microRNAs, lin-4 and let-7, together with their validated targets and well-characterized phenotypes in C. elegans, offer an opportunity to test functionality of microRNAs in a developmental context. In this study, we defined sequence requirements at the microRNA level for these two microRNAs, evaluating lin-4 and let-7 mutant microRNAs for their ability to support temporal development under conditions where the wild-type lin-4 and let-7 gene products are absent. For lin-4, we found a strong requirement for seed sequences, with function drastically affected by several central mutations in the seed sequence, while rescue was retained by a set of mutations peripheral to the seed. let-7 rescuing activity was retained to a surprising degree by a variety of central seed mutations, while several non-seed mutant effects support potential noncanonical contributions to let-7 function. Taken together, this work illustrates both the functional partnership between seed and non-seed sequences in mediating C. elegans temporal development and a diversity among microRNA effectors in the contributions of seed and non-seed regions to activity.
Collapse
|
38
|
A requirement for ERK-dependent Dicer phosphorylation in coordinating oocyte-to-embryo transition in C. elegans. Dev Cell 2014; 31:614-28. [PMID: 25490268 PMCID: PMC4261158 DOI: 10.1016/j.devcel.2014.11.004] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Revised: 08/12/2014] [Accepted: 11/04/2014] [Indexed: 02/03/2023]
Abstract
Signaling pathways and small RNAs direct diverse cellular events, but few examples are known of defined signaling pathways directly regulating small RNA biogenesis. We show that ERK phosphorylates Dicer on two conserved residues in its RNase IIIb and double-stranded RNA (dsRNA)-binding domains and that phosphorylation of these residues is necessary and sufficient to trigger Dicer's nuclear translocation in worms, mice, and human cells. Phosphorylation of Dicer on either site inhibits Dicer function in the female germline and dampens small RNA repertoire. Our data demonstrate that ERK phosphorylates and inhibits Dicer during meiosis I for oogenesis to proceed normally in Caenorhabditis elegans and that this inhibition is released before fertilization for embryogenesis to proceed normally. The conserved Dicer residues, their phosphorylation by ERK, and the consequences of the resulting modifications implicate an ERK-Dicer nexus as a fundamental component of the oocyte-to-embryo transition and an underlying mechanism coupling extracellular cues to small RNA production.
Collapse
|
39
|
Abstract
To study target sequence specificity, selectivity, and reaction kinetics of Streptococcus pyogenes Cas9 activity, we challenged libraries of random variant targets with purified Cas9::guide RNA complexes in vitro. Cleavage kinetics were nonlinear, with a burst of initial activity followed by slower sustained cleavage. Consistent with other recent analyses of Cas9 sequence specificity, we observe considerable (albeit incomplete) impairment of cleavage for targets mutated in the PAM sequence or in ‘seed’ sequences matching the proximal 8 bp of the guide. A second target region requiring close homology was located at the other end of the guide::target duplex (positions 13–18 relative to the PAM). Sequences flanking the guide+PAM region had measurable (albeit modest) effects on cleavage. In addition, the first-base Guanine constraint commonly imposed by gRNA expression systems has little effect on overall cleavage efficiency. Taken together, these studies provide an in vitro understanding of the complexities of Cas9–gRNA interaction and cleavage beyond the general paradigm of site determination based on the ‘seed’ sequence and PAM.
Collapse
|
40
|
Human responses to influenza vaccination show seroconversion signatures and convergent antibody rearrangements. Cell Host Microbe 2014; 16:105-14. [PMID: 24981332 DOI: 10.1016/j.chom.2014.05.013] [Citation(s) in RCA: 180] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Revised: 03/17/2014] [Accepted: 05/16/2014] [Indexed: 01/08/2023]
Abstract
B cells produce a diverse antibody repertoire by undergoing gene rearrangements. Pathogen exposure induces the clonal expansion of B cells expressing antibodies that can bind the infectious agent. To assess human B cell responses to trivalent seasonal influenza and monovalent pandemic H1N1 vaccination, we sequenced gene rearrangements encoding the immunoglobulin heavy chain, a major determinant of epitope recognition. The magnitude of B cell clonal expansions correlates with an individual's secreted antibody response to the vaccine, and the expanded clones are enriched with those expressing influenza-specific monoclonal antibodies. Additionally, B cell responses to pandemic influenza H1N1 vaccination and infection in different people show a prominent family of convergent antibody heavy chain gene rearrangements specific to influenza antigens. These results indicate that microbes can induce specific signatures of immunoglobulin gene rearrangements and that pathogen exposure can potentially be assessed from B cell repertoires.
Collapse
|
41
|
Convergent antibody signatures in human dengue. Cell Host Microbe 2013; 13:691-700. [PMID: 23768493 DOI: 10.1016/j.chom.2013.05.008] [Citation(s) in RCA: 192] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 04/03/2013] [Accepted: 04/30/2013] [Indexed: 01/05/2023]
Abstract
Dengue is the most prevalent mosquito-borne viral disease in humans, and the lack of early prognostics, vaccines, and therapeutics contributes to immense disease burden. To identify patterns that could be used for sequence-based monitoring of the antibody response to dengue, we examined antibody heavy-chain gene rearrangements in longitudinal peripheral blood samples from 60 dengue patients. Comparing signatures between acute dengue, postrecovery, and healthy samples, we found increased expansion of B cell clones in acute dengue patients, with higher overall clonality in secondary infection. Additionally, we observed consistent antibody sequence features in acute dengue in the highly variable major antigen-binding determinant, complementarity-determining region 3 (CDR3), with specific CDR3 sequences highly enriched in acute samples compared to postrecovery, healthy, or non-dengue samples. Dengue thus provides a striking example of a human viral infection where convergent immune signatures can be identified in multiple individuals. Such signatures could facilitate surveillance of immunological memory in communities.
Collapse
|
42
|
Effects of aging, cytomegalovirus infection, and EBV infection on human B cell repertoires. THE JOURNAL OF IMMUNOLOGY 2013; 192:603-11. [PMID: 24337376 DOI: 10.4049/jimmunol.1301384] [Citation(s) in RCA: 148] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Elderly humans show decreased humoral immunity to pathogens and vaccines, yet the effects of aging on B cells are not fully known. Chronic viral infection by CMV is implicated as a driver of clonal T cell proliferations in some aging humans, but whether CMV or EBV infection contributes to alterations in the B cell repertoire with age is unclear. We have used high-throughput DNA sequencing of IGH gene rearrangements to study the BCR repertoires over two successive years in 27 individuals ranging in age from 20 to 89 y. Some features of the B cell repertoire remain stable with age, but elderly subjects show increased numbers of B cells with long CDR3 regions, a trend toward accumulation of more highly mutated IgM and IgG Ig genes, and persistent clonal B cell populations in the blood. Seropositivity for CMV or EBV infection alters B cell repertoires, regardless of the individual's age: EBV infection correlates with the presence of persistent clonal B cell expansions, whereas CMV infection correlates with the proportion of highly mutated Ab genes. These findings isolate effects of aging from those of chronic viral infection on B cell repertoires and provide a baseline for understanding human B cell responses to vaccination or infectious stimuli.
Collapse
|
43
|
Unusual DNA packaging characteristics in endoreduplicated Caenorhabditis elegans oocytes defined by in vivo accessibility to an endogenous nuclease activity. Epigenetics Chromatin 2013; 6:37. [PMID: 24279402 PMCID: PMC3819648 DOI: 10.1186/1756-8935-6-37] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 09/17/2013] [Indexed: 01/29/2023] Open
Abstract
Background Germ cells in animals are highly specialized to preserve the genome. A distinct set of chromatin structures must be properly established in germ cells to maintain cell fate and genome integrity. We describe DNA-surface interactions in activated Caenorhabditis elegans oocytes that are revealed through the activity of an endogenous nuclease ('endocleavage’). Results Our analysis began with an unexpected observation that a majority (>50%) of DNA from ovulated but unfertilized C. elegans oocytes can be recovered in fragments of approximately 500 base pairs or shorter, cleaved at regular intervals (10 to 11 nt) along the DNA helix. In some areas of the genome, DNA cleavage patterns in these endoreduplicated oocytes appear consistent from cell-to-cell, indicating coherent rotational positioning of the DNA in chromatin. Particularly striking in this analysis are arrays of sensitive sites with a periodicity of approximately 10 bp that persist for several hundred base pairs of genomic DNA, longer than a single nucleosome core. Genomic regions with a strong bias toward a 10-nt periodic occurrence of A(n)/T(n) (so-called PATC regions) appear to exhibit a high degree of rotational constraint in endocleavage phasing, with a strong tendency for the periodic A(n)/T(n) sites to remain on the face of the helix protected from nuclease digestion. Conclusion The present analysis provides evidence for an unusual structure in C. elegans oocytes in which genomic DNA and associated protein structures are coherently linked.
Collapse
|
44
|
Comprehensive whole-genome sequencing of an early-stage primary myelofibrosis patient defines low mutational burden and non-recurrent candidate genes. Haematologica 2013; 98:1689-96. [PMID: 23872309 DOI: 10.3324/haematol.2013.092379] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
In order to identify novel somatic mutations associated with classic BCR/ABL1-negative myeloproliferative neoplasms, we performed high-coverage genome sequencing of DNA from peripheral blood granulocytes and cultured skin fibroblasts from a patient with MPL W515K-positive primary myelofibrosis. The primary myelofibrosis genome had a low somatic mutation rate, consistent with that observed in similar hematopoietic tumor genomes. Interfacing of whole-genome DNA sequence data with RNA expression data identified three somatic mutations of potential functional significance: i) a nonsense mutation in CARD6, implicated in modulation of NF-kappaB activation; ii) a 19-base pair deletion involving a potential regulatory region in the 5'-untranslated region of BRD2, implicated in transcriptional regulation and cell cycle control; and iii) a non-synonymous point mutation in KIAA0355, an uncharacterized protein. Additional mutations in three genes (CAP2, SOX30, and MFRP) were also evident, albeit with no support for expression at the RNA level. Re-sequencing of these six genes in 178 patients with polycythemia vera, essential thrombocythemia, and myelofibrosis did not identify recurrent somatic mutations in these genes. Finally, we describe methods for reducing false-positive variant calls in the analysis of hematologic malignancies with a low somatic mutation rate. This trial is registered with ClinicalTrials.gov (NCT01108159).
Collapse
|
45
|
Minicircle DNA vectors achieve sustained expression reflected by active chromatin and transcriptional level. Mol Ther 2012. [PMID: 23183534 DOI: 10.1038/mt.2012.244] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Current efforts in nonviral gene therapy are plagued by a pervasive difficulty in sustaining therapeutic levels of delivered transgenes. Minicircles (plasmid derivatives with the same expression cassette but lacking a bacterial backbone) show sustained expression and hold promise for therapeutic use where persistent transgene expression is required. To characterize the widely-observed silencing process affecting expression of foreign DNA in mammals, we used a system in which mouse liver presented with either plasmid or minicircle consistently silences plasmid but not minicircle expression. We found that preferential silencing of plasmid DNA occurs at a nuclear stage that precedes transport of mRNA to the cytoplasm, evident from a consistent >25-fold minicircle/plasmid transcript difference observed in both nuclear and total RNA. Among possible mechanisms of nuclear silencing, our data favor chromatin-linked transcriptional blockage rather than targeted degradation, aberrant processing, or compromised mRNA transport. In particular, we observe dramatic enrichment of H3K27 trimethylation on plasmid sequences. Also, it appears that Pol II can engage the modified plasmid chromatin, potentially in a manner that is not productive in the synthesis of high levels of new transcript. We outline a scenario in which sustained differences at the chromatin level cooperate to determine the activity of foreign DNA.
Collapse
|
46
|
Antibody lineages with evidence of somatic hypermutation persisting for >4 years in a South African subject with broad neutralizing activity. Retrovirology 2012. [PMCID: PMC3442051 DOI: 10.1186/1742-4690-9-s2-p85] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
47
|
The inference of phased haplotypes for the immunoglobulin H chain V region gene loci by analysis of VDJ gene rearrangements. THE JOURNAL OF IMMUNOLOGY 2011; 188:1333-40. [PMID: 22205028 DOI: 10.4049/jimmunol.1102097] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
The existence of many highly similar genes in the lymphocyte receptor gene loci makes them difficult to investigate, and the determination of phased "haplotypes" has been particularly problematic. However, V(D)J gene rearrangements provide an opportunity to infer the association of Ig genes along the chromosomes. The chromosomal distribution of H chain genes in an Ig genotype can be inferred through analysis of VDJ rearrangements in individuals who are heterozygous at points within the IGH locus. We analyzed VDJ rearrangements from 44 individuals for whom sufficient unique rearrangements were available to allow comprehensive genotyping. Nine individuals were identified who were heterozygous at the IGHJ6 locus and for whom sufficient suitable VDJ rearrangements were available to allow comprehensive haplotyping. Each of the 18 resulting IGHV│IGHD│IGHJ haplotypes was unique. Apparent deletion polymorphisms were seen that involved as many as four contiguous, functional IGHV genes. Two deletion polymorphisms involving multiple contiguous IGHD genes were also inferred. Three previously unidentified gene duplications were detected, where two sequences recognized as allelic variants of a single gene were both inferred to be on a single chromosome. Phased genomic data brings clarity to the study of the contribution of each gene to the available repertoire of rearranged VDJ genes. Analysis of rearrangement frequencies suggests that particular genes may have substantially different yet predictable propensities for rearrangement within different haplotypes. Together with data highlighting the extent of haplotypic variation within the population, this suggests that there may be substantial variability in the available Ab repertoires of different individuals.
Collapse
|
48
|
Initial antibodies binding to HIV-1 gp41 in acutely infected subjects are polyreactive and highly mutated. J Exp Med 2011; 208:2237-49. [PMID: 21987658 PMCID: PMC3201211 DOI: 10.1084/jem.20110363] [Citation(s) in RCA: 167] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2011] [Accepted: 08/31/2011] [Indexed: 01/20/2023] Open
Abstract
The initial antibody response to HIV-1 is targeted to envelope (Env) gp41, and is nonneutralizing and ineffective in controlling viremia. To understand the origins and characteristics of gp41-binding antibodies produced shortly after HIV-1 transmission, we isolated and studied gp41-reactive plasma cells from subjects acutely infected with HIV-1. The frequencies of somatic mutations were relatively high in these gp41-reactive antibodies. Reverted unmutated ancestors of gp41-reactive antibodies derived from subjects acutely infected with HIV-1 frequently did not react with autologous HIV-1 Env; however, these antibodies were polyreactive and frequently bound to host or bacterial antigens. In one large clonal lineage of gp41-reactive antibodies, reactivity to HIV-1 Env was acquired only after somatic mutations. Polyreactive gp41-binding antibodies were also isolated from uninfected individuals. These data suggest that the majority of gp41-binding antibodies produced after acute HIV-1 infection are cross-reactive responses generated by stimulating memory B cells that have previously been activated by non-HIV-1 antigens.
Collapse
|
49
|
Competition between ADAR and RNAi pathways for an extensive class of RNA targets. Nat Struct Mol Biol 2011; 18:1094-101. [PMID: 21909095 PMCID: PMC3190075 DOI: 10.1038/nsmb.2129] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2011] [Accepted: 07/28/2011] [Indexed: 12/22/2022]
Abstract
Adenosine deaminases that act on RNAs (ADARs) interact with double-stranded RNAs, deaminating adenosines to inosines. Previous studies of Caenorhabditis elegans suggested an antagonistic interaction between ADAR and RNAi machineries, with ADAR defects suppressed upon additional knockout of RNAi. These results suggest a pool of common RNA substrates capable of engaging both pathways. To define and characterize such substrates, we examined small RNA and mRNA populations of ADAR mutants and identified a distinct set of loci from which RNAi-dependent short RNAs are dramatically upregulated. At these same loci, we observe populations of multiply edited transcripts, supporting a specific role for ADARs in preventing access to the RNAi pathway for an extensive population of dsRNAs. Characterization of these loci reveal an extensive overlap with non-coding and intergenic regions, suggesting that the landscape of ADAR targets may extend beyond previously annotated classes of transcripts.
Collapse
|
50
|
Abstract
The discovery of microRNAs (miRNAs) lin-4 and let-7 as temporal regulators in Caenorhabditis elegans led to broader searches for novel miRNAs and their biological roles. Unlike protein-coding genes and some long noncoding RNAs, canonical metazoan miRNAs are not known to contain introns within their genomic precursor sequences. Because the short length of miRNAs complicates a statistically definitive assignment of split genes in RNA sequencing data sets, we took an experimental approach toward testing the compatibility of splicing and functional miRNA biogenesis. To definitively evaluate the possibility that miRNAs could derive from interrupted genes, we constructed intron-interrupted variants of C. elegans lin-4 and assayed for their miRNA-encoding capability and biological activity in the developing organism. Our studies indicate that (1) intron-containing miRNAs (inc-miRs) can be efficiently spliced and processed to produce miRNAs with normal termini, and (2) these miRNAs can be functional in full rescue of developmental phenotypes in null mutants lacking endogenous lin-4. This study provides the first evidence to support the ability of intron-interrupted miRNA precursors to produce functional regulators and identifies an additional modality available for metazoan miRNA production.
Collapse
|