1
|
Genomic data resources of the Brain Somatic Mosaicism Network for neuropsychiatric diseases. Sci Data 2023; 10:813. [PMID: 37985666 PMCID: PMC10662356 DOI: 10.1038/s41597-023-02645-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 10/16/2023] [Indexed: 11/22/2023] Open
Abstract
Somatic mosaicism is defined as an occurrence of two or more populations of cells having genomic sequences differing at given loci in an individual who is derived from a single zygote. It is a characteristic of multicellular organisms that plays a crucial role in normal development and disease. To study the nature and extent of somatic mosaicism in autism spectrum disorder, bipolar disorder, focal cortical dysplasia, schizophrenia, and Tourette syndrome, a multi-institutional consortium called the Brain Somatic Mosaicism Network (BSMN) was formed through the National Institute of Mental Health (NIMH). In addition to genomic data of affected and neurotypical brains, the BSMN also developed and validated a best practices somatic single nucleotide variant calling workflow through the analysis of reference brain tissue. These resources, which include >400 terabytes of data from 1087 subjects, are now available to the research community via the NIMH Data Archive (NDA) and are described here.
Collapse
|
2
|
Schizophrenia-associated somatic copy-number variants from 12,834 cases reveal recurrent NRXN1 and ABCB11 disruptions. CELL GENOMICS 2023; 3:100356. [PMID: 37601975 PMCID: PMC10435376 DOI: 10.1016/j.xgen.2023.100356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 02/21/2022] [Accepted: 06/09/2023] [Indexed: 08/22/2023]
Abstract
While germline copy-number variants (CNVs) contribute to schizophrenia (SCZ) risk, the contribution of somatic CNVs (sCNVs)-present in some but not all cells-remains unknown. We identified sCNVs using blood-derived genotype arrays from 12,834 SCZ cases and 11,648 controls, filtering sCNVs at loci recurrently mutated in clonal blood disorders. Likely early-developmental sCNVs were more common in cases (0.91%) than controls (0.51%, p = 2.68e-4), with recurrent somatic deletions of exons 1-5 of the NRXN1 gene in five SCZ cases. Hi-C maps revealed ectopic, allele-specific loops forming between a potential cryptic promoter and non-coding cis-regulatory elements upon 5' deletions in NRXN1. We also observed recurrent intragenic deletions of ABCB11, encoding a transporter implicated in anti-psychotic response, in five treatment-resistant SCZ cases and showed that ABCB11 is specifically enriched in neurons forming mesocortical and mesolimbic dopaminergic projections. Our results indicate potential roles of sCNVs in SCZ risk.
Collapse
|
3
|
Sex Differences in the Human Brain Transcriptome of Cases With Schizophrenia. Biol Psychiatry 2022; 91:92-101. [PMID: 34154796 PMCID: PMC8463632 DOI: 10.1016/j.biopsych.2021.03.020] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 03/16/2021] [Accepted: 03/18/2021] [Indexed: 01/03/2023]
Abstract
BACKGROUND While schizophrenia differs between males and females in the age of onset, symptomatology, and disease course, the molecular mechanisms underlying these differences remain uncharacterized. METHODS To address questions about the sex-specific effects of schizophrenia, we performed a large-scale transcriptome analysis of RNA sequencing data from 437 controls and 341 cases from two distinct cohorts from the CommonMind Consortium. RESULTS Analysis across the cohorts identified a reproducible gene expression signature of schizophrenia that was highly concordant with previous work. Differential expression across sex was reproducible across cohorts and identified X- and Y-linked genes, as well as those involved in dosage compensation. Intriguingly, the sex expression signature was also enriched for genes involved in neurexin family protein binding and synaptic organization. Differential expression analysis testing a sex-by-diagnosis interaction effect did not identify any genome-wide signature after multiple testing corrections. Gene coexpression network analysis was performed to reduce dimensionality from thousands of genes to dozens of modules and elucidate interactions among genes. We found enrichment of coexpression modules for sex-by-diagnosis differential expression signatures, which were highly reproducible across the two cohorts and involved a number of diverse pathways, including neural nucleus development, neuron projection morphogenesis, and regulation of neural precursor cell proliferation. CONCLUSIONS Overall, our results indicate that the effect size of sex differences in schizophrenia gene expression signatures is small and underscore the challenge of identifying robust sex-by-diagnosis signatures, which will require future analyses in larger cohorts.
Collapse
|
4
|
Comprehensive identification of somatic nucleotide variants in human brain tissue. Genome Biol 2021; 22:92. [PMID: 33781308 PMCID: PMC8006362 DOI: 10.1186/s13059-021-02285-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 02/01/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. RESULTS Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. CONCLUSIONS This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.
Collapse
|
5
|
EKLF/KLF1 expression defines a unique macrophage subset during mouse erythropoiesis. eLife 2021; 10:61070. [PMID: 33570494 PMCID: PMC7932694 DOI: 10.7554/elife.61070] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 02/10/2021] [Indexed: 12/17/2022] Open
Abstract
Erythroblastic islands are a specialized niche that contain a central macrophage surrounded by erythroid cells at various stages of maturation. However, identifying the precise genetic and transcriptional control mechanisms in the island macrophage remains difficult due to macrophage heterogeneity. Using unbiased global sequencing and directed genetic approaches focused on early mammalian development, we find that fetal liver macrophages exhibit a unique expression signature that differentiates them from erythroid and adult macrophage cells. The importance of erythroid Krüppel-like factor (EKLF)/KLF1 in this identity is shown by expression analyses in EKLF-/- and in EKLF-marked macrophage cells. Single-cell sequence analysis simplifies heterogeneity and identifies clusters of genes important for EKLF-dependent macrophage function and novel cell surface biomarkers. Remarkably, this singular set of macrophage island cells appears transiently during embryogenesis. Together, these studies provide a detailed perspective on the importance of EKLF in the establishment of the dynamic gene expression network within erythroblastic islands in the developing embryo and provide the means for their efficient isolation.
Collapse
|
6
|
Global landscape and genetic regulation of RNA editing in cortical samples from individuals with schizophrenia. Nat Neurosci 2019; 22:1402-1412. [PMID: 31455887 PMCID: PMC6791127 DOI: 10.1038/s41593-019-0463-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Accepted: 07/09/2019] [Indexed: 12/28/2022]
Abstract
RNA editing critically regulates neurodevelopment and normal neuronal function. The global landscape of RNA editing was surveyed across 364 schizophrenia cases and 383 control postmortem brain samples from the CommonMind Consortium, comprising two regions: dorsolateral prefrontal cortex and anterior cingulate cortex. In schizophrenia, RNA editing sites in genes encoding AMPA-type glutamate receptors and postsynaptic density proteins were less edited, whereas those encoding translation initiation machinery were edited more. These sites replicate between brain regions, map to 3'-untranslated regions and intronic regions, share common sequence motifs and overlap with binding sites for RNA-binding proteins crucial for neurodevelopment. These findings cross-validate in hundreds of non-overlapping dorsolateral prefrontal cortex samples. Furthermore, ~30% of RNA editing sites associate with cis-regulatory variants (editing quantitative trait loci or edQTLs). Fine-mapping edQTLs with schizophrenia risk loci revealed co-localization of eleven edQTLs with six loci. The findings demonstrate widespread altered RNA editing in schizophrenia and its genetic regulation, and suggest a causal and mechanistic role of RNA editing in schizophrenia neuropathology.
Collapse
|
7
|
Abstract
How gene expression correlates with schizophrenia across individuals is beginning to be examined through analyses of RNA-seq from postmortem brains of individuals with disease and control brains. Here we focus on variation in allele-specific expression, following up on the CommonMind Consortium (CMC) RNA-seq experiments of nearly 600 human dorsolateral prefrontal cortex (DLPFC) samples. Analyzing the extent of allelic expression bias-a hallmark of imprinting-we find that the number of imprinted human genes is consistent with lower estimates (≈0.5% of all genes), and thus contradicts much higher estimates. Moreover, the handful of putatively imprinted genes are all in close genomic proximity to known imprinted genes. Joint analysis of the imprinted genes across hundreds of individuals allowed us to establish how allelic bias depends on various factors. We find that age and genetic ancestry have gene-specific, differential effect on allelic bias. In contrast, allelic bias appears to be independent of schizophrenia.
Collapse
|
8
|
Mapping and characterizing N6-methyladenine in eukaryotic genomes using single-molecule real-time sequencing. Genome Res 2018; 28:1067-1078. [PMID: 29764913 PMCID: PMC6028124 DOI: 10.1101/gr.231068.117] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2017] [Accepted: 05/01/2018] [Indexed: 01/31/2023]
Abstract
N6-Methyladenine (m6dA) has been discovered as a novel form of DNA methylation prevalent in eukaryotes; however, methods for high-resolution mapping of m6dA events are still lacking. Single-molecule real-time (SMRT) sequencing has enabled the detection of m6dA events at single-nucleotide resolution in prokaryotic genomes, but its application to detecting m6dA in eukaryotic genomes has not been rigorously examined. Herein, we identified unique characteristics of eukaryotic m6dA methylomes that fundamentally differ from those of prokaryotes. Based on these differences, we describe the first approach for mapping m6dA events using SMRT sequencing specifically designed for the study of eukaryotic genomes and provide appropriate strategies for designing experiments and carrying out sequencing in future studies. We apply the novel approach to study two eukaryotic genomes. For green algae, we construct the first complete genome-wide map of m6dA at single-nucleotide and single-molecule resolution. For human lymphoblastoid cells (hLCLs), it was necessary to integrate SMRT sequencing data with independent sequencing data. The joint analyses suggest putative m6dA events are enriched in the promoters of young full-length LINE-1 elements (L1s), but call for validation by additional methods. These analyses demonstrate a general method for rigorous mapping and characterization of m6dA events in eukaryotic genomes.
Collapse
|
9
|
Cocaine-Induced Chromatin Modifications Associate With Increased Expression and Three-Dimensional Looping of Auts2. Biol Psychiatry 2017; 82:794-805. [PMID: 28577753 PMCID: PMC5671915 DOI: 10.1016/j.biopsych.2017.04.013] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Revised: 04/03/2017] [Accepted: 04/04/2017] [Indexed: 12/29/2022]
Abstract
BACKGROUND Exposure to drugs of abuse alters the epigenetic landscape of the brain's reward regions, such as the nucleus accumbens. We investigated how combinations of chromatin modifications affect genes that regulate responses to cocaine. We focused on Auts2, a gene linked to human evolution and cognitive disorders, which displays strong clustering of cocaine-induced chromatin modifications in this brain region. METHODS We combined chromosome conformation capture, circularized chromosome conformation capture, and related approaches with behavioral paradigms relevant to cocaine phenotypes. Cell type-specific functions were assessed by fluorescence-activated cell sorting and viral-mediated overexpression in Cre-dependent mouse lines. RESULTS We observed that Auts2 gene expression is increased by repeated cocaine administration specifically in D2-type medium spiny neurons in the nucleus accumbens, an effect seen in male but not female mice. Auts2 messenger RNA expression was also upregulated postmortem in the nucleus accumbens of male human cocaine addicts. We obtained evidence that chromosomal looping, bypassing 1524 kb of linear genome, connects Auts2 to the Caln1 gene locus under baseline conditions. This looping was disrupted after repeated cocaine exposure, resulting in increased expression of both genes in D2-type medium spiny neurons. Cocaine exposure reduces binding of CCCTC-binding factor, a chromosomal scaffolding protein, and increases histone and DNA methylation at the Auts-Caln1 loop base in the nucleus accumbens. Cell type-specific overexpression of Auts2 or Caln1 in D2-type medium spiny neurons demonstrated that both genes promote cocaine reward. CONCLUSIONS These findings suggest that cocaine-induced alterations of neuronal three-dimensional genome organization destabilize higher order chromatin at specific loci that regulate responses to the drug.
Collapse
|
10
|
Abstract
Monoallelic expression not due to cis-regulatory sequence polymorphism poses an intriguing problem in epigenetics because it requires the unequal treatment of two segments of DNA that are present in the same nucleus and that can indeed have absolutely identical sequences. Here, I focus on a few recent developments in the field of monoallelic expression that are of particular interest and raise interesting questions for future work. One development is regarding analyses of imprinted genes, in which recent work suggests the possibility that intriguing networks of imprinted genes exist and are important for genetic and physiological studies. Another issue that has been raised in recent years by a number of publications is the question of how skewed allelic expression should be for it to be designated as monoallelic expression and, further, what methods are appropriate or inappropriate for analyzing genomic data to examine allele-specific expression. Perhaps the most exciting recent development in mammalian monoallelic expression is a clever and carefully executed analysis of genetic diversity of autosomal genes subject to random monoallelic expression (RMAE), which provides compelling evidence for distinct evolutionary forces acting on random monoallelically expressed genes.
Collapse
|
11
|
Intersection of diverse neuronal genomes and neuropsychiatric disease: The Brain Somatic Mosaicism Network. Science 2017; 356:356/6336/eaal1641. [PMID: 28450582 DOI: 10.1126/science.aal1641] [Citation(s) in RCA: 167] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Neuropsychiatric disorders have a complex genetic architecture. Human genetic population-based studies have identified numerous heritable sequence and structural genomic variants associated with susceptibility to neuropsychiatric disease. However, these germline variants do not fully account for disease risk. During brain development, progenitor cells undergo billions of cell divisions to generate the ~80 billion neurons in the brain. The failure to accurately repair DNA damage arising during replication, transcription, and cellular metabolism amid this dramatic cellular expansion can lead to somatic mutations. Somatic mutations that alter subsets of neuronal transcriptomes and proteomes can, in turn, affect cell proliferation and survival and lead to neurodevelopmental disorders. The long life span of individual neurons and the direct relationship between neural circuits and behavior suggest that somatic mutations in small populations of neurons can significantly affect individual neurodevelopment. The Brain Somatic Mosaicism Network has been founded to study somatic mosaicism both in neurotypical human brains and in the context of complex neuropsychiatric disorders.
Collapse
|
12
|
Single molecule-level detection and long read-based phasing of epigenetic variations in bacterial methylomes. Nat Commun 2015; 6:7438. [PMID: 26074426 PMCID: PMC4490391 DOI: 10.1038/ncomms8438] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Accepted: 05/08/2015] [Indexed: 12/22/2022] Open
Abstract
Beyond its role in host defense, bacterial DNA methylation also plays important roles in the regulation of gene expression, virulence and antibiotic resistance. Bacterial cells in a clonal population can generate epigenetic heterogeneity to increase population-level phenotypic plasticity. Single molecule, real-time (SMRT) sequencing enables the detection of N6-methyladenine and N4-methylcytosine, two major types of DNA modifications comprising the bacterial methylome. However, existing SMRT sequencing-based methods for studying bacterial methylomes rely on a population-level consensus that lacks the single-cell resolution required to observe epigenetic heterogeneity. Here, we present SMALR (single-molecule modification analysis of long reads), a novel framework for single molecule-level detection and phasing of DNA methylation. Using seven bacterial strains, we show that SMALR yields significantly improved resolution and reveals distinct types of epigenetic heterogeneity. SMALR is a powerful new tool that enables de novo detection of epigenetic heterogeneity and empowers investigation of its functions in bacterial populations. Bacterial DNA methylation is involved in many processes, from host defense to antibiotic resistance, however current methods for examining methylated genomes lack single-cell resolution. Here Beaulaurier et al. present Single Molecule Modification Analysis of Long Reads, a new tool for de novo detection of epigenetic heterogeneity.
Collapse
|
13
|
Redefining the MED13L syndrome. Eur J Hum Genet 2015; 23:1308-17. [PMID: 25758992 DOI: 10.1038/ejhg.2015.26] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Revised: 12/19/2014] [Accepted: 01/06/2015] [Indexed: 11/09/2022] Open
Abstract
Congenital cardiac and neurodevelopmental deficits have been recently linked to the mediator complex subunit 13-like protein MED13L, a subunit of the CDK8-associated mediator complex that functions in transcriptional regulation through DNA-binding transcription factors and RNA polymerase II. Heterozygous MED13L variants cause transposition of the great arteries and intellectual disability (ID). Here, we report eight patients with predominantly novel MED13L variants who lack such complex congenital heart malformations. Rather, they depict a syndromic form of ID characterized by facial dysmorphism, ID, speech impairment, motor developmental delay with muscular hypotonia and behavioral difficulties. We thereby define a novel syndrome and significantly broaden the clinical spectrum associated with MED13L variants. A prominent feature of the MED13L neurocognitive presentation is profound language impairment, often in combination with articulatory deficits.
Collapse
|
14
|
(Epi)Genetic analyses of age-related macular degeneration: case-control and discordant twin studies. Hum Hered 2014; 78:59-72. [PMID: 25033836 DOI: 10.1159/000362814] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2013] [Accepted: 04/12/2014] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND/AIMS Phenotypic discordance in monozygotic (MZ) twin pairs can have an epigenetic or genetic basis. Although age-related macular degeneration (AMD) has a strong genetic component, few studies have addressed its epigenetic basis. METHODS Using SNP arrays, we evaluated differences in copy number variation (CNV) and allele-specific methylation (ASM) patterns (via methyl-sensitive restriction enzyme digestion of DNA) in MZ twin pairs from the US Twin Study of AMD. Further analyses examined the relationship between ASM and CNVs with AMD by both case/control analysis of ASM at candidate regions and by analysis of ASM and CNVs in twins discordant for AMD. RESULTS The frequency of ASM sites differs between cases and controls in regions surrounding the AMD candidate genes CFH, C2 and CFB. While ASM patterns show a substantial dependence on local sequence polymorphisms, we observed dissimilar patterns of ASM between MZ twins. The genes closest to the sites where discordant MZ twins have dissimilar patterns of ASM are enriched for genes implicated in gliosis, a process associated with neovascular AMD. Similar twin-based analyses revealed no AMD-associated CNVs. CONCLUSIONS Our results provide evidence of epigenetic influences beyond the known genetic susceptibility and implicate inflammatory responses and gliosis in the etiology of AMD.
Collapse
|
15
|
Allele-specific methylation occurs at genetic variants associated with complex disease. PLoS One 2014; 9:e98464. [PMID: 24911414 PMCID: PMC4049588 DOI: 10.1371/journal.pone.0098464] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Accepted: 05/03/2014] [Indexed: 12/17/2022] Open
Abstract
We hypothesize that the phenomenon of allele-specific methylation (ASM) may underlie the phenotypic effects of multiple variants identified by Genome-Wide Association studies (GWAS). We evaluate ASM in a human population and document its genome-wide patterns in an initial screen at up to 380,678 sites within the genome, or up to 5% of the total genomic CpGs. We show that while substantial inter-individual variation exists, 5% of assessed sites show evidence of ASM in at least six samples; the majority of these events (81%) are under genetic influence. Many of these cis-regulated ASM variants are also eQTLs in peripheral blood mononuclear cells and monocytes and/or in high linkage-disequilibrium with variants linked to complex disease. Finally, focusing on autoimmune phenotypes, we extend this initial screen to confirm the association of cis-regulated ASM with multiple complex disease-associated variants in an independent population using next-generation bisulfite sequencing. These four variants are implicated in complex phenotypes such as ulcerative colitis and AIDS progression disease (rs10491434), Celiac disease (rs2762051), Crohn's disease, IgA nephropathy and early-onset inflammatory bowel disease (rs713875) and height (rs6569648). Our results suggest cis-regulated ASM may provide a mechanistic link between the non-coding genetic changes and phenotypic variation observed in these diseases and further suggests a route to integrating DNA methylation status with GWAS results.
Collapse
|
16
|
|
17
|
Abstract
Monoallelic expression poses an intriguing problem in epigenetics because it requires the unequal treatment of two segments of DNA that are present in the same nucleus and which can have absolutely identical sequences. This review will consider different known types of monoallelic expression. For all monoallelically expressed genes, their respective allele-specific patterns of expression have the potential to affect brain function and dysfunction.
Collapse
|
18
|
Genome-wide mapping of methylated adenine residues in pathogenic Escherichia coli using single-molecule real-time sequencing. Nat Biotechnol 2012; 30:1232-9. [PMID: 23138224 DOI: 10.1038/nbt.2432] [Citation(s) in RCA: 247] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2012] [Accepted: 10/22/2012] [Indexed: 11/09/2022]
Abstract
Single-molecule real-time (SMRT) DNA sequencing allows the systematic detection of chemical modifications such as methylation but has not previously been applied on a genome-wide scale. We used this approach to detect 49,311 putative 6-methyladenine (m6A) residues and 1,407 putative 5-methylcytosine (m5C) residues in the genome of a pathogenic Escherichia coli strain. We obtained strand-specific information for methylation sites and a quantitative assessment of the frequency of methylation at each modified position. We deduced the sequence motifs recognized by the methyltransferase enzymes present in this strain without prior knowledge of their specificity. Furthermore, we found that deletion of a phage-encoded methyltransferase-endonuclease (restriction-modification; RM) system induced global transcriptional changes and led to gene amplification, suggesting that the role of RM systems extends beyond protecting host genomes from foreign DNA.
Collapse
|
19
|
Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases. Genome Res 2012; 23:129-41. [PMID: 23093720 PMCID: PMC3530673 DOI: 10.1101/gr.136739.111] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types.
Collapse
|
20
|
Autosomal monoallelic expression in the mouse. Genome Biol 2012; 13:R10. [PMID: 22348269 PMCID: PMC3334567 DOI: 10.1186/gb-2012-13-2-r10] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Revised: 02/10/2012] [Accepted: 02/20/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Random monoallelic expression defines an unusual class of genes displaying random choice for expression between the maternal and paternal alleles. Once established, the allele-specific expression pattern is stably maintained and mitotically inherited. Examples of random monoallelic genes include those found on the X-chromosome and a subset of autosomal genes, which have been most extensively studied in humans. Here, we report a genome-wide analysis of random monoallelic expression in the mouse. We used high density mouse genome polymorphism mapping arrays to assess allele-specific expression in clonal cell lines derived from heterozygous mouse strains. RESULTS Over 1,300 autosomal genes were assessed for allele-specific expression, and greater than 10% of them showed random monoallelic expression. When comparing mouse and human, the number of autosomal orthologs demonstrating random monoallelic expression in both organisms was greater than would be expected by chance. Random monoallelic expression on the mouse autosomes is broadly similar to that in human cells: it is widespread throughout the genome, lacks chromosome-wide coordination, and varies between cell types. However, for some mouse genes, there appears to be skewing, in some ways resembling skewed X-inactivation, wherein one allele is more frequently active. CONCLUSIONS These data suggest that autosomal random monoallelic expression was present at least as far back as the last common ancestor of rodents and primates. Random monoallelic expression can lead to phenotypic variation beyond the phenotypic variation dictated by genotypic variation. Thus, it is important to take into account random monoallelic expression when examining genotype-phenotype correlation.
Collapse
|
21
|
|
22
|
Extensive sequence-influenced DNA methylation polymorphism in the human genome. Epigenetics Chromatin 2010; 3:11. [PMID: 20497546 PMCID: PMC2893533 DOI: 10.1186/1756-8935-3-11] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2010] [Accepted: 05/24/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Epigenetic polymorphisms are a potential source of human diversity, but their frequency and relationship to genetic polymorphisms are unclear. DNA methylation, an epigenetic mark that is a covalent modification of the DNA itself, plays an important role in the regulation of gene expression. Most studies of DNA methylation in mammalian cells have focused on CpG methylation present in CpG islands (areas of concentrated CpGs often found near promoters), but there are also interesting patterns of CpG methylation found outside of CpG islands. RESULTS We compared DNA methylation patterns on both alleles between many pairs (and larger groups) of related and unrelated individuals. Direct observation and simulation experiments revealed that around 10% of common single nucleotide polymorphisms (SNPs) reside in regions with differences in the propensity for local DNA methylation between the two alleles. We further showed that for the most common form of SNP, a polymorphism at a CpG dinucleotide, the presence of the CpG at the SNP positively affected local DNA methylation in cis. CONCLUSIONS Taken together with the known effect of DNA methylation on mutation rate, our results suggest an interesting interdependence between genetics and epigenetics underlying diversity in the human genome.
Collapse
|
23
|
An architectural role for a nuclear noncoding RNA: NEAT1 RNA is essential for the structure of paraspeckles. Mol Cell 2009; 33:717-26. [PMID: 19217333 DOI: 10.1016/j.molcel.2009.01.026] [Citation(s) in RCA: 1077] [Impact Index Per Article: 71.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2008] [Revised: 10/22/2008] [Accepted: 01/29/2009] [Indexed: 01/16/2023]
Abstract
NEAT1 RNA, a highly abundant 4 kb ncRNA, is retained in nuclei in approximately 10 to 20 large foci that we show are completely coincident with paraspeckles, nuclear domains implicated in mRNA nuclear retention. Depletion of NEAT1 RNA via RNAi eradicates paraspeckles, suggesting that it controls sequestration of the paraspeckle proteins PSP1 and p54, factors linked to A-I editing. Unlike overexpression of PSP1, NEAT1 overexpression increases paraspeckle number, and paraspeckles emanate exclusively from the NEAT1 transcription site. The PSP-1 RNA binding domain is required for its colocalization with NEAT1 RNA in paraspeckles, and biochemical analyses support that NEAT1 RNA binds with paraspeckle proteins. Unlike other nuclear-retained RNAs, NEAT1 RNA is not A-I edited, consistent with a structural role in paraspeckles. Collectively, results demonstrate that NEAT1 functions as an essential structural determinant of paraspeckles, providing a precedent for a ncRNA as the foundation of a nuclear domain.
Collapse
|
24
|
Abstract
Monoallelic expression with random choice between the maternal and paternal alleles defines an unusual class of genes comprising X-inactivated genes and a few autosomal gene families. Using a genome-wide approach, we assessed allele-specific transcription of about 4000 human genes in clonal cell lines and found that more than 300 were subject to random monoallelic expression. For a majority of monoallelic genes, we also observed some clonal lines displaying biallelic expression. Clonal cell lines reflect an independent choice to express the maternal, the paternal, or both alleles for each of these genes. This can lead to differences in expressed protein sequence and to differences in levels of gene expression. Unexpectedly widespread monoallelic expression suggests a mechanism that generates diversity in individual cells and their clonal descendants.
Collapse
|
25
|
Abstract
Differential DNA methylation is important for the epigenetic regulation of gene expression. Allele-specific methylation of the inactive X chromosome has been demonstrated at promoter CpG islands, but the overall pattern of methylation on the active X(Xa) and inactive X (Xi) chromosomes is unknown. We performed allele-specific analysis of more than 1000 informative loci along the human X chromosome. The Xa displays more than two times as much allele-specific methylation as Xi. This methylation is concentrated at gene bodies, affecting multiple neighboring CpGs. Before X inactivation, all of these Xa gene body-methylated sites are biallelically methylated. Thus, a bipartite methylation-demethylation program results in Xa-specific hypomethylation at gene promoters and hypermethylation at gene bodies. These results suggest a relationship between global methylation and expression potentiality.
Collapse
|
26
|
A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains. BMC Genomics 2007; 8:39. [PMID: 17270048 PMCID: PMC1800850 DOI: 10.1186/1471-2164-8-39] [Citation(s) in RCA: 739] [Impact Index Per Article: 43.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2006] [Accepted: 02/01/2007] [Indexed: 01/03/2023] Open
Abstract
Background Noncoding RNA species play a diverse set of roles in the eukaryotic cell. While much recent attention has focused on smaller RNA species, larger noncoding transcripts are also thought to be highly abundant in mammalian cells. To search for large noncoding RNAs that might control gene expression or mRNA metabolism, we used Affymetrix expression arrays to identify polyadenylated RNA transcripts displaying nuclear enrichment. Results This screen identified no more than three transcripts; XIST, and two unique noncoding nuclear enriched abundant transcripts (NEAT) RNAs strikingly located less than 70 kb apart on human chromosome 11: NEAT1, a noncoding RNA from the locus encoding for TncRNA, and NEAT2 (also known as MALAT-1). While the two NEAT transcripts share no significant homology with each other, each is conserved within the mammalian lineage, suggesting significant function for these noncoding RNAs. NEAT2 is extraordinarily well conserved for a noncoding RNA, more so than even XIST. Bioinformatic analyses of publicly available mouse transcriptome data support our findings from human cells as they confirm that the murine homologs of these noncoding RNAs are also nuclear enriched. RNA FISH analyses suggest that these noncoding RNAs function in mRNA metabolism as they demonstrate an intimate association of these RNA species with SC35 nuclear speckles in both human and mouse cells. These studies show that one of these transcripts, NEAT1 localizes to the periphery of such domains, whereas the neighboring transcript, NEAT2, is part of the long-sought polyadenylated component of nuclear speckles. Conclusion Our genome-wide screens in two mammalian species reveal no more than three abundant large non-coding polyadenylated RNAs in the nucleus; the canonical large noncoding RNA XIST and NEAT1 and NEAT2. The function of these noncoding RNAs in mRNA metabolism is suggested by their high levels of conservation and their intimate association with SC35 splicing domains in multiple mammalian species.
Collapse
|
27
|
A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains. BMC Genomics 2007; 8:39. [PMID: 17270048 DOI: 10.1186/1471-1-2164-8-39] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2006] [Accepted: 02/01/2007] [Indexed: 05/26/2023] Open
Abstract
BACKGROUND Noncoding RNA species play a diverse set of roles in the eukaryotic cell. While much recent attention has focused on smaller RNA species, larger noncoding transcripts are also thought to be highly abundant in mammalian cells. To search for large noncoding RNAs that might control gene expression or mRNA metabolism, we used Affymetrix expression arrays to identify polyadenylated RNA transcripts displaying nuclear enrichment. RESULTS This screen identified no more than three transcripts; XIST, and two unique noncoding nuclear enriched abundant transcripts (NEAT) RNAs strikingly located less than 70 kb apart on human chromosome 11: NEAT1, a noncoding RNA from the locus encoding for TncRNA, and NEAT2 (also known as MALAT-1). While the two NEAT transcripts share no significant homology with each other, each is conserved within the mammalian lineage, suggesting significant function for these noncoding RNAs. NEAT2 is extraordinarily well conserved for a noncoding RNA, more so than even XIST. Bioinformatic analyses of publicly available mouse transcriptome data support our findings from human cells as they confirm that the murine homologs of these noncoding RNAs are also nuclear enriched. RNA FISH analyses suggest that these noncoding RNAs function in mRNA metabolism as they demonstrate an intimate association of these RNA species with SC35 nuclear speckles in both human and mouse cells. These studies show that one of these transcripts, NEAT1 localizes to the periphery of such domains, whereas the neighboring transcript, NEAT2, is part of the long-sought polyadenylated component of nuclear speckles. CONCLUSION Our genome-wide screens in two mammalian species reveal no more than three abundant large non-coding polyadenylated RNAs in the nucleus; the canonical large noncoding RNA XIST and NEAT1 and NEAT2. The function of these noncoding RNAs in mRNA metabolism is suggested by their high levels of conservation and their intimate association with SC35 splicing domains in multiple mammalian species.
Collapse
|
28
|
Abstract
Drosophila N-cadherin (CadN) is an evolutionarily conserved classic cadherin which has a large, complex extracellular domain and a catenin-binding cytoplasmic domain. The CadN locus contains three modules of alternative exons (7a/b, 13a/b, and 18a/b) and undergoes alternative splicing to generate multiple isoforms. Using quantitative transcript analyses and green fluorescent protein-based cell sorting, we found that during development CadN alternative splicing is regulated in a temporal but not cell-type-specific fashion. In particular, exon 18b is predominantly expressed during early developmental stages, while exon 18a is prevalent at the late developmental and adult stages. All CadN isoforms share the same molecular architecture but have different sequences in their extracellular and transmembrane domains, suggesting functional diversity. In vitro quantitative cell aggregation assays revealed that all CadN isoforms mediate homophilic interactions, but the isoforms encoded by exon 18b have a higher adhesive activity than those by its alternative, 18a. Domain-swapping experiments further revealed that the different sequences in the transmembrane domains of isoforms are responsible for their differential adhesive activities. CadN alternative splicing might provide a novel mechanism to fine-tune its adhesive activity at different developmental stages or to restrict the use of high-affinity 18b-type isoforms at the adult stage.
Collapse
|
29
|
A deletion at the mouse Xist gene exposes trans-effects that alter the heterochromatin of the inactive X chromosome and the replication time and DNA stability of both X chromosomes. Genetics 2006; 174:1115-33. [PMID: 16980402 PMCID: PMC1667074 DOI: 10.1534/genetics.105.051375] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
The inactive X chromosome of female mammals displays several properties of heterochromatin including late replication, histone H4 hypoacetylation, histone H3 hypomethylation at lysine-4, and methylated CpG islands. We show that cre-Lox-mediated excision of 21 kb from both Xist alleles in female mouse fibroblasts led to the appearance of two histone modifications throughout the inactive X chromosome usually associated with euchromatin: histone H4 acetylation and histone H3 lysine-4 methylation. Despite these euchromatic properties, the inactive X chromosome was replicated even later in S phase than in wild-type female cells. Homozygosity for the deletion also caused regions of the active X chromosome that are associated with very high concentrations of LINE-1 elements to be replicated very late in S phase. Extreme late replication is a property of fragile sites and the 21-kb deletions destabilized the DNA of both X chromosomes, leading to deletions and translocations. This was accompanied by the phosphorylation of p53 at serine-15, an event that occurs in response to DNA damage, and the accumulation of gamma-H2AX, a histone involved in DNA repair, on the X chromosome. The Xist locus therefore maintains the DNA stability of both X chromosomes.
Collapse
|
30
|
An epigenetic state associated with areas of gene duplication. Genes Dev 2006; 16:723-9. [PMID: 16687731 PMCID: PMC1473183 DOI: 10.1101/gr.5023706] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2005] [Accepted: 04/06/2006] [Indexed: 11/24/2022]
Abstract
Asynchronous DNA replication is an epigenetically determined feature found in all cases of monoallelic expression, including genomic imprinting, X-inactivation, and random monoallelic expression of autosomal genes such as immunoglobulins and olfactory receptor genes. Most genes of the latter class were identified in experiments focused on genes functioning in the chemosensory and immune systems. We performed an unbiased survey of asynchronous replication in the mouse genome, excluding known asynchronously replicated genes. Fully 10% (eight of 80) of the genes tested exhibited asynchronous replication. A common feature of the newly identified asynchronously replicated areas is their proximity to areas of tandem gene duplication. Testing of other clustered areas supported the idea that such regions are enriched with asynchronously replicated genes.
Collapse
|
31
|
Dscam-mediated self- versus non-self-recognition by individual neurons. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2005; 69:485-8. [PMID: 16117684 DOI: 10.1101/sqb.2004.69.485] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
32
|
|
33
|
Monoallelic Expression and Asynchronous Replication of p120 Catenin in Mouse and Human Cells. J Biol Chem 2005; 280:1354-9. [PMID: 15522875 DOI: 10.1074/jbc.m411283200] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The number of autosomal mammalian genes subject to random monoallelic expression has been limited to genes highly specific to the function of chemosensory neurons or lymphocytes, making this phenomenon difficult to address systematically. Here we demonstrate that asynchronous DNA replication can be used as a marker for the identification of novel genes with monoallelic expression and identify p120 catenin, a gene involved in cell adhesion, as belonging to this class. p120 is widely expressed; its presence in available cell lines allowed us to address quantitative aspects of monoallelic expression. We show that the epigenetic choice of active allele is clonally stable and that biallelic clones express p120 at twice the level of monoallelic clones. Unlike previous reports about genes of this type, we found that expression of p120 can be monoallelic in one cell type and strictly biallelic in another. We show that in human lymphoblasts, the silencing of one allele is incomplete. These unexpected properties are likely to be wide-spread, as we show that the Tlr4 gene shares them. Identification of monoallelic expression of a nearly ubiquitous gene indicates that this type of gene regulation is more common than previously thought. This has important implications for carcinogenesis and definition of cell identity.
Collapse
|
34
|
Analysis of Dscam diversity in regulating axon guidance in Drosophila mushroom bodies. Neuron 2004; 43:673-86. [PMID: 15339649 DOI: 10.1016/j.neuron.2004.07.020] [Citation(s) in RCA: 151] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2004] [Revised: 05/21/2004] [Accepted: 06/18/2004] [Indexed: 10/25/2022]
Abstract
Dscam is an immunoglobulin (Ig) superfamily member that regulates axon guidance and targeting in Drosophila. Alternative splicing potentially generates 38,016 isoforms differing in their extracellular Ig and transmembrane domains. We demonstrate that Dscam mediates the sorting of axons in the developing mushroom body (MB). This correlates with the precise spatiotemporal pattern of Dscam protein expression. We demonstrate that MB neurons express different arrays of Dscam isoforms and that single MB neurons express multiple isoforms. Two different Dscam isoforms differing in their extracellular domains introduced as transgenes into single mutant cells partially rescued the mutant phenotype. Expression of one isoform of Dscam in a cohort of MB neurons induced dominant phenotypes, while expression of a single isoform in a single cell did not. We propose that different extracellular domains of Dscam share a common function and that differences in isoforms expressed on the surface of neighboring axons influence interactions between them.
Collapse
|
35
|
Abstract
Members of the Ly49 gene family of natural killer (NK) cell receptors in mice are expressed in seemingly stochastic combinations such that each NK cell expresses a handful of family members. A transcriptional switch appears to establish this interesting pattern of expression.
Collapse
|
36
|
Selective pressures on the olfactory receptor repertoire since the human-chimpanzee divergence. Proc Natl Acad Sci U S A 2004; 101:9019-22. [PMID: 15178761 PMCID: PMC428465 DOI: 10.1073/pnas.0401566101] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The availability of the sequence of the chimpanzee genome provides an opportunity to examine human genes and their chimpanzee orthologs and to analyze selective pressures that have been shaping the olfactory receptor repertoire since the human-chimpanzee divergence. We determined the ratio of nonsynonymous to synonymous changes for each of 186 orthologous pairs and then examined how the distribution of these ratios compares with the distribution expected under neutral drift. Consistent with the diminishing importance of olfaction for these species, we find no evidence for positive selection and we find evidence of weak purifying selection affecting over half of the repertoire.
Collapse
|
37
|
Mice cloned from olfactory sensory neurons. Nature 2004; 428:44-9. [PMID: 14990966 DOI: 10.1038/nature02375] [Citation(s) in RCA: 298] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2003] [Accepted: 01/14/2004] [Indexed: 11/08/2022]
Abstract
Cloning by nuclear transplantation has been successfully carried out in various mammals, including mice. Until now mice have not been cloned from post-mitotic cells such as neurons. Here, we have generated fertile mouse clones derived by transferring the nuclei of post-mitotic, olfactory sensory neurons into oocytes. These results indicate that the genome of a post-mitotic, terminally differentiated neuron can re-enter the cell cycle and be reprogrammed to a state of totipotency after nuclear transfer. Moreover, the pattern of odorant receptor gene expression and the organization of odorant receptor genes in cloned mice was indistinguishable from wild-type animals, indicating that irreversible changes to the DNA of olfactory neurons do not accompany receptor gene choice.
Collapse
|
38
|
Stochastic yet biased expression of multiple Dscam splice variants by individual cells. Nat Genet 2004; 36:240-6. [PMID: 14758360 DOI: 10.1038/ng1299] [Citation(s) in RCA: 191] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2003] [Accepted: 01/02/2004] [Indexed: 11/08/2022]
Abstract
The Drosophila melanogaster gene Dscam is essential for axon guidance and has 38,016 possible alternative splice forms. This diversity can potentially be used to distinguish cells. We analyzed the Dscam mRNA isoforms expressed by different cell types and individual cells. The choice of splice variants expressed is regulated both spatially and temporally. Different subtypes of photoreceptors express broad yet distinctive spectra of Dscam isoforms. Single-cell RT-PCR documented that individual cells express several different Dscam isoforms and allowed an estimation of the diversity that is present. For example, we estimate that each R3/R4 photoreceptor cell expresses 14-50 distinct mRNAs chosen from the spectrum of thousands of splice variants distinctive of its cell type. Thus, the Dscam repertoire of each cell is different from those of its neighbors, providing a potential mechanism for generating unique cell identity in the nervous system and elsewhere.
Collapse
|
39
|
Coordinated replication timing of monoallelically expressed genes along human autosomes. Hum Mol Genet 2004; 13:651-8. [PMID: 14734625 DOI: 10.1093/hmg/ddh062] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A number of genes in the mammalian genome are expressed from only one of two alleles in either an imprinted or random manner. Those belonging to the random class include X-linked genes subject to X inactivation, as well as a number of autosomal genes, including odorant receptors, immunoglobulins, T-cell receptors, interleukins, natural killer-cell receptors and pheromone receptors. Monoallelically expressed genes display the unusual property of asynchronous replication and for those genes whose transcription is randomly monoallelic, the asynchronous replication is also random. In mice, recent work has shown that the random asynchronous replication of distributed autosomal genes is coordinated at the whole chromosome level, indicative of chromosome-pair non-equivalence. Here, we show that autosome-pair non-equivalence is present in human cells, and demonstrate its ability to cross the centromere. Additionally, by examining the replication of these genes in a number of human trisomies, we consistently find one allele replicating early and the other two alleles replicating late, similar to previous observations in X trisomies.
Collapse
|
40
|
Abstract
Rearrangement of antigen receptor genes generates a vast array of antigen receptors on lymphocytes. The establishment of allelic exclusion in immunoglobulin genes requires differential treatment of the two sequence identical alleles. In the case of the kappa immunoglobulin locus, changes in chromatin structure, methylation, and replication timing of the two alleles are all potentially involved in regulating rearrangement. Additionally, germline transcription of the kappa locus which precedes rearrangement has been proposed to reflect an opening of the chromatin structure rendering it available for rearrangement. As the initial restriction of rearrangement to one allele is critical to the establishment of allelic exclusion, a key question is whether or not germline transcription at the kappa locus is monoallelic or biallelic. We have used a sensitive reverse transcription-polymerase chain reaction (RT-PCR) assay and an RNA-fluorescence in situ hybridization (FISH) to show that germline transcription of the kappa locus is biallelic in wild-type immature B cells and in recombination activating gene (RAG)-/-, mu+ B cells. Therefore, germline transcription is unlikely to dictate which allele will be rearranged first and rather reflects a general opening on both alleles that must be accompanied by a mechanism allowing one of the two alleles to be rearranged first.
Collapse
|
41
|
Coordination of the random asynchronous replication of autosomal loci. Nat Genet 2003; 33:339-41. [PMID: 12577058 DOI: 10.1038/ng1102] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2002] [Accepted: 01/19/2003] [Indexed: 11/08/2022]
Abstract
Random monoallelic expression and asynchronous replication define an unusual class of autosomal mammalian genes. We show that every cell has randomly chosen either the maternal or paternal copy of each given autosome pair, such that alleles of these genes scattered across the chosen chromosome replicate earlier than the alleles on the homologous chromosome. Thus, chromosome-pair non-equivalence, rather than being limited to X-chromosome inactivation, is a fundamental property of mouse chromosomes.
Collapse
|
42
|
Abstract
Thousands of neurons expressing a given mouse odorant receptor project to a few glomeruli in the olfactory bulb. New observations on mice expressing small odorant receptor transgenes provide support for the idea that interdependence is involved in the maturation of this remarkable convergence.
Collapse
|
43
|
Abstract
The development of mature B cells involves a series of molecular decisions which culminate in the expression of a single light-chain and heavy-chain antigen receptor on the cell surface. There are two alleles for each receptor locus, so the ultimate choice of one receptor type must involve a process of allelic exclusion. One way to do this is with a feedback mechanism that downregulates rearrangement after the generation of a productive receptor molecule, but recent work suggests that monoallelic epigenetic changes may also take place even before rearrangement. To better understand the basis for distinguishing between alleles, we have analysed DNA replication timing. Here we show that all of the B-cell-receptor loci (mu, kappa and lambda) and the TCRbeta locus replicate asynchronously. This pattern, which is established randomly in each cell early in development and maintained by cloning, represents an epigenetic mark for allelic exclusion, because it is almost always the early-replicating allele which is initially selected to undergo rearrangement in B cells. These results indicate that allelic exclusion in the immune system may be very similar to the process of X chromosome inactivation.
Collapse
|
44
|
Convergent projections of Drosophila olfactory neurons to specific glomeruli in the antennal lobe. Nat Neurosci 2000; 3:780-5. [PMID: 10903570 DOI: 10.1038/77680] [Citation(s) in RCA: 232] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Candidate Drosophila olfactory receptors (ORs) provide molecular tools to investigate how the organization of the Drosophila olfactory system determines the coding of olfactory stimuli. Neurons in the third antennal segment and maxillary palp appear to express different ORs. Individual olfactory neurons send axonal projections to glomeruli in the antennal lobe. Using transgenic flies, we provide evidence that the neurons expressing a given OR gene, which have cell bodies distributed among neurons expressing other ORs, converge in their projections to topographically fixed glomeruli in the antennal lobe. This convergence allows for the formation of an odotopic map in the antennal lobe whose organization could provide a basis for olfactory discrimination in Drosophila.
Collapse
|
45
|
Allele-specific expression patterns of interleukin-2 and Pax-5 revealed by a sensitive single-cell RT-PCR analysis. Curr Biol 2000; 10:789-92. [PMID: 10898982 DOI: 10.1016/s0960-9822(00)00565-0] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Autosomal genes that are subject to random allelic inactivation (RAI), like imprinted genes [1] and genes subject to X-inactivation [2], require mechanisms that dictate the differential transcriptional regulation of two sequence-identical alleles. RAI genes include olfactory receptor genes [3], and the various genes encoding antigen receptors on lymphocytes (immunoglobulin genes, T cell receptor genes and NK receptor genes [4] [5] [6] [7]). These observations raise the possibility that other genes might be similarly regulated. Moreover, an interesting possibility is that certain genes might be monoallelically expressed in some cells and biallelically expressed in others. Recently, reports of monoallelic expression of interleukin-2 (IL-2) [8] [9] and IL-4 [10] [11] have raised the possibility that the cytokine gene family may be subject to monoallelic expression. Another report suggests that the gene encoding the transcription factor Pax-5, which is involved in B-cell (and cerebellar) development [12] [13], is also subject to monoallelic expression [14]. Using a novel single-cell reverse transcription-polymerase chain reaction (RT-PCR) approach, we have analyzed the IL-2 and Pax-5 genes in mice. We found that IL-2 is monoallelically transcribed in some T cells and biallelically transcribed in others, raising interesting questions regarding cytokine gene regulation. Additionally, our analyses suggest that Pax-5 is consistently biallelically transcribed. Thus, the IL-2 gene and other cytokine genes may be regulated in a stochastic manner that results in 0, 1 or 2 alleles of a given cytokine gene expressed in each T cell. This type of regulation could account for the wide variety of different combinations of cytokine genes expressed in individual T cells and therefore plays a role in the generation of T cells with a range of different effector functions.
Collapse
|
46
|
Abstract
To make the most of a small number of neurons, the nematode olfactory system includes neurons that are bilaterally symmetrical in morphology, but differ in the sets of genes they express. An intriguing recent example is the axon contact-mediated asymmetry in expression of the str-2 odorant receptor gene.
Collapse
|
47
|
Abstract
In mice, individual olfactory neurons express one of the thousand distinct olfactory receptor genes [1] [2] [3]. Neurons that express a given receptor converge on distinct loci in the olfactory bulb to form structures called glomeruli [4] [5] [6]. The olfactory receptor is involved in an instructive manner in this axonal convergence [6] [7] but little is known about the mechanisms involved in maintaining convergence. We have previously created a transgenic olfactory receptor locus that functions independently of the endogenous loci [8]. Here, we show that, although the projections of neurons expressing this ectopic transgenic olfactory receptor always converge in newborn mice, surprisingly, in adult mice, convergence is not always maintained. Moreover, in adult mice there is a positive correlation between the number of neurons expressing the transgenic receptor and the probability of maintaining convergence. These observations, taken together with the variability observed in wild-type [4] [6] and genetically manipulated mice ([6] and our unpublished observations), suggest that olfactory neurons require the presence of other similar axons to maintain a glomerulus. We call this phenomenon interdependence.
Collapse
|
48
|
Abstract
In the mouse, individual olfactory neurons express one of a thousand distinct olfactory receptor genes. Furthermore, only one allele of the expressed gene is transcribed. This phenomenon, random allelic inactivation, along with the observation that the olfactory receptor genes reside in large chromosomal arrays, suggests a role for long-range gene regulation in olfactory receptor gene choice. We have constructed a 300-kb yeast artificial chromosome (YAC) transgene in which a single receptor gene is marked while maintaining its coding region. This 300-kb piece of DNA functions as an independent olfactory receptor gene locus in directing olfactory receptor gene choice in both the olfactory system and the accessory olfactory system (vomeronasal organ, VNO). Furthermore, the transgene, like endogenous olfactory receptor loci, is subject to allelic inactivation.
Collapse
|
49
|
Abstract
We have taken advantage of the availability of a large amount of Drosophila genomic DNA sequence in the Berkeley Drosophila Genome Project database ( approximately 1/5 of the genome) to identify a family of novel seven transmembrane domain encoding genes that are putative Drosophila olfactory receptors. Members of the family are expressed in distinct subsets of olfactory neurons, and certain family members are restricted to distinct portions of the olfactory system. This pattern of expression has interesting similarities to and differences from the expression patterns observed for olfactory receptors in vertebrates. The Drosophila olfactory system is simpler than mammalian systems, yet it is complex enough to present a fascinating system in which to study neural information processing. Moreover, the powerful genetic manipulations available in Drosophila, when combined with electrophysiological and behavioral analyses, make this an attractive model system in which to study olfactory discrimination.
Collapse
|
50
|
Abstract
Recent studies have shed light on the different relationships between odorant receptor expression and the specification of neural identity in the olfactory systems of vertebrates and invertebrates. In mice, neuronal identity and axon guidance are specified by the single expressed olfactory receptor, whereas in C. elegans, neuronal identity appears to be independent of receptor expression.
Collapse
|