1
|
Brovkina MV, Chapman MA, Holding ML, Clowney EJ. Emergence and influence of sequence bias in evolutionarily malleable, mammalian tandem arrays. BMC Biol 2023; 21:179. [PMID: 37612705 PMCID: PMC10463633 DOI: 10.1186/s12915-023-01673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/01/2023] [Indexed: 08/25/2023] Open
Abstract
BACKGROUND The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms-as diverse as bats, dolphins, and elephants-in only 10-20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these "outward-looking" families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression. RESULTS Here, we show that these tandem arrays of outward-looking genes occupy AT-biased isochores and comprise the "tissue-specific" gene class that lack CpG islands in their promoters. Models of mammalian genome evolution have not incorporated the sharply different functions and transcriptional patterns of genes in AT- versus GC-biased regions. To examine the relationship between gene family expansion, sequence content, and allelic diversity, we use population genetic data and comparative analysis. First, we find that AT bias can emerge during evolutionary expansion of gene families in cis. Second, human genes in AT-biased isochores or with GC-poor promoters experience relatively low rates of de novo point mutation today but are enriched for non-synonymous variants. Finally, we find that isochores containing gene clusters exhibit low rates of recombination. CONCLUSIONS Our analyses suggest that tolerance of non-synonymous variation and low recombination are two forces that have produced the depletion of GC bases in outward-facing gene arrays. In turn, high AT content exerts a profound effect on their chromatin organization and transcriptional regulation.
Collapse
Affiliation(s)
- Margarita V Brovkina
- Graduate Program in Cellular and Molecular Biology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Margaret A Chapman
- Neurosciences Graduate Program, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - E Josephine Clowney
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI, USA.
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
2
|
Characterization of Host-Specific Genes from Pine- and Grass-Associated Species of the Fusarium fujikuroi Species Complex. Pathogens 2022; 11:pathogens11080858. [PMID: 36014979 PMCID: PMC9415769 DOI: 10.3390/pathogens11080858] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 07/20/2022] [Accepted: 07/25/2022] [Indexed: 11/16/2022] Open
Abstract
The Fusarium fujikuroi species complex (FFSC) includes socioeconomically important pathogens that cause disease for numerous crops and synthesize a variety of secondary metabolites that can contaminate feedstocks and food. Here, we used comparative genomics to elucidate processes underlying the ability of pine-associated and grass-associated FFSC species to colonize tissues of their respective plant hosts. We characterized the identity, possible functions, evolutionary origins, and chromosomal positions of the host-range-associated genes encoded by the two groups of fungi. The 72 and 47 genes identified as unique to the respective genome groups were potentially involved in diverse processes, ranging from transcription, regulation, and substrate transport through to virulence/pathogenicity. Most genes arose early during the evolution of Fusarium/FFSC and were only subsequently retained in some lineages, while some had origins outside Fusarium. Although differences in the densities of these genes were especially noticeable on the conditionally dispensable chromosome of F. temperatum (representing the grass-associates) and F. circinatum (representing the pine-associates), the host-range-associated genes tended to be located towards the subtelomeric regions of chromosomes. Taken together, these results demonstrate that multiple mechanisms drive the emergence of genes in the grass- and pine-associated FFSC taxa examined. It also highlighted the diversity of the molecular processes potentially underlying niche-specificity in these and other Fusarium species.
Collapse
|
3
|
Chaux-Jukic F, O'Donnell S, Craig RJ, Eberhard S, Vallon O, Xu Z. Architecture and evolution of subtelomeres in the unicellular green alga Chlamydomonas reinhardtii. Nucleic Acids Res 2021; 49:7571-7587. [PMID: 34165564 PMCID: PMC8287924 DOI: 10.1093/nar/gkab534] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 06/01/2021] [Accepted: 06/08/2021] [Indexed: 02/06/2023] Open
Abstract
In most eukaryotes, subtelomeres are dynamic genomic regions populated by multi-copy sequences of different origins, which can promote segmental duplications and chromosomal rearrangements. However, their repetitive nature has complicated the efforts to sequence them, analyse their structure and infer how they evolved. Here, we use recent genome assemblies of Chlamydomonas reinhardtii based on long-read sequencing to comprehensively describe the subtelomere architecture of the 17 chromosomes of this model unicellular green alga. We identify three main repeated elements present at subtelomeres, which we call Sultan, Subtile and Suber, alongside three chromosome extremities with ribosomal DNA as the only identified component of their subtelomeres. The most common architecture, present in 27 out of 34 subtelomeres, is a heterochromatic array of Sultan elements adjacent to the telomere, followed by a transcribed Spacer sequence, a G-rich microsatellite and transposable elements. Sequence similarity analyses suggest that Sultan elements underwent segmental duplications within each subtelomere and rearranged between subtelomeres at a much lower frequency. Analysis of other green algae reveals species-specific repeated elements that are shared across subtelomeres, with an overall organization similar to C. reinhardtii. This work uncovers the complexity and evolution of subtelomere architecture in green algae.
Collapse
Affiliation(s)
- Frédéric Chaux-Jukic
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Samuel O'Donnell
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Rory J Craig
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, Edinburgh, UK
| | - Stephan Eberhard
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Olivier Vallon
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Zhou Xu
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| |
Collapse
|
4
|
Li B. Keeping Balance Between Genetic Stability and Plasticity at the Telomere and Subtelomere of Trypanosoma brucei. Front Cell Dev Biol 2021; 9:699639. [PMID: 34291053 PMCID: PMC8287324 DOI: 10.3389/fcell.2021.699639] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 06/08/2021] [Indexed: 12/13/2022] Open
Abstract
Telomeres, the nucleoprotein complexes at chromosome ends, are well-known for their essential roles in genome integrity and chromosome stability. Yet, telomeres and subtelomeres are frequently less stable than chromosome internal regions. Many subtelomeric genes are important for responding to environmental cues, and subtelomeric instability can facilitate organismal adaptation to extracellular changes, which is a common theme in a number of microbial pathogens. In this review, I will focus on the delicate and important balance between stability and plasticity at telomeres and subtelomeres of a kinetoplastid parasite, Trypanosoma brucei, which causes human African trypanosomiasis and undergoes antigenic variation to evade the host immune response. I will summarize the current understanding about T. brucei telomere protein complex, the telomeric transcript, and telomeric R-loops, focusing on their roles in maintaining telomere and subtelomere stability and integrity. The similarities and differences in functions and underlying mechanisms of T. brucei telomere factors will be compared with those in human and yeast cells.
Collapse
Affiliation(s)
- Bibo Li
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological, and Environmental Sciences, College of Sciences and Health Professions, Cleveland State University, Cleveland, OH, United States.,Case Comprehensive Cancer Center, Case Western Reserve University, Cleveland, OH, United States.,Department of Inflammation and Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, United States.,Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, OH, United States
| |
Collapse
|
5
|
The Power of Stress: The Telo-Hormesis Hypothesis. Cells 2021; 10:cells10051156. [PMID: 34064566 PMCID: PMC8151059 DOI: 10.3390/cells10051156] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 04/29/2021] [Accepted: 05/06/2021] [Indexed: 02/06/2023] Open
Abstract
Adaptative response to stress is a strategy conserved across evolution to promote survival. In this context, the groundbreaking findings of Miroslav Radman on the adaptative value of changing mutation rates opened new avenues in our understanding of stress response. Inspired by this work, we explore here the putative beneficial effects of changing the ends of eukaryotic chromosomes, the telomeres, in response to stress. We first summarize basic principles in telomere biology and then describe how various types of stress can alter telomere structure and functions. Finally, we discuss the hypothesis of stress-induced telomere signaling with hormetic effects.
Collapse
|
6
|
Xu Z, Green B, Benoit N, Sobel JD, Schatz MC, Wheelan S, Cormack BP. Cell wall protein variation, break-induced replication, and subtelomere dynamics in Candida glabrata. Mol Microbiol 2021; 116:260-276. [PMID: 33713372 DOI: 10.1111/mmi.14707] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 02/12/2021] [Accepted: 02/23/2021] [Indexed: 01/08/2023]
Abstract
Candida glabrata is an opportunistic pathogen of humans, responsible for up to 30% of disseminated candidiasis. Adherence of C. glabrata to host cells is mediated by adhesin-like proteins (ALPs), about half of which are encoded in the subtelomeres. We performed a de novo assembly of two C. glabrata strains, BG2 and BG3993, using long single-molecule real-time (SMRT) reads, and constructed high-quality telomere-to-telomere assemblies of all 13 chromosomes to assess differences between C. glabrata strains. We documented variation between strains, and in agreement with earlier studies, found high (~0.5%-1%) frequencies of SNVs across the genome, including within subtelomeric regions. We documented changes in ALP gene structure and complement: there are large length differences in ALP genes in different strains, resulting from copy number variation in tandem repeats. We compared strains to characterize chromosome rearrangement events including within the poorly characterized subtelomeric regions. We show that rearrangements within the subtelomere regions all affect ALP-encoding genes, and 14/16 involve just the most terminal ALP gene. We present evidence that these rearrangements are mediated by break-induced replication. This study highlights the constrained nature of subtelomeric changes impacting ALP gene complement and subtelomere structure.
Collapse
Affiliation(s)
- Zhuwei Xu
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Brian Green
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD, USA.,AgriMetis, Lutherville, MD, USA
| | - Nicole Benoit
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Jack D Sobel
- Division of Infectious Diseases, Wayne State University School of Medicine, Detroit, MI, USA
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Sarah Wheelan
- Department of Oncology, The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Brendan P Cormack
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| |
Collapse
|
7
|
Abstract
Mutation of the human genome results in three classes of genomic variation: single nucleotide variants; short insertions or deletions; and large structural variants (SVs). Some mutations occur during normal processes, such as meiotic recombination or B cell development, and others result from DNA replication or aberrant repair of breaks in sequence-specific contexts. Regardless of mechanism, mutations are subject to selection, and some hotspots can manifest in disease. Here, we discuss genomic regions prone to mutation, mechanisms contributing to mutation susceptibility, and the processes leading to their accumulation in normal and somatic genomes. With further, more accurate human genome sequencing, additional mutation hotspots, mechanistic details of their formation, and the relevance of hotspots to evolution and disease are likely to be discovered.
Collapse
|
8
|
Young E, Abid HZ, Kwok PY, Riethman H, Xiao M. Comprehensive Analysis of Human Subtelomeres by Whole Genome Mapping. PLoS Genet 2020; 16:e1008347. [PMID: 31986135 PMCID: PMC7004388 DOI: 10.1371/journal.pgen.1008347] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 02/06/2020] [Accepted: 10/15/2019] [Indexed: 12/03/2022] Open
Abstract
Detailed comprehensive knowledge of the structures of individual long-range telomere-terminal haplotypes are needed to understand their impact on telomere function, and to delineate the population structure and evolution of subtelomere regions. However, the abundance of large evolutionarily recent segmental duplications and high levels of large structural variations have complicated both the mapping and sequence characterization of human subtelomere regions. Here, we use high throughput optical mapping of large single DNA molecules in nanochannel arrays for 154 human genomes from 26 populations to present a comprehensive look at human subtelomere structure and variation. The results catalog many novel long-range subtelomere haplotypes and determine the frequencies and contexts of specific subtelomeric duplicons on each chromosome arm, helping to clarify the currently ambiguous nature of many specific subtelomere structures as represented in the current reference sequence (HG38). The organization and content of some duplicons in subtelomeres appear to show both chromosome arm and population-specific trends. Based upon these trends we estimate a timeline for the spread of these duplication blocks.
Collapse
Affiliation(s)
- Eleanor Young
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Heba Z. Abid
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, University of California–San Francisco, San Francisco, CA, United States of America
- Department of Dermatology, University of California–San Francisco, San Francisco, CA, United States of America
- Institute for Human Genetics, University of California–San Francisco, San Francisco, CA, United States of America
| | - Harold Riethman
- Medical Diagnostic & Translational Sciences, Old Dominium University, Norfolk, VA, United States of America
| | - Ming Xiao
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
- Institute of Molecular Medicine and Infectious Disease in the School of Medicine, Drexel University, Philadelphia, PA, United States of America
| |
Collapse
|
9
|
Genome maps across 26 human populations reveal population-specific patterns of structural variation. Nat Commun 2019; 10:1025. [PMID: 30833565 PMCID: PMC6399254 DOI: 10.1038/s41467-019-08992-7] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Accepted: 02/12/2019] [Indexed: 01/10/2023] Open
Abstract
Large structural variants (SVs) in the human genome are difficult to detect and study by conventional sequencing technologies. With long-range genome analysis platforms, such as optical mapping, one can identify large SVs (>2 kb) across the genome in one experiment. Analyzing optical genome maps of 154 individuals from the 26 populations sequenced in the 1000 Genomes Project, we find that phylogenetic population patterns of large SVs are similar to those of single nucleotide variations in 86% of the human genome, while ~2% of the genome has high structural complexity. We are able to characterize SVs in many intractable regions of the genome, including segmental duplications and subtelomeric, pericentromeric, and acrocentric areas. In addition, we discover ~60 Mb of non-redundant genome content missing in the reference genome sequence assembly. Our results highlight the need for a comprehensive set of alternate haplotypes from different populations to represent SV patterns in the genome. Large structural variants (SV) are understudied in human genetics research because of the difficulty to detect them in the routinely generated short-read sequencing data. Here, the authors generate optical genome maps of 154 individuals from 26 populations that allow comprehensive examination of large SVs.
Collapse
|
10
|
Suarez-Gonzalez A, Hefer CA, Lexer C, Cronk QCB, Douglas CJ. Scale and direction of adaptive introgression between black cottonwood (Populus trichocarpa) and balsam poplar (P. balsamifera). Mol Ecol 2018; 27:1667-1680. [DOI: 10.1111/mec.14561] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 02/17/2018] [Accepted: 02/23/2018] [Indexed: 12/31/2022]
Affiliation(s)
| | - Charles A. Hefer
- Department of Botany; University of British Columbia; Vancouver BC Canada
| | - Christian Lexer
- Department of Botany and Biodiversity Research; University of Vienna; Vienna Austria
| | | | | |
Collapse
|
11
|
Abstract
The terminal regions of eukaryotic chromosomes, composed of telomere repeat sequences and sub-telomeric sequences, represent some of the most variable and rapidly evolving regions of the genome. The sub-telomeric regions are characterized by segmentally duplicated repetitive DNA elements, interstitial telomere repeat sequences and families of variable genes. Sub-telomeric repeat sequence families are shared among multiple chromosome ends, often rendering detailed sequence characterization difficult. These regions are composed of constitutive heterochromatin and are subjected to high levels of meiotic recombination. Dysfunction within telomere repeat arrays, either due to disruption in the chromatin structure or because of telomere shortening, can lead to chromosomal fusion and the generation of large-scale genomic rearrangements across the genome. The dynamic nature of telomeric regions, therefore, provides functionally useful variation to create genetic diversity, but also provides a mechanism for rapid genomic evolution that can lead to reproductive isolation and speciation. This article is part of the theme issue 'Understanding diversity in telomere dynamics'.This article is part of the theme issue 'Understanding diversity in telomere dynamics'.
Collapse
Affiliation(s)
- Duncan M Baird
- Division of Cancer and Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
| |
Collapse
|
12
|
Muraki K, Murnane JP. The DNA damage response at dysfunctional telomeres, and at interstitial and subtelomeric DNA double-strand breaks. Genes Genet Syst 2017; 92:135-152. [PMID: 29162774 DOI: 10.1266/ggs.17-00014] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
In mammals, DNA double-strand breaks (DSBs) are primarily repaired by classical non-homologous end joining (C-NHEJ), although homologous recombination repair and alternative NHEJ (A-NHEJ), which involve DSB processing, can also occur. These pathways are tightly regulated to maintain chromosome integrity. The ends of chromosomes, called telomeres, contain telomeric DNA that forms a cap structure in cooperation with telomeric proteins to prevent the activation of the DNA damage response and chromosome fusion at chromosome termini. Telomeres and subtelomeric regions are poor substrates for DNA replication; therefore, regions near telomeres are prone to replication fork stalling and chromosome breakage. Moreover, DSBs near telomeres are poorly repaired. As a result, when DSBs occur near telomeres in normal cells, the cells stop proliferating, while in cancer cells, subtelomeric DSBs induce rearrangements due to the absence of cell cycle checkpoints. The sensitivity of subtelomeric regions to DSBs is due to the improper regulation of processing, because although C-NHEJ is functional at subtelomeric DSBs, excessive processing results in an increased frequency of large deletions and chromosome rearrangements involving A-NHEJ.
Collapse
Affiliation(s)
- Keiko Muraki
- Institute for Protein Research, Osaka University.,Department of Radiation Oncology, University of California, San Francisco
| | - John P Murnane
- Department of Radiation Oncology, University of California, San Francisco
| |
Collapse
|
13
|
Cavalieri V, Geraci F, Spinelli G. Diversification of spatiotemporal expression and copy number variation of the echinoid hbox12/pmar1/micro1 multigene family. PLoS One 2017; 12:e0174404. [PMID: 28350855 PMCID: PMC5370098 DOI: 10.1371/journal.pone.0174404] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Accepted: 03/08/2017] [Indexed: 11/26/2022] Open
Abstract
Changes occurring during evolution in the cis-regulatory landscapes of individual members of multigene families might impart diversification in their spatiotemporal expression and function. The archetypal member of the echinoid hbox12/pmar1/micro1 family is hbox12-a, a homeobox-containing gene expressed exclusively by dorsal blastomeres, where it governs the dorsal/ventral gene regulatory network during embryogenesis of the sea urchin Paracentrotus lividus. Here we describe the inventory of the hbox12/pmar1/micro1 genes in P. lividus, highlighting that gene copy number variation occurs across individual sea urchins of the same species. We show that the various hbox12/pmar1/micro1 genes group into three subfamilies according to their spatiotemporal expression, which ranges from broad transcription throughout development to transient expression in either the animal hemisphere or micromeres of the early embryo. Interestingly, the promoter regions of those genes showing comparable expression patterns are highly similar, while differing from those of the other subfamilies. Strikingly, phylogenetic analysis suggests that the hbox12/pmar1/micro1 genes are species-specific, exhibiting extensive divergence in their noncoding, but not in their coding, sequences across three distinct sea urchin species. In spite of this, two micromere-specific genes of P. lividus possess a TCF/LEF-binding motif in a similar position, and their transcription relies on Wnt/β-catenin signaling, similar to the pmar1 and micro1 genes, which in other sea urchin species are involved in micromere specification. Altogether, our findings suggest that the hbox12/pmar1/micro1 gene family evolved rather rapidly, generating paralogs whose cis-regulatory sequences diverged following multiple rounds of duplication from a common ancestor.
Collapse
Affiliation(s)
- Vincenzo Cavalieri
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Viale delle Scienze Edificio 16, Palermo, Italy
- Advanced Technologies Network Center (ATeN), University of Palermo, Viale delle Scienze Edificio 18, Palermo, Italy
- * E-mail: (VC); (GS)
| | - Fabiana Geraci
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Viale delle Scienze Edificio 16, Palermo, Italy
| | - Giovanni Spinelli
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Viale delle Scienze Edificio 16, Palermo, Italy
- * E-mail: (VC); (GS)
| |
Collapse
|
14
|
Jehi SE, Wu F, Li B. Trypanosoma brucei TIF2 suppresses VSG switching by maintaining subtelomere integrity. Cell Res 2014; 24:870-85. [PMID: 24810301 DOI: 10.1038/cr.2014.60] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Revised: 03/11/2014] [Accepted: 04/07/2014] [Indexed: 12/21/2022] Open
Abstract
Subtelomeres consist of sequences adjacent to telomeres and contain genes involved in important cellular functions, as subtelomere instability is associated with several human diseases. Balancing between subtelomere stability and plasticity is particularly important for Trypanosoma brucei, a protozoan parasite that causes human African trypanosomiasis. T. brucei regularly switches its major variant surface antigen, variant surface glycoprotein (VSG), to evade the host immune response, and VSGs are expressed exclusively from subtelomeres in a strictly monoallelic fashion. Telomere proteins are important for protecting chromosome ends from illegitimate DNA processes. However, whether they contribute to subtelomere integrity and stability has not been well studied. We have identified a novel T. brucei telomere protein, T. brucei TRF-Interacting Factor 2 (TbTIF2), as a functional homolog of mammalian TIN2. A transient depletion of TbTIF2 led to an elevated VSG switching frequency and an increased amount of DNA double-strand breaks (DSBs) in both active and silent subtelomeric bloodstream form expression sites (BESs). Therefore, TbTIF2 plays an important role in VSG switching regulation and is important for subtelomere integrity and stability. TbTIF2 depletion increased the association of TbRAD51 with the telomeric and subtelomeric chromatin, and TbRAD51 deletion further increased subtelomeric DSBs in TbTIF2-depleted cells, suggesting that TbRAD51-mediated DSB repair is the underlying mechanism of subsequent VSG switching. Surprisingly, significantly more TbRAD51 associated with the active BES than with the silent BESs upon TbTIF2 depletion, and TbRAD51 deletion induced much more DSBs in the active BES than in the silent BESs in TbTIF2-depleted cells, suggesting that TbRAD51 preferentially repairs DSBs in the active BES.
Collapse
Affiliation(s)
- Sanaa E Jehi
- Department of Biological, Geological, and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH 44115, USA
| | - Fan Wu
- Department of Biological, Geological, and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH 44115, USA
| | - Bibo Li
- 1] Department of Biological, Geological, and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH 44115, USA [2] The Rockefeller University, New York, NY 10065, USA [3] Department of Molecular Genetics, Cleveland Clinic Lerner Research Institute, Cleveland, OH 44195, USA [4] Case Comprehensive Cancer Center, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
15
|
Shadravan F. Sex bias in copy number variation of olfactory receptor gene family depends on ethnicity. Front Genet 2013; 4:32. [PMID: 23503716 PMCID: PMC3596775 DOI: 10.3389/fgene.2013.00032] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Accepted: 02/26/2013] [Indexed: 12/22/2022] Open
Abstract
Gender plays a pivotal role in the human genetic identity and is also manifested in many genetic disorders particularly mental retardation. In this study its effect on copy number variation (CNV), known to cause genetic disorders was explored. As the olfactory receptor (OR) repertoire comprises the largest human gene family, it was selected for this study, which was carried out within and between three populations, derived from 150 individuals from the 1000 Genome Project. Analysis of 3872 CNVs detected among 791 OR loci, in which 307 loci showed CNV, revealed the following novel findings: Sex bias in CNV was significantly more prevalent in uncommon than common CNV variants of OR pseudogenes, in which the male genome showed more CNVs; and in one-copy number loss compared to complete deletion of OR pseudogenes; both findings implying a more recent evolutionary role for gender. Sex bias in copy number gain was also detected. Another novel finding was that the observed sex bias was largely dependent on ethnicity and was in general absent in East Asians. Using a CNV public database for sick children (International Standard Cytogenomic Array Consortium) the application of these findings for improving clinical molecular diagnostics is discussed by showing an example of sex bias in CNV among kids with autism. Additional clinical relevance is discussed, as the most polymorphic CNV-enriched OR cluster in the human genome, located on chr 15q11.2, is found near the Prader–Willi syndrome/Angelman syndrome bi-directionally imprinted region associated with two well-known mental retardation syndromes. As olfaction represents the primitive cognition in most mammals, arguably in competition with the development of a larger brain, the extensive retention of OR pseudogenes in females of this study, might point to a parent-of-origin indirect regulatory role for OR pseudogenes in the embryonic development of human brain. Thus any perturbation in the temporal regulation of olfactory system could lead to developmental delay disorders including mental retardation.
Collapse
Affiliation(s)
- Farideh Shadravan
- *Correspondence: Farideh Shadravan, 2584 San Jose Ave, San Francisco, CA 94112, USA. e-mail:
| |
Collapse
|
16
|
Research proceedings on primate comparative genomics. Zool Res 2013; 33:108-18. [DOI: 10.3724/sp.j.1141.2012.01108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
17
|
Abstract
Subtelomeres are an incredibly dynamic part of the human genome located at the ends of chromosomes just proximal to telomere repeats. Although subtelomeric variation contributes to normal polymorphism in the human genome and is a by-product of rapid evolution in these regions, rearrangements in subtelomeres can also cause intellectual disabilities and birth defects, making robust methods of detecting copy number variation in chromosome ends a must for cytogenetics labs. In recent years, methods for detecting structural variation in subtelomeres have moved from fluorescence in situ hybridization (FISH) to array technology; however, FISH is still necessary to determine the chromosomal structure of subtelomeric gains and losses identified by arrays.
Collapse
Affiliation(s)
- M Katharine Rudd
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA.
| |
Collapse
|
18
|
Abstract
Telomeres are ends of chromosomes that play an important part in the biology of eukaryotic cells. Through the coordinated action of the telomerase and networks of other proteins and factors, the length and integrity of telomeres are maintained to prevent telomere dysfunction that has been linked to senescence, aging, diseases, and cancer. The tools and assays being used to study telomeres are being broadened, which has allowed us to derive a more detailed, high-resolution picture of the various players and pathways at work at the telomeres.
Collapse
Affiliation(s)
- Zhou Songyang
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
19
|
Sankaranarayanan K, Nikjoo H. Ionising radiation and genetic risks. XVI. A genome-based framework for risk estimation in the light of recent advances in genome research. Int J Radiat Biol 2010; 87:161-78. [DOI: 10.3109/09553002.2010.518214] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
20
|
Dhillon B, Cavaletto JR, Wood KV, Goodwin SB. Accidental amplification and inactivation of a methyltransferase gene eliminates cytosine methylation in Mycosphaerella graminicola. Genetics 2010; 186:67-77. [PMID: 20610411 PMCID: PMC2940312 DOI: 10.1534/genetics.110.117408] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A de novo search for repetitive elements in the genome sequence of the wheat pathogen Mycosphaerella graminicola identified a family of repeats containing a DNA cytosine methyltransferase sequence (MgDNMT). All 23 MgDNMT sequences identified carried signatures of repeat induced point mutation (RIP). All copies were subtelomeric in location except for one on chromosome 6. Synteny with M. fijiensis implied that the nontelomeric copy on chromosome 6 served as a template for subsequent amplifications. Southern analysis revealed that the MgDNMT sequence also was amplified in 15 additional M. graminicola isolates from various geographical regions. However, this amplification event was specific to M. graminicola; a search for MgDNMT homologs identified only a single, unmutated copy in the genomes of 11 other ascomycetes. A genome-wide methylation assay revealed that M. graminicola lacks cytosine methylation, as expected if its MgDNMT gene is inactivated. Methylation was present in several other species tested, including the closest known relatives of M. graminicola, species S1 and S2. Therefore, the observed changes most likely occurred within the past 10,500 years since the divergence between M. graminicola and S1. Our data indicate that the recent amplification of a single-copy MgDNMT gene made it susceptible to RIP, resulting in complete loss of cytosine methylation in M. graminicola.
Collapse
Affiliation(s)
- Braham Dhillon
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
| | | | | | | |
Collapse
|
21
|
Arnoult N, Schluth-Bolard C, Letessier A, Drascovic I, Bouarich-Bourimi R, Campisi J, Kim SH, Boussouar A, Ottaviani A, Magdinier F, Gilson E, Londoño-Vallejo A. Replication timing of human telomeres is chromosome arm-specific, influenced by subtelomeric structures and connected to nuclear localization. PLoS Genet 2010; 6:e1000920. [PMID: 20421929 PMCID: PMC2858680 DOI: 10.1371/journal.pgen.1000920] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Accepted: 03/22/2010] [Indexed: 12/12/2022] Open
Abstract
The mechanisms governing telomere replication in humans are still poorly understood. To fill this gap, we investigated the timing of replication of single telomeres in human cells. Using in situ hybridization techniques, we have found that specific telomeres have preferential time windows for replication during the S-phase and that these intervals do not depend upon telomere length and are largely conserved between homologous chromosomes and between individuals, even in the presence of large subtelomeric segmental polymorphisms. Importantly, we show that one copy of the 3.3 kb macrosatellite repeat D4Z4, present in the subtelomeric region of the late replicating 4q35 telomere, is sufficient to confer both a more peripheral localization and a later-replicating property to a de novo formed telomere. Also, the presence of β-satellite repeats next to a newly created telomere is sufficient to delay its replication timing. Remarkably, several native, non-D4Z4–associated, late-replicating telomeres show a preferential localization toward the nuclear periphery, while several early-replicating telomeres are associated with the inner nuclear volume. We propose that, in humans, chromosome arm–specific subtelomeric sequences may influence both the spatial distribution of telomeres in the nucleus and their replication timing. Functional telomeres are essential for genome stability. While replication of telomeres has been extensively studied in model organisms such as the baker's yeast, little is known about the mechanisms that govern the replication of human telomeres. In this study, we have determined the timing of replication of telomeres of individual human chromosomes and its association with potential modulating factors such as particular subtelomeric sequences, the presence of heterochromatic regions, and nuclear localization. We have found that native telomeres associated with D4Z4 sequences—a macrosatellite naturally located in the subtelomeric regions of 4q, 10q, and acrocentric chromosome extremities—replicate later than others. We also present descriptive and experimental evidence indicating that nuclear localization influences the timing of telomere replication. These results contribute to our understanding of telomere metabolism in humans.
Collapse
Affiliation(s)
- Nausica Arnoult
- Telomeres and Cancer Laboratory, Institut Curie, CNRS, UPMC University Paris 06, Paris, France
| | - Caroline Schluth-Bolard
- Epigenetics and Telomere Regulation, CNRS ENS UCBL IFR128, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Anne Letessier
- Functional Organization and Plasticity of Mammalian Genomes, Institut Curie, UPMC University Paris 06, Paris, France
| | - Irena Drascovic
- Telomeres and Cancer Laboratory, Institut Curie, CNRS, UPMC University Paris 06, Paris, France
| | | | - Judith Campisi
- Lawrence Berkeley Laboratory, Berkeley, California, United States of America
- Buck Institute for Age Research, Novato, California, United States of America
| | - Sahn-ho Kim
- Lawrence Berkeley Laboratory, Berkeley, California, United States of America
| | - Amina Boussouar
- Epigenetics and Telomere Regulation, CNRS ENS UCBL IFR128, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Alexandre Ottaviani
- Epigenetics and Telomere Regulation, CNRS ENS UCBL IFR128, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Frédérique Magdinier
- Epigenetics and Telomere Regulation, CNRS ENS UCBL IFR128, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Eric Gilson
- Epigenetics and Telomere Regulation, CNRS ENS UCBL IFR128, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Arturo Londoño-Vallejo
- Telomeres and Cancer Laboratory, Institut Curie, CNRS, UPMC University Paris 06, Paris, France
- * E-mail:
| |
Collapse
|
22
|
Axelsson E, Albrechtsen A, van AP, Li L, Megens HJ, Vereijken ALJ, Crooijmans RPMA, Groenen MAM, Ellegren H, Willerslev E, Nielsen R. Segregation distortion in chicken and the evolutionary consequences of female meiotic drive in birds. Heredity (Edinb) 2010; 105:290-8. [DOI: 10.1038/hdy.2009.193] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
|
23
|
Marques-Bonet T, Ryder OA, Eichler EE. Sequencing primate genomes: what have we learned? Annu Rev Genomics Hum Genet 2009; 10:355-86. [PMID: 19630567 DOI: 10.1146/annurev.genom.9.081307.164420] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We summarize the progress in whole-genome sequencing and analyses of primate genomes. These emerging genome datasets have broadened our understanding of primate genome evolution revealing unexpected and complex patterns of evolutionary change. This includes the characterization of genome structural variation, episodic changes in the repeat landscape, differences in gene expression, new models regarding speciation, and the ephemeral nature of the recombination landscape. The functional characterization of genomic differences important in primate speciation and adaptation remains a significant challenge. Limited access to biological materials, the lack of detailed phenotypic data and the endangered status of many critical primate species have significantly attenuated research into the genetic basis of primate evolution. Next-generation sequencing technologies promise to greatly expand the number of available primate genome sequences; however, such draft genome sequences will likely miss critical genetic differences within complex genomic regions unless dedicated efforts are put forward to understand the full spectrum of genetic variation.
Collapse
Affiliation(s)
- Tomas Marques-Bonet
- Department of Genome Sciences, University of Washington and the Howard Hughes Medical Institute, Seattle, Washington 98105, USA.
| | | | | |
Collapse
|
24
|
High-resolution SNP arrays in mental retardation diagnostics: how much do we gain? Eur J Hum Genet 2009; 18:178-85. [PMID: 19809473 DOI: 10.1038/ejhg.2009.154] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
We used Affymetrix 6.0 GeneChip SNP arrays to characterize copy number variations (CNVs) in a cohort of 70 patients previously characterized on lower-density oligonucleotide arrays affected by idiopathic mental retardation and dysmorphic features. The SNP array platform includes approximately 900,000 SNP probes and 900,000 non-SNP oligonucleotide probes at an average distance of 0.7 Kb, which facilitates coverage of the whole genome, including coding and noncoding regions. The high density of probes is critical for detecting small CNVs, but it can lead to data interpretation problems. To reduce the number of false positives, parameters were set to consider only imbalances >75 Kb encompassing at least 80 probe sets. The higher resolution of the SNP array platform confirmed the increased ability to detect small CNVs, although more than 80% of these CNVs overlapped to copy number 'neutral' polymorphism regions and 4.4% of them did not contain known genes. In our cohort of 70 patients, of the 51 previously evaluated as 'normal' on the Agilent 44K array, the SNP array platform disclosed six additional CNV changes, including three in three patients, which may be pathogenic. This suggests that about 6% of individuals classified as 'normal' using the lower-density oligonucleotide array could be found to be affected by a genomic disorder when evaluated with the higher-density microarray platforms.
Collapse
|
25
|
Knape K, Beyer A, Stary A, Buchbauer G, Wolschann P. Evolutionary trace of human odorant receptors of chromosome 17. FLAVOUR FRAG J 2009. [DOI: 10.1002/ffj.1933] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
26
|
Costa V, Casamassimi A, Roberto R, Gianfrancesco F, Matarazzo MR, D'Urso M, D'Esposito M, Rocchi M, Ciccodicola A. DDX11L: a novel transcript family emerging from human subtelomeric regions. BMC Genomics 2009; 10:250. [PMID: 19476624 PMCID: PMC2705379 DOI: 10.1186/1471-2164-10-250] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2008] [Accepted: 05/28/2009] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND The subtelomeric regions of human chromosomes exhibit an extraordinary plasticity. To date, due to the high GC content and to the presence of telomeric repeats, the subtelomeric sequences are underrepresented in the genomic libraries and consequently their sequences are incomplete in the finished human genome sequence, and still much remains to be learned about subtelomere organization, evolution and function. Indeed, only in recent years, several studies have disclosed, within human subtelomeres, novel gene family members. RESULTS During a project aimed to analyze genes located in the telomeric region of the long arm of the human X chromosome, we have identified a novel transcript family, DDX11L, members of which map to 1pter, 2q13/14.1, 2qter, 3qter, 6pter, 9pter/9qter, 11pter, 12pter, 15qter, 16pter, 17pter, 19pter, 20pter/20qter, Xpter/Xqter and Yqter. Furthermore, we partially sequenced the underrepresented subtelomeres of human chromosomes showing a common evolutionary origin. CONCLUSION Our data indicate that an ancestral gene, originated as a rearranged portion of the primate DDX11 gene, and propagated along many subtelomeric locations, is emerging within subtelomeres of human chromosomes, defining a novel gene family. These findings support the possibility that the high plasticity of these regions, sites of DNA exchange among different chromosomes, could trigger the emergence of new genes.
Collapse
Affiliation(s)
- Valerio Costa
- Institute of Genetics and Biophysics A, Buzzati-Traverso , CNR, 80131 Naples, Italy.
| | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Bailey JA, Kidd JM, Eichler EE. Human copy number polymorphic genes. Cytogenet Genome Res 2009; 123:234-43. [PMID: 19287160 DOI: 10.1159/000184713] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2008] [Indexed: 11/19/2022] Open
Abstract
Recent large-scale genomic studies within human populations have identified numerous genomic regions as copy number variant (CNV). As these CNV regions often overlap coding regions of the genome, large lists of potentially copy number polymorphic genes have been produced that are candidates for disease association. Most of the current data regarding normal genic variation, however, has been generated using BAC or SNP microarrays, which lack precision especially with respect to exons. To address this, we assessed 2,790 candidate CNV genes defined from available studies in nine well-characterized HapMap individuals by designing a customized oligonucleotide microarray targeted specifically to exons. Using exon array comparative genomic hybridization (aCGH), we detected 255 (9%) of the candidates as true CNVs including 134 with evidence of variation over the entire gene. Individuals differed in copy number from the control by an average of 100 gene loci. Both partial- and whole-gene CNVs were strongly associated with segmental duplications (55 and 71%, respectively) as well as regions of positive selection. We confirmed 37% of the whole-gene CNVs using the fosmid end sequence pair (ESP) structural variation map for these same individuals. If we modify the end sequence pair mapping strategy to include low-sequence identity ESPs (98-99.5%) and ESPs with an everted orientation, we can capture 82% of the missed genes leading to more complete ascertainment of structural variation within duplicated genes. Our results indicate that segmental duplications are the source of the majority of full-length copy number polymorphic genes, most of the variant genes are organized as tandem duplications, and a significant fraction of these genes will represent paralogs with levels of sequence diversity beyond thresholds of allelic variation. In addition, these data provide a targeted set of CNV genes enriched for regions likely to be associated with human phenotypic differences due to copy number changes and present a source of copy number responsive oligonucleotide probes for future association studies.
Collapse
Affiliation(s)
- J A Bailey
- Department of Pathology, Case Western University School of Medicine and University Hospitals of Cleveland, Cleveland, OH, USA.
| | | | | |
Collapse
|
28
|
Riethman H. Human subtelomeric copy number variations. Cytogenet Genome Res 2009; 123:244-52. [PMID: 19287161 DOI: 10.1159/000184714] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/23/2008] [Indexed: 01/15/2023] Open
Abstract
Copy number variation is a defining characteristic of human subtelomeres. Human subtelomeric segmental duplication regions ('Subtelomeric Repeats') comprise about 25% of the most distal 500 kb and 80% of the most distal 100 kb in human DNA. Huge allelic disparities seen in subtelomeric DNA sequence content and organization are postulated to have an impact on the dosage of transcripts embedded within the duplicated sequences, on the transcription of genes in adjacent single copy DNA regions, and on the chromatin structures mediating telomere functions including chromosome stability. In addition to the complex duplicon substructure and huge allelic variations in extended subtelomere regions, both copy number variation and alternative sequence organizations for DNA characterize the sequences immediately adjacent to terminal (TTAGGG)n tracts ('subterminal DNA'). The structural variation in subterminal DNA is likely to have important consequences for expression of subterminal transcripts such as a newly-discovered gene family encoding actin-interacting proteins and a non-coding telomeric repeat containing RNA (TERRA) transcript family critical for telomere integrity. Major immediate challenges include discovering the full extent and nature of subtelomeric structural and copy number variation in humans, and developing methods for tracking individual allelic variants in the context of total genomic DNA.
Collapse
Affiliation(s)
- H Riethman
- The Wistar Institute, Philadelphia, PA, USA
| |
Collapse
|
29
|
Perry GH. The evolutionary significance of copy number variation in the human genome. Cytogenet Genome Res 2009; 123:283-7. [PMID: 19287166 DOI: 10.1159/000184719] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/22/2008] [Indexed: 12/27/2022] Open
Abstract
Copy number variation provides the raw material for gene family expansion and diversification, which is an important evolutionary force. Moreover, copy number variants (CNVs) can influence gene transcriptional and translational levels and have been associated with complex disease susceptibility. Therefore, natural selection may have affected at least some of the greater than one thousand CNVs thus far discovered among the genomes of phenotypically normal humans. While identifying and understanding particular instances of natural selection may shed light on important aspects of human evolutionary history, our ability to analyze CNVs in traditional population genetic frameworks has been limited. However, progress has been made by adapting some of these frameworks for use with copy number data. Moving forward, these efforts will be aided by non-human organism studies of the population genetics of copy number variation, and by more direct comparisons of within-species copy number variation and between-species copy number fixation.
Collapse
Affiliation(s)
- G H Perry
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
30
|
de Smith AJ, Walters RG, Froguel P, Blakemore AI. Human genes involved in copy number variation: mechanisms of origin, functional effects and implications for disease. Cytogenet Genome Res 2009; 123:17-26. [PMID: 19287135 DOI: 10.1159/000184688] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/04/2008] [Indexed: 11/19/2022] Open
Abstract
Copy number variants (CNVs) overlap over 7000 genes, many of which are pivotal in biological pathways. The implications of this are profound, with consequences for evolutionary studies, population genetics, gene function and human phenotype, including elucidation of genetic susceptibility to major common diseases, the heritability of which has thus far defied full explanation. Even though this research is still in its infancy, CNVs have already been associated with a number of monogenic, syndromic and complex diseases: the development of high throughput and high resolution techniques for CNV screening is likely to bring further new insights into the contribution of copy number variation to common diseases. Amongst genes overlapped by CNVs, significant enrichments for certain gene ontology categories have been identified, including those related to immune responses and interactions with the environment. Genes in both of these categories are thought to be important in evolutionary adaptation and to be particular targets of natural selection. Thus, a full appreciation of copy number variation may be important for our understanding of human evolution.
Collapse
Affiliation(s)
- A J de Smith
- Section of Genomic Medicine, Imperial College London, Hammersmith Hospital, London, UK
| | | | | | | |
Collapse
|
31
|
Shi P, Zhang J. Extraordinary diversity of chemosensory receptor gene repertoires among vertebrates. Results Probl Cell Differ 2009; 47:1-23. [PMID: 19145414 DOI: 10.1007/400_2008_4] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Chemosensation (smell and taste) is important to the survival and reproduction of vertebrates and is mediated by specific bindings of odorants, pheromones, and tastants by chemoreceptors that are encoded by several large gene families. This review summarizes recent comparative genomic and evolutionary studies of vertebrate chemoreceptor genes. It focuses on the remarkable diversity of chemoreceptor gene repertoires in terms of gene number and gene sequence across vertebrates and the evolutionary mechanisms that are responsible for generating this diversity. We argue that the great among-species variation of chemoreceptor gene repertoires is a result of adaptations of individual species to their environments and diets.
Collapse
Affiliation(s)
- P Shi
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, The Chinese Academy of Sciences, Arthur-Scheunert-Allee 114-116, 650223, Kunming, China.
| | | |
Collapse
|
32
|
Edelmann L, Hirschhorn K. Clinical Utility of Array CGH for the Detection of Chromosomal Imbalances Associated with Mental Retardation and Multiple Congenital Anomalies. Ann N Y Acad Sci 2008; 1151:157-66. [DOI: 10.1111/j.1749-6632.2008.03610.x] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
33
|
Abstract
Telomeres and subtelomere regions have vital roles in cellular homeostasis and can facilitate niche adaptation. However, information on telomere/subtelomere structure is still limited to a small number of organisms. Prior to initiation of this project, the Neurospora crassa genome assembly contained only 3 of the 14 telomeres. The missing telomeres were identified through bioinformatic mining of raw sequence data from the genome project and from clones in new cosmid and plasmid libraries. Their chromosomal locations were assigned on the basis of paired-end read information and/or by RFLP mapping. One telomere is attached to the ribosomal repeat array. The remaining chromosome ends have atypical structures in that they lack distinct subtelomere domains or other sequence features that are associated with telomeres in other organisms. Many of the chromosome ends terminate in highly AT-rich sequences that appear to be products of repeat-induced point mutation, although most are not currently repeated sequences. Several chromosome termini in the standard Oak Ridge wild-type strain were compared to their counterparts in an exotic wild type, Mauriceville. This revealed that the sequences immediately adjacent to the telomeres are usually genome specific. Finally, despite the absence of many features typically found in the telomere regions of other organisms, the Neurospora chromosome termini still retain the dynamic nature that is characteristic of chromosome ends.
Collapse
|
34
|
Abstract
Human telomeric DNA is complex and highly variable. Subterminal sequences are associated with cis-acting determinants of allele-specific (TTAGGG)n tract length regulation and may modulate susceptibility of (TTAGGG)n tracts to rapid deletion events. More extensive subtelomeric DNA tracts are filled with segmental duplications and segments that vary in copy number, leading to highly variable subtelomeric allele structures in the human population. RNA transcripts encoded in telomere regions include multicopy protein-encoding gene families and a variety of noncoding RNAs. One recently described family of (UUAGGG)n-containing subterminal RNAs appears to be critical for telomere integrity; these RNAs associate with telomeric chromatin and are regulated by RNA surveillance factors including human homologs of the yeast Est1p protein. An increasingly detailed and complete picture of telomeric DNA sequence organization and structural variation is essential for understanding and tracking allele-specific subterminal and subtelomeric features critical for human biology.
Collapse
Affiliation(s)
- Harold Riethman
- The Wistar Institute, Philadelphia, Pennsylvania 19104, USA.
| |
Collapse
|
35
|
Rudd MK, Endicott RM, Friedman C, Walker M, Young JM, Osoegawa K, de Jong PJ, Green ED, Trask BJ. Comparative sequence analysis of primate subtelomeres originating from a chromosome fission event. Genome Res 2008; 19:33-41. [PMID: 18952852 DOI: 10.1101/gr.083170.108] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Subtelomeres are concentrations of interchromosomal segmental duplications capped by telomeric repeats at the ends of chromosomes. The nature of the segments shared by different sets of human subtelomeres reflects their high rate of recent interchromosomal exchange. Here, we characterize the rearrangements incurred by the 15q subtelomere after it arose from a chromosome fission event in the common ancestor of great apes. We used FISH, sequencing of genomic clones, and PCR to map the breakpoint of this fission and track the fate of flanking sequence in human, chimpanzee, gorilla, orangutan, and macaque genomes. The ancestral locus, a cluster of olfactory receptor (OR) genes, lies internally on macaque chromosome 7. Sequence originating from this fission site is split between the terminus of 15q and the pericentromere of 14q in the great apes. Numerous structural rearrangements, including interstitial deletions and transfers of material to or from other subtelomeres, occurred subsequent to the fission, such that each species has a unique 15q structure and unique collection of ORs derived from the fission locus. The most striking rearrangement involved transfer of at least 200 kb from the fission-site region to the end of chromosome 4q, where much still resides in chimpanzee and gorilla, but not in human. This gross structural difference places the subtelomeric defect underlying facioscapulohumeral muscular dystrophy (FSHD) much closer to the telomere in human 4q than in the hybrid 4q-15q subtelomere of chimpanzee.
Collapse
Affiliation(s)
- M Katharine Rudd
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Young JM, Endicott RM, Parghi SS, Walker M, Kidd JM, Trask BJ. Extensive copy-number variation of the human olfactory receptor gene family. Am J Hum Genet 2008; 83:228-42. [PMID: 18674749 DOI: 10.1016/j.ajhg.2008.07.005] [Citation(s) in RCA: 106] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2008] [Revised: 06/25/2008] [Accepted: 07/07/2008] [Indexed: 11/24/2022] Open
Abstract
As much as a quarter of the human genome has been reported to vary in copy number between individuals, including regions containing about half of the members of the olfactory receptor (OR) gene family. We have undertaken a detailed study of copy-number variation of ORs to elucidate the selective and mechanistic forces acting on this gene family and the true impact of copy-number variation on human OR repertoires. We argue that the properties of copy-number variants (CNVs) and other sets of large genomic regions violate the assumptions of statistical methods that are commonly used in the assessment of gene enrichment. Using more appropriate methods, we provide evidence that OR enrichment in CNVs is not due to positive selection but is because of OR preponderance in segmentally duplicated regions, which are known to be frequently copy-number variable, and because purifying selection against CNVs is lower in OR-containing regions than in regions containing essential genes. We also combine multiplex ligation-dependent probe amplification (MLPA) and PCR to assay the copy numbers of 37 candidate CNV ORs in a panel of approximately 50 human individuals. We confirm copy-number variation of 18 ORs but find no variation in this human-diversity panel for 16 other ORs, highlighting the caveat that reported intervals often overrepresent true CNVs. The copy-number variation we describe is likely to underpin significant variation in olfactory abilities among human individuals. Finally, we show that both homology-based and homology-independent processes have played a recent role in remodeling the OR family.
Collapse
|
37
|
Abstract
There is growing appreciation that the human genome contains significant numbers of structural rearrangements, such as insertions, deletions, inversions, and large tandem repeats. Recent studies have defined approximately 5% of the human genome as structurally variant in the normal population, involving more than 800 independent genes. We present a detailed review of the various structural rearrangements identified to date in humans, with particular reference to their influence on human phenotypic variation. Our current knowledge of the extent of human structural variation shows that the human genome is a highly dynamic structure that shows significant large-scale variation from the currently published genome reference sequence.
Collapse
Affiliation(s)
- Andrew J Sharp
- Department of Genome Sciences, University of Washington, Howard Hughes Medical Institute, Seattle, Washington 98195, USA
| | | | | |
Collapse
|
38
|
Abstract
DNA sequence surveys in yeast and humans suggest that the forces shaping telomeric polymorphism and divergence are distinctly more dynamic than those in the euchromatic, gene-rich regions of the chromosomes. However, the generality of this pattern across outbreeding, multicellular eukaryotes has not been determined. To characterize the structure and evolution of Drosophila telomeres, we collected and analyzed molecular population genetics data from the X chromosome subtelomere in 58 lines of North American Drosophila melanogaster and 29 lines of African D. melanogaster. We found that Drosophila subtelomeres exhibit high levels of both structural and substitutional polymorphism relative to linked euchromatic regions. We also observed strikingly different patterns of variation in the North American and African samples. Moreover, our analyses of the polymorphism data identify a localized hotspot of recombination in the most-distal portion of the X subtelomere. While the levels of polymorphism decline sharply and in parallel with rates of crossing over per physical length over the distal first euchromatic megabase pairs of the X chromosome, our data suggest that they rise again sharply in the subtelomeric region (approximately 80 kbp). These patterns of historical recombination and geographic differentiation indicate that, similar to yeast and humans, Drosophila subtelomeric DNA is evolving very differently from euchromatic DNA.
Collapse
|
39
|
Perry GH, Ben-Dor A, Tsalenko A, Sampas N, Rodriguez-Revenga L, Tran CW, Scheffer A, Steinfeld I, Tsang P, Yamada NA, Park HS, Kim JI, Seo JS, Yakhini Z, Laderman S, Bruhn L, Lee C. The fine-scale and complex architecture of human copy-number variation. Am J Hum Genet 2008; 82:685-95. [PMID: 18304495 DOI: 10.1016/j.ajhg.2007.12.010] [Citation(s) in RCA: 253] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2007] [Revised: 12/12/2007] [Accepted: 12/31/2007] [Indexed: 11/27/2022] Open
Abstract
Despite considerable excitement over the potential functional significance of copy-number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at approximately 1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1020 of 1153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of currently known common human CNVs is likely smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously uncharacterized complexity.
Collapse
Affiliation(s)
- George H Perry
- Department of Pathology, Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Ambrosini A, Paul S, Hu S, Riethman H. Human subtelomeric duplicon structure and organization. Genome Biol 2008; 8:R151. [PMID: 17663781 PMCID: PMC2323237 DOI: 10.1186/gb-2007-8-7-r151] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2007] [Revised: 06/25/2007] [Accepted: 07/30/2007] [Indexed: 01/27/2023] Open
Abstract
The sequence divergence within subtelomeric duplicon families varies considerably, as does the organization of duplicon blocks at subtelomere alleles; a class of duplicon blocks was identified that are subtelomere-specific. Background Human subtelomeric segmental duplications ('subtelomeric repeats') comprise about 25% of the most distal 500 kb and 80% of the most distal 100 kb in human DNA. A systematic analysis of the duplication substructure of human subtelomeric regions was done in order to develop a detailed understanding of subtelomeric sequence organization and a nucleotide sequence-level characterization of subtelomeric duplicon families. Results The extent of nucleotide sequence divergence within subtelomeric duplicon families varies considerably, as does the organization of duplicon blocks at subtelomere alleles. Subtelomeric internal (TTAGGG)n-like tracts occur at duplicon boundaries, suggesting their involvement in the generation of the complex sequence organization. Most duplicons have copies at both subtelomere and non-subtelomere locations, but a class of duplicon blocks is identified that are subtelomere-specific. In addition, a group of six subterminal duplicon families are identified that, together with six single-copy telomere-adjacent segments, include all of the (TTAGGG)n-adjacent sequence identified so far in the human genome. Conclusion Identification of a class of duplicon blocks that is subtelomere-specific will facilitate high-resolution analysis of subtelomere repeat copy number variation as well as studies involving somatic subtelomere rearrangements. The significant levels of nucleotide sequence divergence within many duplicon families as well as the differential organization of duplicon blocks on subtelomere alleles may provide opportunities for allele-specific subtelomere marker development; this is especially true for subterminal regions, where divergence and organizational differences are the greatest. These subterminal sequence families comprise the immediate cis-elements for (TTAGGG)n tracts, and are prime candidates for subtelomeric sequences regulating telomere-specific (TTAGGG)n tract length in humans.
Collapse
Affiliation(s)
- Anthony Ambrosini
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| | - Sheila Paul
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
| | - Sufen Hu
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
| | - Harold Riethman
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
| |
Collapse
|
41
|
Menashe I, Abaffy T, Hasin Y, Goshen S, Yahalom V, Luetje CW, Lancet D. Genetic elucidation of human hyperosmia to isovaleric acid. PLoS Biol 2008; 5:e284. [PMID: 17973576 PMCID: PMC2043052 DOI: 10.1371/journal.pbio.0050284] [Citation(s) in RCA: 154] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2007] [Accepted: 08/31/2007] [Indexed: 11/23/2022] Open
Abstract
The genetic basis of odorant-specific variations in human olfactory thresholds, and in particular of enhanced odorant sensitivity (hyperosmia), remains largely unknown. Olfactory receptor (OR) segregating pseudogenes, displaying both functional and nonfunctional alleles in humans, are excellent candidates to underlie these differences in olfactory sensitivity. To explore this hypothesis, we examined the association between olfactory detection threshold phenotypes of four odorants and segregating pseudogene genotypes of 43 ORs genome-wide. A strong association signal was observed between the single nucleotide polymorphism variants in OR11H7P and sensitivity to the odorant isovaleric acid. This association was largely due to the low frequency of homozygous pseudogenized genotype in individuals with specific hyperosmia to this odorant, implying a possible functional role of OR11H7P in isovaleric acid detection. This predicted receptor–ligand functional relationship was further verified using the Xenopus oocyte expression system, whereby the intact allele of OR11H7P exhibited a response to isovaleric acid. Notably, we also uncovered another mechanism affecting general olfactory acuity that manifested as a significant inter-odorant threshold concordance, resulting in an overrepresentation of individuals who were hyperosmic to several odorants. An involvement of polymorphisms in other downstream transduction genes is one possible explanation for this observation. Thus, human hyperosmia to isovaleric acid is a complex trait, contributed to by both receptor and other mechanisms in the olfactory signaling pathway. Humans can accurately discern thousands of odors, yet there is considerable inter-individual variation in the ability to detect different odors, with individuals exhibiting low sensitivity (hyposmia), high sensitivity (hyperosmia), or even “blindness” (anosmia) to particular odors. Such differences are thought to stem from genetic differences in olfactory receptor (OR) genes, which encode proteins that initiate olfactory signaling. OR segregating pseudogenes, which have both functional and inactive alleles in the population, are excellent candidates for producing this olfactory phenotype diversity. Here, we provide evidence that a particular segregating OR gene is related to sensitivity to a sweaty odorant, isovaleric acid. We show that hypersensitivity towards this odorant is seen predominantly in individuals who carry at least one copy of the intact allele. Furthermore, we demonstrate that this hyperosmia is a complex trait, being driven by additional factors affecting general olfactory acuity. Our results highlight a functional role of segregating pseudogenes in human olfactory variability, and constitute a step towards deciphering the genetic basis of human olfactory variability. Genetic epidemiology analysis reveals a multifaceted mechanism underlying enhanced olfactory sensitivity to the sweaty odor of isovaleric acid in humans.
Collapse
Affiliation(s)
- Idan Menashe
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Crown Human Genome Center, Weizmann Institute of Science, Rehovot, Israel
| | - Tatjana Abaffy
- Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Yehudit Hasin
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Crown Human Genome Center, Weizmann Institute of Science, Rehovot, Israel
| | - Sivan Goshen
- Department of Otolaryngology, Meir Hospital, Kfar Saba, Israel
| | - Vered Yahalom
- National Blood Group Reference Laboratory, Magen David Adom National Blood Services Center, Ramat-Gan, Israel
| | - Charles W Luetje
- Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Doron Lancet
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Crown Human Genome Center, Weizmann Institute of Science, Rehovot, Israel
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
42
|
Cooper GM, Nickerson DA, Eichler EE. Mutational and selective effects on copy-number variants in the human genome. Nat Genet 2007; 39:S22-9. [PMID: 17597777 DOI: 10.1038/ng2054] [Citation(s) in RCA: 202] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Comprehensive descriptions of large insertion/deletion or segmental duplication polymorphisms (SDs) in the human genome have recently been generated. These annotations, known collectively as structural or copy-number variants (CNVs), include thousands of discrete genomic regions and span hundreds of millions of nucleotides. Here we review the genomic distribution of CNVs, which is strongly correlated with gene, repeat and segmental duplication content. We explore the evolutionary mechanisms giving rise to this nonrandom distribution, considering the available data on both human polymorphisms and the fixed changes that differentiate humans from other species. It is likely that mutational biases, selective effects and interactions between these forces all contribute substantially to the spectrum of human copy-number variation. Although defining these variants with nucleotide-level precision remains a largely unmet but critical challenge, our understanding of their potential medical impact and evolutionary importance is rapidly emerging.
Collapse
Affiliation(s)
- Gregory M Cooper
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | | | |
Collapse
|
43
|
Abstract
The olfactory system meets niche- and species-specific demands by an accelerated evolution of its odorant receptor repertoires. In this review, we describe evolutionary processes that have shaped olfactory and vomeronasal receptor gene families in vertebrate genomes. We emphasize three important periods in the evolution of the olfactory system evident by comparative genomics: the adaptation to land in amphibian ancestors, the decline of olfaction in primates, and the delineation of putative pheromone receptors concurrent with rodent speciation. The rapid evolution of odorant receptor genes, the sheer size of the repertoire, as well as their wide distribution in the genome, presents a developmental challenge: how are these ever-changing odorant receptor repertoires coordinated within the olfactory system? A central organizing principle in olfaction is the specialization of sensory neurons resulting from each sensory neuron expressing only ~one odorant receptor allele. In this review, we also discuss this mutually exclusive expression of odorant receptor genes. We have considered several models to account for co-regulation of odorant receptor repertoires, as well as discussed a new hypothesis that invokes important epigenetic properties of the system.
Collapse
Affiliation(s)
- Marijo B Kambere
- Department of Molecular Biology and Biochemistry, Wesleyan University, Middletown, CT, USA
| | - Robert P Lane
- Department of Molecular Biology and Biochemistry, Wesleyan University, Middletown, CT, USA
| |
Collapse
|
44
|
Babcock M, Yatsenko S, Hopkins J, Brenton M, Cao Q, de Jong P, Stankiewicz P, Lupski JR, Sikela JM, Morrow BE. Hominoid lineage specific amplification of low-copy repeats on 22q11.2 (LCR22s) associated with velo-cardio-facial/digeorge syndrome. Hum Mol Genet 2007; 16:2560-71. [PMID: 17675367 DOI: 10.1093/hmg/ddm197] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Segmental duplications or low-copy repeats (LCRs) constitute approximately 5% of the sequenced portion of the human genome and are associated with many human congenital anomaly disorders. The low-copy repeats on chromosome 22q11.2 (LCR22s) mediate chromosomal rearrangements resulting in deletions, duplications and translocations. The evolutionary mechanisms leading to LCR22 formation is unknown. Four genes, USP18, BCR, GGTLA and GGT, map adjacent to the LCR22s and pseudogene copies are located within them. It has been hypothesized that gene duplication occurred during primate evolution, followed by recombination events, forming pseudogene copies. We investigated whether gene duplication could be detected in non-human hominoid species. FISH mapping was performed using probes to the four functional gene loci. There was evidence for a single copy in humans but additional copies in hominoid species. We then compared LCR22 copy number using LCR22 FISH probes. Lineage specific LCR22 variation was detected in the hominoid species supporting the hypothesis. To independently validate initial findings, real time PCR, and screening of gorilla BAC library filters were performed. This was compared to array comparative genome hybridization data available. The most striking finding was a dramatic amplification of LCR22s in the gorilla. The LCR22s localized to the telomeric or subtelomeric bands of gorilla chromosomes. The most parsimonious explanation is that the LCR22s became amplified by inter-chromosomal recombination between telomeric bands. In summary, our results are consistent with a lineage specific coupling between gene and LCR22 duplication events. The LCR22s thus serve as an important model for evolution of genome variation.
Collapse
Affiliation(s)
- Melanie Babcock
- Department of Molecular Genetics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA. mbabcock@aecom
| | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation. BMC Genomics 2007; 8:211. [PMID: 17608949 PMCID: PMC1934372 DOI: 10.1186/1471-2164-8-211] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2007] [Accepted: 07/03/2007] [Indexed: 11/29/2022] Open
Abstract
Background The recent discovery of widespread copy number variation in humans has forced a shift away from the assumption of two copies per locus per cell throughout the autosomal genome. In particular, a SNP site can no longer always be accurately assigned one of three genotypes in an individual. In the presence of copy number variability, the individual may theoretically harbor any number of copies of each of the two SNP alleles. Results To address this issue, we have developed a method to infer a "generalized genotype" from raw SNP microarray data. Here we apply our approach to data from 48 individuals and uncover thousands of aberrant SNPs, most in regions that were previously unreported as copy number variants. We show that our allele-specific copy numbers follow Mendelian inheritance patterns that would be obscured in the absence of SNP allele information. The interplay between duplication and point mutation in our data shed light on the relative frequencies of these events in human history, showing that at least some of the duplication events were recurrent. Conclusion This new multi-allelic view of SNPs has a complicated role in disease association studies, and further work will be necessary in order to accurately assess its importance. Software to perform generalized genotyping from SNP array data is freely available online [1].
Collapse
|
46
|
Abstract
DNA copy number variation (CNV) represents a considerable source of human genetic diversity. Recently,1 a global map of copy number variation in the human genome has been drawn up which reveals not only the ubiquity but also the complexity of this type of variation. Thus, two human genomes may differ by more than 20 Mb and it is likely that the full extent of CNV still remains to be discovered. Nearly 3000 genes are associated with CNV. This high degree of variability with regard to gene copy number between two individuals challenges definitions of normality. Many CNVs are located in regions of complex genomic structure and this currently limits the extent to which these variants can be genotyped by using tagging SNPs. However, some CNVs are already amenable to genome-wide association studies so that their influence on human phenotypic diversity and disease susceptibility may soon be determined.
Collapse
|
47
|
Kehrer-Sawatzki H, Cooper DN. Understanding the recent evolution of the human genome: insights from human-chimpanzee genome comparisons. Hum Mutat 2007; 28:99-130. [PMID: 17024666 DOI: 10.1002/humu.20420] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The sequencing of the chimpanzee genome and the comparison with its human counterpart have begun to reveal the spectrum of genetic changes that has accompanied human evolution. In addition to gross karyotypic rearrangements such as the fusion that formed human chromosome 2 and the human-specific pericentric inversions of chromosomes 1 and 18, there is considerable submicroscopic structural variation involving deletions, duplications, and inversions. Lineage-specific segmental duplications, detected by array comparative genomic hybridization and direct sequence comparison, have made a very significant contribution to this structural divergence, which is at least three-fold greater than that due to nucleotide substitutions. Since structural genomic changes may have given rise to irreversible functional differences between the diverging species, their detailed analysis could help to identify the biological processes that have accompanied speciation. To this end, interspecies comparisons have revealed numerous human-specific gains and losses of genes as well as changes in gene expression. The very considerable structural diversity (polymorphism) evident within both lineages has, however, hampered the analysis of the structural divergence between the human and chimpanzee genomes. The concomitant evaluation of genetic divergence and diversity at the nucleotide level has nevertheless served to identify many genes that have evolved under positive selection and may thus have been involved in the development of human lineage-specific traits. Genes that display signs of weak negative selection have also been identified and could represent candidate loci for complex genomic disorders. Here, we review recent progress in comparing the human and chimpanzee genomes and discuss how the differences detected have improved our understanding of the evolution of the human genome.
Collapse
|
48
|
Kehrer-Sawatzki H, Cooper DN. Structural divergence between the human and chimpanzee genomes. Hum Genet 2006; 120:759-78. [PMID: 17066299 DOI: 10.1007/s00439-006-0270-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2006] [Accepted: 09/19/2006] [Indexed: 01/17/2023]
Abstract
The structural microheterogeneity evident between the human and chimpanzee genomes is quite considerable and includes inversions and duplications as well as deletions, ranging in size from a few base-pairs up to several megabases (Mb). Insertions and deletions have together given rise to at least 150 Mb of genomic DNA sequence that is either present or absent in humans as compared to chimpanzees. Such regions often contain paralogous sequences and members of multigene families thereby ensuring that the human and chimpanzee genomes differ by a significant fraction of their gene content. There is as yet no evidence to suggest that the large chromosomal rearrangements which serve to distinguish the human and chimpanzee karyotypes have influenced either speciation or the evolution of lineage-specific traits. However, the myriad submicroscopic rearrangements in both genomes, particularly those involving copy number variation, are unlikely to represent exclusively neutral changes and hence promise to facilitate the identification of genes that have been important for human-specific evolution.
Collapse
|
49
|
Rehmeyer C, Li W, Kusaba M, Kim YS, Brown D, Staben C, Dean R, Farman M. Organization of chromosome ends in the rice blast fungus, Magnaporthe oryzae. Nucleic Acids Res 2006; 34:4685-701. [PMID: 16963777 PMCID: PMC1635262 DOI: 10.1093/nar/gkl588] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Eukaryotic pathogens of humans often evade the immune system by switching the expression of surface proteins encoded by subtelomeric gene families. To determine if plant pathogenic fungi use a similar mechanism to avoid host defenses, we sequenced the 14 chromosome ends of the rice blast pathogen, Magnaporthe oryzae. One telomere is directly joined to ribosomal RNA-encoding genes, at the end of the ∼2 Mb rDNA array. Two are attached to chromosome-unique sequences, and the remainder adjoin a distinct subtelomere region, consisting of a telomere-linked RecQ-helicase (TLH) gene flanked by several blocks of tandem repeats. Unlike other microbes, M.oryzae exhibits very little gene amplification in the subtelomere regions—out of 261 predicted genes found within 100 kb of the telomeres, only four were present at more than one chromosome end. Therefore, it seems unlikely that M.oryzae uses switching mechanisms to evade host defenses. Instead, the M.oryzae telomeres have undergone frequent terminal truncation, and there is evidence of extensive ectopic recombination among transposons in these regions. We propose that the M.oryzae chromosome termini play more subtle roles in host adaptation by promoting the loss of terminally-positioned genes that tend to trigger host defenses.
Collapse
Affiliation(s)
- Cathryn Rehmeyer
- Department of Plant Pathology, University of KentuckyLexington, KY 40546 USA
| | - Weixi Li
- Department of Biology, University of KentuckyLexington, KY 40546 USA
| | - Motoaki Kusaba
- Department of Plant Pathology, University of KentuckyLexington, KY 40546 USA
| | - Yun-Sik Kim
- Department of Plant Pathology, University of KentuckyLexington, KY 40546 USA
| | - Doug Brown
- Center for Integrated Fungal Research, North Carolina State UniversityRaleigh, NC 27695 USA
| | - Chuck Staben
- Department of Biology, University of KentuckyLexington, KY 40546 USA
| | - Ralph Dean
- Center for Integrated Fungal Research, North Carolina State UniversityRaleigh, NC 27695 USA
| | - Mark Farman
- Department of Plant Pathology, University of KentuckyLexington, KY 40546 USA
- To whom correspondence should be addressed. Tel: 859 257 7445, ext. 80728; Fax: 859 323 1961;
| |
Collapse
|
50
|
Doggett NA, Xie G, Meincke LJ, Sutherland RD, Mundt MO, Berbari NS, Davy BE, Robinson ML, Rudd MK, Weber JL, Stallings RL, Han C. A 360-kb interchromosomal duplication of the human HYDIN locus. Genomics 2006; 88:762-771. [PMID: 16938426 DOI: 10.1016/j.ygeno.2006.07.012] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2006] [Revised: 07/06/2006] [Accepted: 07/19/2006] [Indexed: 11/16/2022]
Abstract
The HYDIN gene located in human chromosome band 16q22.2 is a large gene encompassing 423 kb of genomic DNA that has been suggested as a candidate for an autosomal recessive form of congenital hydrocephalus. We have found that the human HYDIN locus has been very recently duplicated, with a nearly identical 360-kb paralogous segment inserted on chromosome 1q21.1. The duplication, among the largest interchromosomal segmental duplications described in humans, is not accounted for in the current human genome assembly and appears to be part of a greater than 550-kb contig that must lie within 1 of the 11 sequence gaps currently remaining in 1q21.1. Both copies of the HYDIN gene are expressed in alternatively spliced transcripts. Elucidation of the role of HYDIN in human disease susceptibility will require careful discrimination among the paralogous copies.
Collapse
Affiliation(s)
- Norman A Doggett
- DOE Joint Genome Institute and Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
| | - Gary Xie
- DOE Joint Genome Institute and Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Linda J Meincke
- DOE Joint Genome Institute and Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Robert D Sutherland
- DOE Joint Genome Institute and Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Mark O Mundt
- DOE Joint Genome Institute and Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Nicolas S Berbari
- Division of Molecular and Human Genetics, Children's Research Institute, Ohio State University, 700 Children's Drive, Columbus, OH 43205, USA
| | - Brian E Davy
- Division of Molecular and Human Genetics, Children's Research Institute, Ohio State University, 700 Children's Drive, Columbus, OH 43205, USA
| | - Michael L Robinson
- Division of Molecular and Human Genetics, Children's Research Institute, Ohio State University, 700 Children's Drive, Columbus, OH 43205, USA
| | - M Katharine Rudd
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, C3-168, Seattle, WA 98109, USA
| | - James L Weber
- Center for Medical Genetics, Marshfield Medical Research Foundation, 1000 North Oak Avenue, Marshfield, WI 54449, USA
| | - Raymond L Stallings
- Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, TX 78229, USA
| | - Cliff Han
- DOE Joint Genome Institute and Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|