1
|
Grochowski CM, Bengtsson JD, Du H, Gandhi M, Lun MY, Mehaffey MG, Park K, Höps W, Benito E, Hasenfeld P, Korbel JO, Mahmoud M, Paulin LF, Jhangiani SN, Hwang JP, Bhamidipati SV, Muzny DM, Fatih JM, Gibbs RA, Pendleton M, Harrington E, Juul S, Lindstrand A, Sedlazeck FJ, Pehlivan D, Lupski JR, Carvalho CMB. Inverted triplications formed by iterative template switches generate structural variant diversity at genomic disorder loci. CELL GENOMICS 2024; 4:100590. [PMID: 38908378 PMCID: PMC11293582 DOI: 10.1016/j.xgen.2024.100590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 12/27/2023] [Accepted: 05/31/2024] [Indexed: 06/24/2024]
Abstract
The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a complex genomic rearrangement (CGR). Although it has been identified as an important pathogenic DNA mutation signature in genomic disorders and cancer genomes, its architecture remains unresolved. Here, we studied the genomic architecture of DUP-TRP/INV-DUP by investigating the DNA of 24 patients identified by array comparative genomic hybridization (aCGH) on whom we found evidence for the existence of 4 out of 4 predicted structural variant (SV) haplotypes. Using a combination of short-read genome sequencing (GS), long-read GS, optical genome mapping, and single-cell DNA template strand sequencing (strand-seq), the haplotype structure was resolved in 18 samples. The point of template switching in 4 samples was shown to be a segment of ∼2.2-5.5 kb of 100% nucleotide similarity within inverted repeat pairs. These data provide experimental evidence that inverted low-copy repeats act as recombinant substrates. This type of CGR can result in multiple conformers generating diverse SV haplotypes in susceptible dosage-sensitive loci.
Collapse
Affiliation(s)
| | | | - Haowei Du
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Mira Gandhi
- Pacific Northwest Research Institute, Seattle, WA 98122, USA
| | - Ming Yin Lun
- Pacific Northwest Research Institute, Seattle, WA 98122, USA
| | | | - KyungHee Park
- Pacific Northwest Research Institute, Seattle, WA 98122, USA
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Eva Benito
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Medhat Mahmoud
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Luis F Paulin
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Shalini N Jhangiani
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - James Paul Hwang
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sravya V Bhamidipati
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jawid M Fatih
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Richard A Gibbs
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | | | | | - Sissel Juul
- Oxford Nanopore Technologies, New York, NY 10013, USA
| | - Anna Lindstrand
- Department of Molecular Medicine and Surgery, Karolinska Institutet, 171 76 Stockholm, Sweden; Department of Clinical Genetics and Genomics, Karolinska University Hospital, 171 76 Stockholm, Sweden
| | - Fritz J Sedlazeck
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Computer Science, Rice University, Houston TX 77030, USA
| | - Davut Pehlivan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Section of Neurology and Developmental Neuroscience, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston, TX 77030, USA
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA
| | | |
Collapse
|
2
|
Chatterjee S, Starrett GJ. Microhomology-mediated repair machinery and its relationship with HPV-mediated oncogenesis. J Med Virol 2024; 96:e29674. [PMID: 38757834 DOI: 10.1002/jmv.29674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 04/19/2024] [Accepted: 05/06/2024] [Indexed: 05/18/2024]
Abstract
Human Papillomaviruses (HPV) are a diverse family of non-enveloped dsDNA viruses that infect the skin and mucosal epithelia. Persistent HPV infections can lead to cancer frequently involving integration of the virus into the host genome, leading to sustained oncogene expression and loss of capsid and genome maintenance proteins. Microhomology-mediated double-strand break repair, a DNA double-stranded breaks repair pathway present in many organisms, was initially thought to be a backup but it's now seen as vital, especially in homologous recombination-deficient contexts. Increasing evidence has identified microhomology (MH) near HPV integration junctions, suggesting MH-mediated repair pathways drive integration. In this comprehensive review, we present a detailed summary of both the mechanisms underlying MH-mediated repair and the evidence for its involvement in HPV integration in cancer. Lastly, we highlight the involvement of these processes in the integration of other DNA viruses and the broader implications on virus lifecycles and host innate immune response.
Collapse
Affiliation(s)
- Subhajit Chatterjee
- Laboratory of Cellular Oncology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Gabriel J Starrett
- Laboratory of Cellular Oncology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
3
|
Grochowski CM, Bengtsson JD, Du H, Gandhi M, Lun MY, Mehaffey MG, Park K, Höps W, Benito-Garagorri E, Hasenfeld P, Korbel JO, Mahmoud M, Paulin LF, Jhangiani SN, Muzny DM, Fatih JM, Gibbs RA, Pendleton M, Harrington E, Juul S, Lindstrand A, Sedlazeck FJ, Pehlivan D, Lupski JR, Carvalho CMB. Break-induced replication underlies formation of inverted triplications and generates unexpected diversity in haplotype structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.02.560172. [PMID: 37873367 PMCID: PMC10592851 DOI: 10.1101/2023.10.02.560172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Background The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a type of complex genomic rearrangement (CGR) hypothesized to result from replicative repair of DNA due to replication fork collapse. It is often mediated by a pair of inverted low-copy repeats (LCR) followed by iterative template switches resulting in at least two breakpoint junctions in cis . Although it has been identified as an important mutation signature of pathogenicity for genomic disorders and cancer genomes, its architecture remains unresolved and is predicted to display at least four structural variation (SV) haplotypes. Results Here we studied the genomic architecture of DUP-TRP/INV-DUP by investigating the genomic DNA of 24 patients with neurodevelopmental disorders identified by array comparative genomic hybridization (aCGH) on whom we found evidence for the existence of 4 out of 4 predicted SV haplotypes. Using a combination of short-read genome sequencing (GS), long- read GS, optical genome mapping and StrandSeq the haplotype structure was resolved in 18 samples. This approach refined the point of template switching between inverted LCRs in 4 samples revealing a DNA segment of ∼2.2-5.5 kb of 100% nucleotide similarity. A prediction model was developed to infer the LCR used to mediate the non-allelic homology repair. Conclusions These data provide experimental evidence supporting the hypothesis that inverted LCRs act as a recombinant substrate in replication-based repair mechanisms. Such inverted repeats are particularly relevant for formation of copy-number associated inversions, including the DUP-TRP/INV-DUP structures. Moreover, this type of CGR can result in multiple conformers which contributes to generate diverse SV haplotypes in susceptible loci .
Collapse
|
4
|
Jiang Y. Contribution of Microhomology to Genome Instability: Connection between DNA Repair and Replication Stress. Int J Mol Sci 2022; 23:12937. [PMID: 36361724 PMCID: PMC9657218 DOI: 10.3390/ijms232112937] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 10/23/2022] [Accepted: 10/23/2022] [Indexed: 11/23/2023] Open
Abstract
Microhomology-mediated end joining (MMEJ) is a highly mutagenic pathway to repair double-strand breaks (DSBs). MMEJ was thought to be a backup pathway of homologous recombination (HR) and canonical nonhomologous end joining (C-NHEJ). However, it attracts more attention in cancer research due to its special function of microhomology in many different aspects of cancer. In particular, it is initiated with DNA end resection and upregulated in homologous recombination-deficient cancers. In this review, I summarize the following: (1) the recent findings and contributions of MMEJ to genome instability, including phenotypes relevant to MMEJ; (2) the interaction between MMEJ and other DNA repair pathways; (3) the proposed mechanistic model of MMEJ in DNA DSB repair and a new connection with microhomology-mediated break-induced replication (MMBIR); and (4) the potential clinical application by targeting MMEJ based on synthetic lethality for cancer therapy.
Collapse
Affiliation(s)
- Yuning Jiang
- Department of Radiation Oncology, University of Virginia, Charlottesville, VA 22903, USA
| |
Collapse
|
5
|
Lorenz A, Mpaulo SJ. Gene conversion: a non-Mendelian process integral to meiotic recombination. Heredity (Edinb) 2022; 129:56-63. [PMID: 35393552 PMCID: PMC9273591 DOI: 10.1038/s41437-022-00523-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 03/01/2022] [Accepted: 03/02/2022] [Indexed: 12/30/2022] Open
Abstract
Meiosis is undoubtedly the mechanism that underpins Mendelian genetics. Meiosis is a specialised, reductional cell division which generates haploid gametes (reproductive cells) carrying a single chromosome complement from diploid progenitor cells harbouring two chromosome sets. Through this process, the hereditary material is shuffled and distributed into haploid gametes such that upon fertilisation, when two haploid gametes fuse, diploidy is restored in the zygote. During meiosis the transient physical connection of two homologous chromosomes (one originally inherited from each parent) each consisting of two sister chromatids and their subsequent segregation into four meiotic products (gametes), is what enables genetic marker assortment forming the core of Mendelian laws. The initiating events of meiotic recombination are DNA double-strand breaks (DSBs) which need to be repaired in a certain way to enable the homologous chromosomes to find each other. This is achieved by DSB ends searching for homologous repair templates and invading them. Ultimately, the repair of meiotic DSBs by homologous recombination physically connects homologous chromosomes through crossovers. These physical connections provided by crossovers enable faithful chromosome segregation. That being said, the DSB repair mechanism integral to meiotic recombination also produces genetic transmission distortions which manifest as postmeiotic segregation events and gene conversions. These processes are non-reciprocal genetic exchanges and thus non-Mendelian.
Collapse
Affiliation(s)
- Alexander Lorenz
- Institute of Medical Sciences (IMS), University of Aberdeen, Aberdeen, UK.
| | - Samantha J Mpaulo
- Institute of Medical Sciences (IMS), University of Aberdeen, Aberdeen, UK
| |
Collapse
|
6
|
Jorritsma RN. How Well Does Evolution Explain Endogenous Retroviruses?-A Lakatosian Assessment. Viruses 2021; 14:v14010014. [PMID: 35062218 PMCID: PMC8781664 DOI: 10.3390/v14010014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 12/16/2021] [Accepted: 12/18/2021] [Indexed: 02/06/2023] Open
Abstract
One of the most sophisticated philosophies of science is the methodology of scientific research programmes (MSRP), developed by Imre Lakatos. According to MSRP, scientists are working within so-called research programmes, consisting of a hard core of fixed convictions and a flexible protective belt of auxiliary hypotheses. Anomalies are accommodated by changes to the protective belt that do not affect the hard core. Under MSRP, research programmes are appraised as 'progressive' if they successfully predict novel facts but are judged as 'degenerative' if they merely offer ad hoc solutions to anomalies. This paper applies these criteria to the evolutionary research programme as it has performed during half a century of ERV research. It describes the early history of the field and the emergence of the endogenization-amplification theory on the origins of retroviral-like sequences. It then discusses various predictions and postdictions that were generated by the programme, regarding orthologous ERVs in different species, the presence of target site duplications and the divergence of long terminal repeats, and appraises how the programme has dealt with data that did not conform to initial expectations. It is concluded that the evolutionary research programme has been progressive with regard to the issues here examined.
Collapse
Affiliation(s)
- Ruben N Jorritsma
- Philosophy Group, Wageningen University & Research, 6700 EW Wageningen, The Netherlands
| |
Collapse
|
7
|
Distinct patterns of complex rearrangements and a mutational signature of microhomeology are frequently observed in PLP1 copy number gain structural variants. Genome Med 2019; 11:80. [PMID: 31818324 PMCID: PMC6902434 DOI: 10.1186/s13073-019-0676-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 10/10/2019] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND We investigated the features of the genomic rearrangements in a cohort of 50 male individuals with proteolipid protein 1 (PLP1) copy number gain events who were ascertained with Pelizaeus-Merzbacher disease (PMD; MIM: 312080). We then compared our new data to previous structural variant mutagenesis studies involving the Xq22 region of the human genome. The aggregate data from 159 sequenced join-points (discontinuous sequences in the reference genome that are joined during the rearrangement process) were studied. Analysis of these data from 150 individuals enabled the spectrum and relative distribution of the underlying genomic mutational signatures to be delineated. METHODS Genomic rearrangements in PMD individuals with PLP1 copy number gain events were investigated by high-density customized array or clinical chromosomal microarray analysis and breakpoint junction sequence analysis. RESULTS High-density customized array showed that the majority of cases (33/50; ~ 66%) present with single duplications, although complex genomic rearrangements (CGRs) are also frequent (17/50; ~ 34%). Breakpoint mapping to nucleotide resolution revealed further previously unknown structural and sequence complexities, even in single duplications. Meta-analysis of all studied rearrangements that occur at the PLP1 locus showed that single duplications were found in ~ 54% of individuals and that, among all CGR cases, triplication flanked by duplications is the most frequent CGR array CGH pattern observed. Importantly, in ~ 32% of join-points, there is evidence for a mutational signature of microhomeology (highly similar yet imperfect sequence matches). CONCLUSIONS These data reveal a high frequency of CGRs at the PLP1 locus and support the assertion that replication-based mechanisms are prominent contributors to the formation of CGRs at Xq22. We propose that microhomeology can facilitate template switching, by stabilizing strand annealing of the primer using W-C base complementarity, and is a mutational signature for replicative repair.
Collapse
|
8
|
Song X, Beck CR, Du R, Campbell IM, Coban-Akdemir Z, Gu S, Breman AM, Stankiewicz P, Ira G, Shaw CA, Lupski JR. Predicting human genes susceptible to genomic instability associated with Alu/ Alu-mediated rearrangements. Genome Res 2018; 28:1228-1242. [PMID: 29907612 PMCID: PMC6071635 DOI: 10.1101/gr.229401.117] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Accepted: 06/06/2018] [Indexed: 12/14/2022]
Abstract
Alu elements, the short interspersed element numbering more than 1 million copies per human genome, can mediate the formation of copy number variants (CNVs) between substrate pairs. These Alu/Alu-mediated rearrangements (AAMRs) can result in pathogenic variants that cause diseases. To investigate the impact of AAMR on gene variation and human health, we first characterized Alus that are involved in mediating CNVs (CNV-Alus) and observed that these Alus tend to be evolutionarily younger. We then computationally generated, with the assistance of a supercomputer, a test data set consisting of 78 million Alu pairs and predicted ∼18% of them are potentially susceptible to AAMR. We further determined the relative risk of AAMR in 12,074 OMIM genes using the count of predicted CNV-Alu pairs and experimentally validated the predictions with 89 samples selected by correlating predicted hotspots with a database of CNVs identified by clinical chromosomal microarrays (CMAs) on the genomes of approximately 54,000 subjects. We fine-mapped 47 duplications, 40 deletions, and two complex rearrangements and examined a total of 52 breakpoint junctions of simple CNVs. Overall, 94% of the candidate breakpoints were at least partially Alu mediated. We successfully predicted all (100%) of Alu pairs that mediated deletions (n = 21) and achieved an 87% positive predictive value overall when including AAMR-generated deletions and duplications. We provided a tool, AluAluCNVpredictor, for assessing AAMR hotspots and their role in human disease. These results demonstrate the utility of our predictive model and provide insights into the genomic features and molecular mechanisms underlying AAMR.
Collapse
Affiliation(s)
- Xiaofei Song
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Christine R Beck
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Renqian Du
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Ian M Campbell
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Zeynep Coban-Akdemir
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Shen Gu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Amy M Breman
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Baylor Genetics, Houston, Texas 77021, USA
| | - Pawel Stankiewicz
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Baylor Genetics, Houston, Texas 77021, USA
| | - Grzegorz Ira
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Chad A Shaw
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Baylor Genetics, Houston, Texas 77021, USA
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas 77030, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Texas Children's Hospital, Houston, Texas 77030, USA
| |
Collapse
|
9
|
Cheung SW, Bi W. Novel applications of array comparative genomic hybridization in molecular diagnostics. Expert Rev Mol Diagn 2018; 18:531-542. [PMID: 29848116 DOI: 10.1080/14737159.2018.1479253] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
INTRODUCTION In 2004, the implementation of array comparative genomic hybridization (array comparative genome hybridization [CGH]) into clinical practice marked a new milestone for genetic diagnosis. Array CGH and single-nucleotide polymorphism (SNP) arrays enable genome-wide detection of copy number changes in a high resolution, and therefore microarray has been recognized as the first-tier test for patients with intellectual disability or multiple congenital anomalies, and has also been applied prenatally for detection of clinically relevant copy number variations in the fetus. Area covered: In this review, the authors summarize the evolution of array CGH technology from their diagnostic laboratory, highlighting exonic SNP arrays developed in the past decade which detect small intragenic copy number changes as well as large DNA segments for the region of heterozygosity. The applications of array CGH to human diseases with different modes of inheritance with the emphasis on autosomal recessive disorders are discussed. Expert commentary: An exonic array is a powerful and most efficient clinical tool in detecting genome wide small copy number variants in both dominant and recessive disorders. However, whole-genome sequencing may become the single integrated platform for detection of copy number changes, single-nucleotide changes as well as balanced chromosomal rearrangements in the near future.
Collapse
Affiliation(s)
- Sau W Cheung
- a Department of Molecular and Human Genetics , Baylor College of Medicine , Houston , TX , USA
| | - Weimin Bi
- a Department of Molecular and Human Genetics , Baylor College of Medicine , Houston , TX , USA.,b Baylor Genetics , Houston , TX , USA
| |
Collapse
|
10
|
Hillmer M, Summerer A, Mautner VF, Högel J, Cooper DN, Kehrer-Sawatzki H. Consideration of the haplotype diversity at nonallelic homologous recombination hotspots improves the precision of rearrangement breakpoint identification. Hum Mutat 2017; 38:1711-1722. [PMID: 28862369 DOI: 10.1002/humu.23319] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Revised: 08/07/2017] [Accepted: 08/26/2017] [Indexed: 01/30/2023]
Abstract
Precise characterization of nonallelic homologous recombination (NAHR) breakpoints is key to identifying those features that influence NAHR frequency. Until now, analysis of NAHR-mediated rearrangements has generally been performed by comparison of the breakpoint-spanning sequences with the human genome reference sequence. We show here that the haplotype diversity of NAHR hotspots may interfere with breakpoint-mapping. We studied the transmitting parents of individuals with germline type-1 NF1 deletions mediated by NAHR within the paralogous recombination site 1 (PRS1) or paralogous recombination site 2 (PRS2) hotspots. Several parental wild-type PRS1 and PRS2 haplotypes were identified that exhibited considerable sequence differences with respect to the reference sequence, which also affected the number of predicted PRDM9-binding sites. Sequence comparisons between the parental wild-type PRS1 or PRS2 haplotypes and the deletion breakpoint-spanning sequences from the patients (method #2) turned out to be an accurate means to assign NF1 deletion breakpoints and proved superior to crude reference sequence comparisons that neglect to consider haplotype diversity (method #1). The mean length of the deletion breakpoint regions assigned by method #2 was 269-bp in contrast to 502-bp by method #1. Our findings imply that paralog-specific haplotype diversity of NAHR hotspots (such as PRS2) and population-specific haplotype diversity must be taken into account in order to accurately ascertain NAHR-mediated rearrangement breakpoints.
Collapse
Affiliation(s)
- Morten Hillmer
- Institute of Human Genetics, University of Ulm, Ulm, Germany
| | - Anna Summerer
- Institute of Human Genetics, University of Ulm, Ulm, Germany
| | - Victor-Felix Mautner
- Department of Neurology, University Hospital Hamburg Eppendorf, Hamburg, Germany
| | - Josef Högel
- Institute of Human Genetics, University of Ulm, Ulm, Germany
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | | |
Collapse
|
11
|
Jahic A, Hinreiner S, Emberger W, Hehr U, Zuchner S, Beetz C. Doublet-Mediated DNA Rearrangement-A Novel and Potentially Underestimated Mechanism for the Formation of Recurrent Pathogenic Deletions. Hum Mutat 2016; 38:275-278. [PMID: 28008689 DOI: 10.1002/humu.23162] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2016] [Revised: 12/02/2016] [Accepted: 12/14/2016] [Indexed: 11/09/2022]
Abstract
Deletions and duplications of genomic DNA contribute to evolution, phenotypic diversity, and human disease. The underlying mechanisms are incompletely understood. We identified deletions of exon 10 of the SPAST gene in two unrelated families with hereditary spastic paraplegia. We excluded a founder event, but observed that the breakpoints map to identical repeat regions. These regions likely represent an intragenic "doublet," that is, an enigmatic class of local duplications. The fusion sequences for both deletions are compatible with recombination-based as well as with replication-based mechanisms. Searching the literature, we identified a partial SLC24A4 deletion that involved two copies of another doublet, and was likely formed in an analogous way. Comparing the SPAST and the SLC24A4 doublets with doublets identified previously suggested that many additional doublets have a high potential for triggering rearrangements. Considering that doublets are still being formed in the human genome, and that they likely create high local instability, we suggest that a two-step mechanism consisting of doublet generation and subsequent doublet-mediated deletion/duplication may underlie certain copy-number changes for which other mechanisms are currently assumed. Further studies are necessary to delineate the significance of the thus-far understudied doublets for the formation of copy-number variation.
Collapse
Affiliation(s)
- Amir Jahic
- Department of Clinical Chemistry and Laboratory Medicine, Jena University Hospital, Jena, Germany
| | - Sophie Hinreiner
- Department of Human Genetics, University of Regensburg, Regensburg, Germany
| | - Werner Emberger
- Department of Human Genetics, Graz Medical University, Graz, Austria
| | - Ute Hehr
- Department of Human Genetics, University of Regensburg, Regensburg, Germany
| | - Stephan Zuchner
- John T. Macdonald Department of Human Genetics and John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida
| | - Christian Beetz
- Department of Clinical Chemistry and Laboratory Medicine, Jena University Hospital, Jena, Germany
| |
Collapse
|
12
|
Duplication of chicken defensin7 gene generated by gene conversion and homologous recombination. Proc Natl Acad Sci U S A 2016; 113:13815-13820. [PMID: 27849592 DOI: 10.1073/pnas.1616948113] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Defensins constitute an evolutionary conserved family of cationic antimicrobial peptides that play a key role in host innate immune responses to infection. Defensin genes generally reside in complex genomic regions that are prone to structural variation, and defensin genes exhibit extensive copy number variation in humans and in other species. Copy number variation of defensin genes was examined in inbred lines of Leghorn and Fayoumi chickens, and a duplication of defensin7 was discovered in the Fayoumi breed. Analysis of junction sequences confirmed the occurrence of a simple tandem duplication of defensin7 with sequence identity at the junction, suggesting nonallelic homologous recombination between defensin7 and defensin6 The duplication event generated two chimeric promoters that are best explained by gene conversion followed by homologous recombination. Expression of defensin7 was not elevated in animals with two genes despite both genes being transcribed in the tissues examined. Computational prediction of promoter regions revealed the presence of several putative transcription factor binding sites generated by the duplication event. These data provide insight into the evolution and possible function of large gene families and specifically, the defensins.
Collapse
|
13
|
Guo X, Delio M, Haque N, Castellanos R, Hestand MS, Vermeesch JR, Morrow BE, Zheng D. Variant discovery and breakpoint region prediction for studying the human 22q11.2 deletion using BAC clone and whole genome sequencing analysis. Hum Mol Genet 2016; 25:3754-3767. [PMID: 27436579 DOI: 10.1093/hmg/ddw221] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Revised: 06/28/2016] [Accepted: 07/01/2016] [Indexed: 11/13/2022] Open
Abstract
Velo-cardio-facial syndrome/DiGeorge syndrome/22q11.2 deletion syndrome (22q11.2DS) is caused by meiotic non-allelic homologous recombination events between flanking low copy repeats termed LCR22A and LCR22D, resulting in a 3 million base pair (Mb) deletion. Due to their complex structure, large size and high sequence identity, genetic variation within LCR22s among different individuals has not been well characterized. In this study, we sequenced 13 BAC clones derived from LCR22A/D and aligned them with 15 previously available BAC sequences to create a new genetic variation map. The thousands of variants identified by this analysis were not uniformly distributed in the two LCR22s. Moreover, shared single nucleotide variants between LCR22A and LCR22D were enriched in the Breakpoint Cluster Region pseudogene (BCRP) block, suggesting the existence of a possible recombination hotspot there. Interestingly, breakpoints for atypical 22q11.2 rearrangements have previously been located to BCRPs To further explore this finding, we carried out in-depth analyses of whole genome sequence (WGS) data from two unrelated probands harbouring a de novo 3Mb 22q11.2 deletion and their normal parents. By focusing primarily on WGS reads uniquely mapped to LCR22A, using the variation map from our BAC analysis to help resolve allele ambiguity, and by performing PCR analysis, we infer that the deletion breakpoints were most likely located near or within the BCRP module. In summary, we found a high degree of sequence variation in LCR22A and LCR22D and a potential recombination breakpoint near or within the BCRP block, providing a starting point for future breakpoint mapping using additional trios.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Deyou Zheng
- Department of Neurology .,Department of Genetics.,Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| |
Collapse
|
14
|
De novo intrachromosomal gene conversion from OPN1MW to OPN1LW in the male germline results in Blue Cone Monochromacy. Sci Rep 2016; 6:28253. [PMID: 27339364 PMCID: PMC4919619 DOI: 10.1038/srep28253] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 06/01/2016] [Indexed: 12/21/2022] Open
Abstract
X-linked cone dysfunction disorders such as Blue Cone Monochromacy and X-linked Cone Dystrophy are characterized by complete loss (of) or reduced L- and M- cone function due to defects in the OPN1LW/OPN1MW gene cluster. Here we investigated 24 affected males from 16 families with either a structurally intact gene cluster or at least one intact single (hybrid) gene but harbouring rare combinations of common SNPs in exon 3 in single or multiple OPN1LW and OPN1MW gene copies. We assessed twelve different OPN1LW/MW exon 3 haplotypes by semi-quantitative minigene splicing assay. Nine haplotypes resulted in aberrant splicing of ≥20% of transcripts including the known pathogenic haplotypes (i.e. ‘LIAVA’, ‘LVAVA’) with absent or minute amounts of correctly spliced transcripts, respectively. De novo formation of the ‘LIAVA’ haplotype derived from an ancestral less deleterious ‘LIAVS’ haplotype was observed in one family with strikingly different phenotypes among affected family members. We could establish intrachromosomal gene conversion in the male germline as underlying mechanism. Gene conversion in the OPN1LW/OPN1MW genes has been postulated, however, we are first to demonstrate a de novo gene conversion within the lineage of a pedigree.
Collapse
|
15
|
Günther S, Elert-Dobkowska E, Soehn AS, Hinreiner S, Yoon G, Heller R, Hellenbroich Y, Hübner CA, Ray PN, Hehr U, Bauer P, Sulek A, Beetz C. High Frequency of Pathogenic Rearrangements in SPG11 and Extensive Contribution of Mutational Hotspots and Founder Alleles. Hum Mutat 2016; 37:703-9. [PMID: 27071356 DOI: 10.1002/humu.23000] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2015] [Revised: 03/11/2016] [Accepted: 03/29/2016] [Indexed: 12/27/2022]
Abstract
Biallelic loss-of-function mutations in SPG11 cause a wide spectrum of recessively inherited, neurodegenerative disorders including hereditary spastic paraplegia (HSP), amyotrophic lateral sclerosis, and Charcot-Marie-Tooth disease. By comprehensive screening of three large cohorts of HSP index patients, we identified 83 alleles with "small" mutations and 13 alleles that carry large genomic rearrangements. Including relevant data from previous studies, we estimate that copy number variants (CNVs) account for ∼19% of pathogenic SPG11 alleles. The breakpoints for all novel and some previously reported CNVs were determined by long-range PCR and sequencing. This revealed several Alu-associated recombination hotspots. We also found evidence for additional mutational mechanisms, including for a two-step event in which an Alu retrotransposition preceded the actual rearrangement. Apparently independent samples with identical breakpoints were analyzed by microsatellite PCRs. The resulting haplotypes suggested the existence of two rearrangement founder alleles. Our findings widen the spectra of mutations and mutational mechanisms in SPG11, underscore the pivotal role played by Alus, and are of high diagnostic relevance for a wide spectrum of clinical phenotypes including the most frequent form of recessive HSP.
Collapse
Affiliation(s)
- Sven Günther
- Department of Clinical Chemistry and Laboratory Medicine, Jena University Hospital, Jena, Germany
| | | | - Anne S Soehn
- Institute of Medical Genetics and Applied Genomics, University Hospital of Tuebingen, Tuebingen, Germany
| | - Sophie Hinreiner
- Center for Human Genetics, and Department of Human Genetics, University of Regensburg, Regensburg, Germany
| | - Grace Yoon
- Division of Clinical and Metabolic Genetics, Department of Paediatrics, University of Toronto, The Hospital for Sick Children, Toronto, Canada
| | - Raoul Heller
- Institute of Human Genetics, University Hospital of Cologne, Cologne, Germany
| | | | | | - Peter N Ray
- Department of Paediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Ute Hehr
- Center for Human Genetics, and Department of Human Genetics, University of Regensburg, Regensburg, Germany
| | - Peter Bauer
- Institute of Medical Genetics and Applied Genomics, University Hospital of Tuebingen, Tuebingen, Germany
| | - Anna Sulek
- Department of Genetics, Institute of Psychiatry and Neurology, Warsaw, Poland
| | - Christian Beetz
- Department of Clinical Chemistry and Laboratory Medicine, Jena University Hospital, Jena, Germany
| |
Collapse
|
16
|
Gu S, Yuan B, Campbell IM, Beck CR, Carvalho CMB, Nagamani SCS, Erez A, Patel A, Bacino CA, Shaw CA, Stankiewicz P, Cheung SW, Bi W, Lupski JR. Alu-mediated diverse and complex pathogenic copy-number variants within human chromosome 17 at p13.3. Hum Mol Genet 2015; 24:4061-77. [PMID: 25908615 DOI: 10.1093/hmg/ddv146] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 04/20/2015] [Indexed: 01/05/2023] Open
Abstract
Alu repetitive elements are known to be major contributors to genome instability by generating Alu-mediated copy-number variants (CNVs). Most of the reported Alu-mediated CNVs are simple deletions and duplications, and the mechanism underlying Alu-Alu-mediated rearrangement has been attributed to non-allelic homologous recombination (NAHR). Chromosome 17 at the p13.3 genomic region lacks extensive low-copy repeat architecture; however, it is highly enriched for Alu repetitive elements, with a fraction of 30% of total sequence annotated in the human reference genome, compared with the 10% genome-wide and 18% on chromosome 17. We conducted mechanistic studies of the 17p13.3 CNVs by performing high-density oligonucleotide array comparative genomic hybridization, specifically interrogating the 17p13.3 region with ∼150 bp per probe density; CNV breakpoint junctions were mapped to nucleotide resolution by polymerase chain reaction and Sanger sequencing. Studied rearrangements include 5 interstitial deletions, 14 tandem duplications, 7 terminal deletions and 13 complex genomic rearrangements (CGRs). Within the 17p13.3 region, Alu-Alu-mediated rearrangements were identified in 80% of the interstitial deletions, 46% of the tandem duplications and 50% of the CGRs, indicating that this mechanism was a major contributor for formation of breakpoint junctions. Our studies suggest that Alu repetitive elements facilitate formation of non-recurrent CNVs, CGRs and other structural aberrations of chromosome 17 at p13.3. The common observation of Alu-mediated rearrangement in CGRs and breakpoint junction sequences analysis further demonstrates that this type of mechanism is unlikely attributed to NAHR, but rather may be due to a recombination-coupled DNA replicative repair process.
Collapse
Affiliation(s)
- Shen Gu
- Department of Molecular & Human Genetics
| | - Bo Yuan
- Department of Molecular & Human Genetics
| | | | | | | | - Sandesh C S Nagamani
- Department of Molecular & Human Genetics, Texas Children's Hospital, Houston, TX 77030, USA and
| | - Ayelet Erez
- Department of Molecular & Human Genetics, Department of Biological Regulation, Weizmann Institute of Science, Rehovot, Israel
| | | | - Carlos A Bacino
- Department of Molecular & Human Genetics, Texas Children's Hospital, Houston, TX 77030, USA and
| | | | | | | | - Weimin Bi
- Department of Molecular & Human Genetics
| | - James R Lupski
- Department of Molecular & Human Genetics, Department of Pediatrics and Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA, Texas Children's Hospital, Houston, TX 77030, USA and
| |
Collapse
|
17
|
Pratto F, Brick K, Khil P, Smagulova F, Petukhova GV, Camerini-Otero RD. DNA recombination. Recombination initiation maps of individual human genomes. Science 2014; 346:1256442. [PMID: 25395542 DOI: 10.1126/science.1256442] [Citation(s) in RCA: 190] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
DNA double-strand breaks (DSBs) are introduced in meiosis to initiate recombination and generate crossovers, the reciprocal exchanges of genetic material between parental chromosomes. Here, we present high-resolution maps of meiotic DSBs in individual human genomes. Comparing DSB maps between individuals shows that along with DNA binding by PRDM9, additional factors may dictate the efficiency of DSB formation. We find evidence for both GC-biased gene conversion and mutagenesis around meiotic DSB hotspots, while frequent colocalization of DSB hotspots with chromosome rearrangement breakpoints implicates the aberrant repair of meiotic DSBs in genomic disorders. Furthermore, our data indicate that DSB frequency is a major determinant of crossover rate. These maps provide new insights into the regulation of meiotic recombination and the impact of meiotic recombination on genome function.
Collapse
Affiliation(s)
- Florencia Pratto
- National Institute of Diabetes, Digestive and Kidney Diseases, NIH, Bethesda, MD, USA
| | - Kevin Brick
- National Institute of Diabetes, Digestive and Kidney Diseases, NIH, Bethesda, MD, USA
| | - Pavel Khil
- National Institute of Diabetes, Digestive and Kidney Diseases, NIH, Bethesda, MD, USA
| | - Fatima Smagulova
- Department of Biochemistry and Molecular Biology, Uniformed Services University of Health Sciences, Bethesda, MD, USA
| | - Galina V Petukhova
- Department of Biochemistry and Molecular Biology, Uniformed Services University of Health Sciences, Bethesda, MD, USA.
| | | |
Collapse
|
18
|
Campbell IM, Gambin T, Dittwald P, Beck CR, Shuvarikov A, Hixson P, Patel A, Gambin A, Shaw CA, Rosenfeld JA, Stankiewicz P. Human endogenous retroviral elements promote genome instability via non-allelic homologous recombination. BMC Biol 2014; 12:74. [PMID: 25246103 PMCID: PMC4195946 DOI: 10.1186/s12915-014-0074-4] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Accepted: 09/11/2014] [Indexed: 11/13/2022] Open
Abstract
Background Recurrent rearrangements of the human genome resulting in disease or variation are mainly mediated by non-allelic homologous recombination (NAHR) between low-copy repeats. However, other genomic structures, including AT-rich palindromes and retroviruses, have also been reported to underlie recurrent structural rearrangements. Notably, recurrent deletions of Yq12 conveying azoospermia, as well as non-pathogenic reciprocal duplications, are mediated by human endogenous retroviral elements (HERVs). We hypothesized that HERV elements throughout the genome can serve as substrates for genomic instability and result in human copy-number variation (CNV). Results We developed parameters to identify HERV elements similar to those that mediate Yq12 rearrangements as well as recurrent deletions of 3q13.2q13.31. We used these parameters to identify HERV pairs genome-wide that may cause instability. Our analysis highlighted 170 pairs, flanking 12.1% of the genome. We cross-referenced these predicted susceptibility regions with CNVs from our clinical databases for potentially HERV-mediated rearrangements and identified 78 CNVs. We subsequently molecularly confirmed recurrent deletion and duplication rearrangements at four loci in ten individuals, including reciprocal rearrangements at two loci. Breakpoint sequencing revealed clustering in regions of high sequence identity enriched in PRDM9-mediated recombination hotspot motifs. Conclusions The presence of deletions and reciprocal duplications suggests NAHR as the causative mechanism of HERV-mediated CNV, even though the length and the sequence homology of the HERV elements are less than currently thought to be required for NAHR. We propose that in addition to HERVs, other repetitive elements, such as long interspersed elements, may also be responsible for the formation of recurrent CNVs via NAHR. Electronic supplementary material The online version of this article (doi:10.1186/s12915-014-0074-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Paweł Stankiewicz
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Rm ABBR-R809, Houston, TX, USA.
| |
Collapse
|
19
|
Seplyarskiy VB, Logacheva MD, Penin AA, Baranova MA, Leushkin EV, Demidenko NV, Klepikova AV, Kondrashov FA, Kondrashov AS, James TY. Crossing-over in a hypervariable species preferentially occurs in regions of high local similarity. Mol Biol Evol 2014; 31:3016-25. [PMID: 25135947 PMCID: PMC4209137 DOI: 10.1093/molbev/msu242] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Recombination between double-stranded DNA molecules is a key genetic process which occurs in a wide variety of organisms. Usually, crossing-over (CO) occurs during meiosis between genotypes with 98.0–99.9% sequence identity, because within-population nucleotide diversity only rarely exceeds 2%. However, some species are hypervariable and it is unclear how CO can occur between genotypes with less than 90% sequence identity. Here, we study CO in Schizophyllum commune, a hypervariable cosmopolitan basidiomycete mushroom, a frequently encountered decayer of woody substrates. We crossed two haploid individuals, from the United States and from Russia, and obtained genome sequences for their 17 offspring. The average genetic distance between the parents was 14%, making it possible to study CO at very high resolution. We found reduced levels of linkage disequilibrium between loci flanking the CO sites indicating that they are mostly confined to hotspots of recombination. Furthermore, CO events preferentially occurred in regions under stronger negative selection, in particular within exons that showed reduced levels of nucleotide diversity. Apparently, in hypervariable species CO must avoid regions of higher divergence between the recombining genomes due to limitations imposed by the mismatch repair system, with regions under strong negative selection providing the opportunity for recombination. These patterns are opposite to those observed in a number of less variable species indicating that population genomics of hypervariable species may reveal novel biological phenomena.
Collapse
Affiliation(s)
- Vladimir B Seplyarskiy
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Institute of Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Maria D Logacheva
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Institute of Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Aleksey A Penin
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Institute of Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia Department of Biology, Lomonosov Moscow State University, Moscow, Moscow, Russia
| | - Maria A Baranova
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Institute of Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Evgeny V Leushkin
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Institute of Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Natalia V Demidenko
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Department of Biology, Lomonosov Moscow State University, Moscow, Moscow, Russia
| | - Anna V Klepikova
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Department of Biology, Lomonosov Moscow State University, Moscow, Moscow, Russia
| | - Fyodor A Kondrashov
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG) Barcelona, Spain Universitat Pompeu Fabra (UPF), Barcelona, Spain Institució Catalana de Recerca i Estudis Avançats (ICREA), 23 Pg. Lluís Companys, Barcelona, Spain
| | - Alexey S Kondrashov
- School of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Timothy Y James
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| |
Collapse
|
20
|
The Alu-rich genomic architecture of SPAST predisposes to diverse and functionally distinct disease-associated CNV alleles. Am J Hum Genet 2014; 95:143-61. [PMID: 25065914 DOI: 10.1016/j.ajhg.2014.06.014] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Accepted: 06/30/2014] [Indexed: 01/27/2023] Open
Abstract
Intragenic copy-number variants (CNVs) contribute to the allelic spectrum of both Mendelian and complex disorders. Although pathogenic deletions and duplications in SPAST (mutations in which cause autosomal-dominant spastic paraplegia 4 [SPG4]) have been described, their origins and molecular consequences remain obscure. We mapped breakpoint junctions of 54 SPAST CNVs at nucleotide resolution. Diverse combinations of exons are deleted or duplicated, highlighting the importance of particular exons for spastin function. Of the 54 CNVs, 38 (70%) appear to be mediated by an Alu-based mechanism, suggesting that the Alu-rich genomic architecture of SPAST renders this locus susceptible to various genome rearrangements. Analysis of breakpoint Alus further informs a model of Alu-mediated CNV formation characterized by small CNV size and potential involvement of mechanisms other than homologous recombination. Twelve deletions (22%) overlap part of SPAST and a portion of a nearby, directly oriented gene, predicting novel chimeric genes in these subjects' genomes. cDNA from a subject with a SPAST final exon deletion contained multiple SPAST:SLC30A6 fusion transcripts, indicating that SPAST CNVs can have transcriptional effects beyond the gene itself. SLC30A6 has been implicated in Alzheimer disease, so these fusion gene data could explain a report of spastic paraplegia and dementia cosegregating in a family with deletion of the final exon of SPAST. Our findings provide evidence that the Alu genomic architecture of SPAST predisposes to diverse CNV alleles with distinct transcriptional--and possibly phenotypic--consequences. Moreover, we provide further mechanistic insights into Alu-mediated copy-number change that are extendable to other loci.
Collapse
|
21
|
Abstract
Gene conversion (conversion), the unidirectional transfer of DNA sequence information, occurs as a byproduct of recombinational repair of broken or damaged DNA molecules. Whereas excision repair processes replace damaged DNA by copying the complementary sequence from the undamaged strand of duplex DNA, recombinational mechanisms copy similar sequence, usually in another molecule, to replace the damaged sequence. In mitotic cells the other molecule is usually a sister chromatid, and the repair does not lead to genetic change. Less often a homologous chromosome or homologous sequence in an ectopic position is used. Conversion results from repair in two ways. First, if there was a double-strand gap at the site of a break, homologous sequence will be used as the template for synthesis to fill the gap, thus transferring sequence information in both strands. Second, recombinational repair uses complementary base pairing, and the heteroduplex molecule so formed is a source of conversion, both as heteroduplex and when donor (undamaged template) information is retained after correction of mismatched bases in heteroduplex. There are mechanisms that favour the use of sister molecules that must fail before ectopic homology can be used. Meiotic recombination events lead to the formation of crossovers required in meiosis for orderly segregation of pairs of homologous chromosomes. These events result from recombinational repair of programmed double-strand breaks, but in contrast with mitotic recombination, meiotic recombinational events occur predominantly between homologous chromosomes, so that transfer of sequence differences by conversion is very frequent. Transient recombination events that do not form crossovers form both between homologous chromosomes and between regions of ectopic homology, and leave their mark in the occurrence of frequent non-crossover conversion, including ectopic conversion.
Collapse
|
22
|
MacArthur JAL, Spector TD, Lindsay SJ, Mangino M, Gill R, Small KS, Hurles ME. The rate of nonallelic homologous recombination in males is highly variable, correlated between monozygotic twins and independent of age. PLoS Genet 2014; 10:e1004195. [PMID: 24603440 PMCID: PMC3945173 DOI: 10.1371/journal.pgen.1004195] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Accepted: 01/06/2014] [Indexed: 01/23/2023] Open
Abstract
Nonallelic homologous recombination (NAHR) between highly similar duplicated sequences generates chromosomal deletions, duplications and inversions, which can cause diverse genetic disorders. Little is known about interindividual variation in NAHR rates and the factors that influence this. We estimated the rate of deletion at the CMT1A-REP NAHR hotspot in sperm DNA from 34 male donors, including 16 monozygotic (MZ) co-twins (8 twin pairs) aged 24 to 67 years old. The average NAHR rate was 3.5×10−5 with a seven-fold variation across individuals. Despite good statistical power to detect even a subtle correlation, we observed no relationship between age of unrelated individuals and the rate of NAHR in their sperm, likely reflecting the meiotic-specific origin of these events. We then estimated the heritability of deletion rate by calculating the intraclass correlation (ICC) within MZ co-twins, revealing a significant correlation between MZ co-twins (ICC = 0.784, p = 0.0039), with MZ co-twins being significantly more correlated than unrelated pairs. We showed that this heritability cannot be explained by variation in PRDM9, a known regulator of NAHR, or variation within the NAHR hotspot itself. We also did not detect any correlation between Body Mass Index (BMI), smoking status or alcohol intake and rate of NAHR. Our results suggest that other, as yet unidentified, genetic or environmental factors play a significant role in the regulation of NAHR and are responsible for the extensive variation in the population for the probability of fathering a child with a genomic disorder resulting from a pathogenic deletion. Many genetic disorders are caused by deletions of specific regions of DNA in sperm or egg cells that go on to produce a child. This can occur through ectopic homologous recombination between highly similar segments of DNA at different positions within the genome. Little is known about the differences in rates of deletion between individuals or the factors that influence this. We analysed the rate of deletion at one such section of DNA in sperm DNA from 34 male donors, including 16 monozygotic co-twins. We observed a seven-fold variation in deletion rate across individuals. Deletion rate is significantly correlated between monozygote co-twins, indicating that deletion rate is heritable. This heritability cannot be explained by age, any known genetic regulator of deletion rate, Body Mass Index, smoking status or alcohol intake. Our results suggest that other, as yet unidentified, genetic or environmental factors play a significant role in the regulation of deletion. These factors are responsible for the extensive variation in the population for the probability of fathering a child with a genomic disorder resulting from a pathogenic deletion.
Collapse
Affiliation(s)
| | - Timothy D. Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, St. Thomas' Hospital, London, United Kingdom
| | - Sarah J. Lindsay
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Massimo Mangino
- Department of Twin Research and Genetic Epidemiology, King's College London, St. Thomas' Hospital, London, United Kingdom
| | - Raj Gill
- Department of Twin Research and Genetic Epidemiology, King's College London, St. Thomas' Hospital, London, United Kingdom
| | - Kerrin S. Small
- Department of Twin Research and Genetic Epidemiology, King's College London, St. Thomas' Hospital, London, United Kingdom
| | - Matthew E. Hurles
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
23
|
Bengesser K, Vogt J, Mussotter T, Mautner VF, Messiaen L, Cooper DN, Kehrer-Sawatzki H. Analysis of crossover breakpoints yields new insights into the nature of the gene conversion events associated with large NF1 deletions mediated by nonallelic homologous recombination. Hum Mutat 2013; 35:215-26. [PMID: 24186807 DOI: 10.1002/humu.22473] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 10/18/2013] [Indexed: 12/31/2022]
Abstract
Large NF1 deletions are mediated by nonallelic homologous recombination (NAHR). An in-depth analysis of gene conversion operating in the breakpoint-flanking regions of large NF1 deletions was performed to investigate whether the rate of discontinuous gene conversion during NAHR with crossover is increased, as has been previously noted in NAHR-mediated rearrangements. All 20 germline type-1 NF1 deletions analyzed were mediated by NAHR associated with continuous gene conversion within the breakpoint-flanking regions. Continuous gene conversion was also observed in 31/32 type-2 NF1 deletions investigated. In contrast to the meiotic type-1 NF1 deletions, type-2 NF1 deletions are predominantly of post-zygotic origin. Our findings therefore imply that the mitotic as well as the meiotic NAHR intermediates of large NF1 deletions are processed by long-patch mismatch repair (MMR), thereby ensuring gene conversion tract continuity instead of the discontinuous gene conversion that is characteristic of short-patch repair. However, the single type-2 NF1 deletion not exhibiting continuous gene conversion was processed without MMR, yielding two different deletion-bearing chromosomes, which were distinguishable in terms of their breakpoint positions. Our findings indicate that MMR failure during NAHR, followed by post-meiotic/mitotic segregation, has the potential to give rise to somatic mosaicism in human genomic rearrangements by generating breakpoint heterogeneity.
Collapse
|
24
|
Shuvarikov A, Campbell IM, Dittwald P, Neill NJ, Bialer MG, Moore C, Wheeler PG, Wallace SE, Hannibal MC, Murray MF, Giovanni MA, Terespolsky D, Sodhi S, Cassina M, Viskochil D, Moghaddam B, Herman K, Brown CW, Beck CR, Gambin A, Cheung SW, Patel A, Lamb AN, Shaffer LG, Ellison JW, Ravnan JB, Stankiewicz P, Rosenfeld JA. Recurrent HERV-H-mediated 3q13.2-q13.31 deletions cause a syndrome of hypotonia and motor, language, and cognitive delays. Hum Mutat 2013; 34:1415-23. [PMID: 23878096 DOI: 10.1002/humu.22384] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 07/11/2013] [Indexed: 11/09/2022]
Abstract
We describe the molecular and clinical characterization of nine individuals with recurrent, 3.4-Mb, de novo deletions of 3q13.2-q13.31 detected by chromosomal microarray analysis. All individuals have hypotonia and language and motor delays; they variably express mild to moderate cognitive delays (8/9), abnormal behavior (7/9), and autism spectrum disorders (3/9). Common facial features include downslanting palpebral fissures with epicanthal folds, a slightly bulbous nose, and relative macrocephaly. Twenty-eight genes map to the deleted region, including four strong candidate genes, DRD3, ZBTB20, GAP43, and BOC, with important roles in neural and/or muscular development. Analysis of the breakpoint regions based on array data revealed directly oriented human endogenous retrovirus (HERV-H) elements of ~5 kb in size and of >95% DNA sequence identity flanking the deletion. Subsequent DNA sequencing revealed different deletion breakpoints and suggested nonallelic homologous recombination (NAHR) between HERV-H elements as a mechanism of deletion formation, analogous to HERV-I-flanked and NAHR-mediated AZFa deletions. We propose that similar HERV elements may also mediate other recurrent deletion and duplication events on a genome-wide scale. Observation of rare recurrent chromosomal events such as these deletions helps to further the understanding of mechanisms behind naturally occurring variation in the human genome and its contribution to genetic disease.
Collapse
Affiliation(s)
- Andrey Shuvarikov
- Signature Genomic Laboratories, PerkinElmer, Inc, Spokane, Washington
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Fawcett JA, Innan H. The role of gene conversion in preserving rearrangement hotspots in the human genome. Trends Genet 2013; 29:561-8. [PMID: 23953668 DOI: 10.1016/j.tig.2013.07.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Revised: 06/20/2013] [Accepted: 07/08/2013] [Indexed: 11/27/2022]
Abstract
Hotspots of non-allelic homologous recombination (NAHR) have a crucial role in creating genetic diversity and are also associated with dozens of genomic disorders. Recent studies suggest that many human NAHR hotspots have been preserved throughout the evolution of primates. NAHR hotspots are likely to remain active as long as the segmental duplications (SDs) promoting NAHR retain sufficient similarity. Here, we propose an evolutionary model of SDs that incorporates the effect of gene conversion and compare it with a null model that assumes SDs evolve independently without gene conversion. The gene conversion model predicts a much longer lifespan of NAHR hotspots compared with the null model. We show that the literature on copy number variants (CNVs) and genomic disorders, and also the results of additional analysis of CNVs, are all more consistent with the gene conversion model.
Collapse
Affiliation(s)
- Jeffrey A Fawcett
- Graduate University for Advanced Studies, Hayama, Kanagawa 240-0193, Japan
| | | |
Collapse
|
26
|
Weier C, Haffner MC, Mosbruger T, Esopi DM, Hicks J, Zheng Q, Fedor H, Isaacs WB, De Marzo AM, Nelson WG, Yegnasubramanian S. Nucleotide resolution analysis of TMPRSS2 and ERG rearrangements in prostate cancer. J Pathol 2013; 230:174-83. [PMID: 23447416 DOI: 10.1002/path.4186] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2012] [Revised: 02/08/2013] [Accepted: 02/22/2013] [Indexed: 01/01/2023]
Abstract
TMPRSS2-ERG rearrangements occur in approximately 50% of prostate cancers and therefore represent one of the most frequently observed structural rearrangements in all cancers. However, little is known about the genomic architecture of such rearrangements. We therefore designed and optimized a pipeline involving target capture of TMPRSS2 and ERG genomic sequences coupled with paired-end next-generation sequencing to resolve genomic rearrangement breakpoints in TMPRSS2 and ERG at nucleotide resolution in a large series of primary prostate cancer specimens (n = 83). This strategy showed > 90% sensitivity and specificity in identifying TMPRSS2-ERG rearrangements, and allowed identification of intra- and inter-chromosomal rearrangements involving TMPRSS2 and ERG with known and novel fusion partners. Our results indicate that rearrangement breakpoints show strong clustering in specific intronic regions of TMPRSS2 and ERG. The observed TMPRSS2-ERG rearrangements often exhibited complex chromosomal architecture associated with several intra- and inter-chromosomal rearrangements. Nucleotide resolution analysis of breakpoint junctions revealed that the majority of TMPRSS2 and ERG rearrangements (~88%) occurred at or near regions of microhomology or involved insertions of one or more base pairs. This architecture implicates non-homologous end joining (NHEJ) and microhomology-mediated end joining (MMEJ) pathways in the generation of such rearrangements. These analyses have provided important insights into the molecular mechanisms involved in generating prostate cancer-specific recurrent rearrangements.
Collapse
Affiliation(s)
- Christopher Weier
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University, Baltimore, MD 21231, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Katju V. In with the old, in with the new: the promiscuity of the duplication process engenders diverse pathways for novel gene creation. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:341932. [PMID: 23008799 PMCID: PMC3449122 DOI: 10.1155/2012/341932] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Accepted: 06/03/2012] [Indexed: 01/26/2023]
Abstract
The gene duplication process has exhibited far greater promiscuity in the creation of paralogs with novel exon-intron structures than anticipated even by Ohno. In this paper I explore the history of the field, from the neo-Darwinian synthesis through Ohno's formulation of the canonical model for the evolution of gene duplicates and culminating in the present genomic era. I delineate the major tenets of Ohno's model and discuss its failure to encapsulate the full complexity of the duplication process as revealed in the era of genomics. I discuss the diverse classes of paralogs originating from both DNA- and RNA-mediated duplication events and their evolutionary potential for assuming radically altered functions, as well as the degree to which they can function unconstrained from the pressure of gene conversion. Lastly, I explore theoretical population-genetic considerations of how the effective population size (N(e)) of a species may influence the probability of emergence of genes with radically altered functions.
Collapse
Affiliation(s)
- Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
| |
Collapse
|
28
|
Vogt J, Mussotter T, Bengesser K, Claes K, Högel J, Chuzhanova N, Fu C, van den Ende J, Mautner VF, Cooper DN, Messiaen L, Kehrer-Sawatzki H. Identification of recurrent type-2 NF1 microdeletions reveals a mitotic nonallelic homologous recombination hotspot underlying a human genomic disorder. Hum Mutat 2012; 33:1599-609. [PMID: 22837079 DOI: 10.1002/humu.22171] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 07/11/2012] [Indexed: 01/08/2023]
Abstract
Nonallelic homologous recombination (NAHR) is one of the major mechanisms underlying copy number variation in the human genome. Although several disease-associated meiotic NAHR breakpoints have been analyzed in great detail, hotspots for mitotic NAHR are not well characterized. Type-2 NF1 microdeletions, which are predominantly of postzygotic origin, constitute a highly informative model with which to investigate the features of mitotic NAHR. Here, a custom-designed MLPA- and PCR-based approach was used to identify 23 novel NAHR-mediated type-2 NF1 deletions. Breakpoint analysis of these 23 type-2 deletions, together with 17 NAHR-mediated type-2 deletions identified previously, revealed that the breakpoints are nonuniformly distributed within the paralogous SUZ12 and SUZ12P sequences. Further, the analysis of this large group of type-2 deletions revealed breakpoint recurrence within short segments (ranging in size from 57 to 253-bp) as well as the existence of a novel NAHR hotspot of 1.9-kb (termed PRS4). This hotspot harbored 20% (8/40) of the type-2 deletion breakpoints and contains the 253-bp recurrent breakpoint region BR6 in which four independent type-2 deletion breakpoints were identified. Our findings indicate that a combination of an open chromatin conformation and short non-B DNA-forming repeats may predispose to recurrent mitotic NAHR events between SUZ12 and its pseudogene.
Collapse
Affiliation(s)
- Julia Vogt
- Institute of Human Genetics, University of Ulm, Ulm, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Simmons AD, Carvalho CMB, Lupski JR. What have studies of genomic disorders taught us about our genome? Methods Mol Biol 2012; 838:1-27. [PMID: 22228005 DOI: 10.1007/978-1-61779-507-7_1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
The elucidation of genomic disorders began with molecular technologies that enabled detection of genomic changes which were (a) smaller than those resolved by traditional cytogenetics (less than 5 Mb) and (b) larger than what could be determined by conventional gel electrophoresis. Methods such as pulsed field gel electrophoresis (PFGE) and fluorescent in situ hybridization (FISH) could resolve such changes but were limited to locus-specific studies. The study of genomic disorders has rapidly advanced with the development of array-based techniques. These enabled examination of the entire human genome at a higher level of resolution, thus allowing elucidation of the basis of many new disorders, mechanisms that result in genomic changes that can result in copy number variation (CNV), and most importantly, a deeper understanding of the characteristics, features, and plasticity of our genome. In this chapter, we focus on the structural and architectural features of the genome, which can potentially result in genomic instability, delineate how mechanisms, such as NAHR, NHEJ, and FoSTeS/MMBIR lead to disease-causing rearrangements, and briefly describe the relationship between the leading methods presently used in studying genomic disorders. We end with a discussion on our new understanding about our genome including: the contribution of new mutation CNV to disease, the abundance of mosaicism, the extent of subtelomeric rearrangements, the frequency of de novo rearrangements associated with sporadic birth defects, the occurrence of balanced and unbalanced translocations, the increasing discovery of insertional translocations, the exploration of complex rearrangements and exonic CNVs. In the postgenomic era, our understanding of the genome has advanced very rapidly as the level of technical resolution has become higher. This leads to a greater understanding of the effects of rearrangements present both in healthy subjects and individuals with clinically relevant phenotypes.
Collapse
|
30
|
Charcot–Marie–Tooth diseases. Neurogenetics 2012. [DOI: 10.1017/cbo9781139087711.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
31
|
Zickler AM, Hampp S, Messiaen L, Bengesser K, Mussotter T, Roehl AC, Wimmer K, Mautner VF, Kluwe L, Upadhyaya M, Pasmant E, Chuzhanova N, Kestler HA, Högel J, Legius E, Claes K, Cooper DN, Kehrer-Sawatzki H. Characterization of the nonallelic homologous recombination hotspot PRS3 associated with type-3 NF1 deletions. Hum Mutat 2011; 33:372-83. [PMID: 22045503 DOI: 10.1002/humu.21644] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Accepted: 10/06/2011] [Indexed: 12/21/2022]
Abstract
Nonallelic homologous recombination (NAHR) is the major mechanism underlying recurrent genomic rearrangements, including the large deletions at 17q11.2 that cause neurofibromatosis type 1 (NF1). Here, we identify a novel NAHR hotspot, responsible for type-3 NF1 deletions that span 1.0 Mb. Breakpoint clustering within this 1-kb hotspot, termed PRS3, was noted in 10 of 11 known type-3 NF1 deletions. PRS3 is located within the LRRC37B pseudogene of the NF1-REPb and NF1-REPc low-copy repeats. In contrast to other previously characterized NAHR hotspots, PRS3 has not developed on a preexisting allelic homologous recombination hotspot. Furthermore, the variation pattern of PRS3 and its flanking regions is unusual since only NF1-REPc (and not NF1-REPb) is characterized by a high single nucleotide polymorphism (SNP) frequency, suggestive of unidirectional sequence transfer via nonallelic homologous gene conversion (NAHGC). By contrast, the previously described intense NAHR hotspots within the CMT1A-REPs, and the PRS1 and PRS2 hotspots underlying type-1 NF1 deletions, experience frequent bidirectional sequence transfer. PRS3 within NF1-REPc was also found to be involved in NAHGC with the LRRC37B gene, the progenitor locus of the LRRC37B-P duplicons, as indicated by the presence of shared SNPs between these loci. PRS3 therefore represents a weak (and probably evolutionarily rather young) NAHR hotspot with unique properties.
Collapse
Affiliation(s)
- Antje M Zickler
- Institute of Human Genetics, University of Ulm, Albert-Einstein-Allee 11, Ulm, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Liu P, Lacaria M, Zhang F, Withers M, Hastings P, Lupski J. Frequency of nonallelic homologous recombination is correlated with length of homology: evidence that ectopic synapsis precedes ectopic crossing-over. Am J Hum Genet 2011; 89:580-8. [PMID: 21981782 DOI: 10.1016/j.ajhg.2011.09.009] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Revised: 09/14/2011] [Accepted: 09/15/2011] [Indexed: 10/16/2022] Open
Abstract
Genomic disorders constitute a class of diseases that are associated with DNA rearrangements resulting from region-specific genome instability, that is, genome architecture incites genome instability. Nonallelic homologous recombination (NAHR) or crossing-over in meiosis between sequences that are not in allelic positions (i.e., paralogous sequences) can result in recurrent deletions or duplications causing genomic disorders. Previous studies of NAHR have focused on description of the phenomenon, but it remains unclear how NAHR occurs during meiosis and what factors determine its frequency. Here we assembled two patient cohorts with reciprocal genomic disorders; deletion associated Smith-Magenis syndrome and duplication associated Potocki-Lupski syndrome. By assessing the full spectrum of rearrangement types from the two cohorts, we find that complex rearrangements (those with more than one breakpoint) are more prevalent in copy-number gains (17.7%) than in copy-number losses (2.3%); an observation that supports a role for replicative mechanisms in complex rearrangement formation. Interestingly, for NAHR-mediated recurrent rearrangements, we show that crossover frequency is positively associated with the flanking low-copy repeat (LCR) length and inversely influenced by the inter-LCR distance. To explain this, we propose that the probability of ectopic chromosome synapsis increases with increased LCR length, and that ectopic synapsis is a necessary precursor to ectopic crossing-over.
Collapse
|
33
|
Koumbaris G, Hatzisevastou-Loukidou H, Alexandrou A, Ioannides M, Christodoulou C, Fitzgerald T, Rajan D, Clayton S, Kitsiou-Tzeli S, Vermeesch JR, Skordis N, Antoniou P, Kurg A, Georgiou I, Carter NP, Patsalis PC. FoSTeS, MMBIR and NAHR at the human proximal Xp region and the mechanisms of human Xq isochromosome formation. Hum Mol Genet 2011; 20:1925-36. [PMID: 21349920 PMCID: PMC3428953 DOI: 10.1093/hmg/ddr074] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The recently described DNA replication-based mechanisms of fork stalling and template switching (FoSTeS) and microhomology-mediated break-induced replication (MMBIR) were previously shown to catalyze complex exonic, genic and genomic rearrangements. By analyzing a large number of isochromosomes of the long arm of chromosome X (i(Xq)), using whole-genome tiling path array comparative genomic hybridization (aCGH), ultra-high resolution targeted aCGH and sequencing, we provide evidence that the FoSTeS and MMBIR mechanisms can generate large-scale gross chromosomal rearrangements leading to the deletion and duplication of entire chromosome arms, thus suggesting an important role for DNA replication-based mechanisms in both the development of genomic disorders and cancer. Furthermore, we elucidate the mechanisms of dicentric i(Xq) (idic(Xq)) formation and show that most idic(Xq) chromosomes result from non-allelic homologous recombination between palindromic low copy repeats and highly homologous palindromic LINE elements. We also show that non-recurrent-breakpoint idic(Xq) chromosomes have microhomology-associated breakpoint junctions and are likely catalyzed by microhomology-mediated replication-dependent recombination mechanisms such as FoSTeS and MMBIR. Finally, we stress the role of the proximal Xp region as a chromosomal rearrangement hotspot.
Collapse
Affiliation(s)
- George Koumbaris
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | | | - Angelos Alexandrou
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Marios Ioannides
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Christodoulos Christodoulou
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Tomas Fitzgerald
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Diana Rajan
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Stephen Clayton
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Sophia Kitsiou-Tzeli
- Department of Medical Genetics, University of Athens, St Sophia Children’s Hospital, Athens 11527, Greece
| | - Joris R. Vermeesch
- Centre for Human Genetics, University Hospital, Catholic University of Leuven, 3000 Leuven, Belgium
| | - Nicos Skordis
- Pediatric Endocrine Unit, Makarios III Hospital, Nicosia 1474, Cyprus
| | - Pavlos Antoniou
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Ants Kurg
- Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia
| | | | - Nigel P. Carter
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Philippos C. Patsalis
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| |
Collapse
|
34
|
Lawson ARJ, Hindley GFL, Forshew T, Tatevossian RG, Jamie GA, Kelly GP, Neale GA, Ma J, Jones TA, Ellison DW, Sheer D. RAF gene fusion breakpoints in pediatric brain tumors are characterized by significant enrichment of sequence microhomology. Genome Res 2011; 21:505-14. [PMID: 21393386 DOI: 10.1101/gr.115782.110] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Gene fusions involving members of the RAF family of protein kinases have recently been identified as characteristic aberrations of low-grade astrocytomas, the most common tumors of the central nervous system in children. While it has been shown that these fusions cause constitutive activation of the ERK/MAPK pathway, very little is known about their formation. Here, we present a detailed analysis of RAF gene fusion breakpoints from a well-characterized cohort of 43 low-grade astrocytomas. Our findings show that the rearrangements that generate these RAF gene fusions may be simple or complex and that both inserted nucleotides and microhomology are common at the DNA breakpoints. Furthermore, we identify novel enrichment of microhomologous sequences in the regions immediately flanking the breakpoints. We thus provide evidence that the tandem duplications responsible for these fusions are generated by microhomology-mediated break-induced replication (MMBIR). Although MMBIR has previously been implicated in the pathogenesis of other diseases and the evolution of eukaryotic genomes, we demonstrate here that the proposed details of MMBIR are consistent with a recurrent rearrangement in cancer. Our analysis of repetitive elements, Z-DNA and sequence motifs in the fusion partners identified significant enrichment of the human minisatellite conserved sequence/χ-like element at one side of the breakpoint. Therefore, in addition to furthering our understanding of low-grade astrocytomas, this study provides insights into the molecular mechanistic details of MMBIR and the sequence of events that occur in the formation of genomic rearrangements.
Collapse
Affiliation(s)
- Andrew R J Lawson
- Queen Mary University of London, Centre for Neuroscience and Trauma, Blizard Institute of Cell and Molecular Science, Barts and the London School of Medicine and Dentistry, London, United Kingdom
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Koike A, Nishida N, Yamashita D, Tokunaga K. Comparative analysis of copy number variation detection methods and database construction. BMC Genet 2011; 12:29. [PMID: 21385384 PMCID: PMC3058066 DOI: 10.1186/1471-2156-12-29] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2010] [Accepted: 03/07/2011] [Indexed: 12/13/2022] Open
Abstract
Background Array-based detection of copy number variations (CNVs) is widely used for identifying disease-specific genetic variations. However, the accuracy of CNV detection is not sufficient and results differ depending on the detection programs used and their parameters. In this study, we evaluated five widely used CNV detection programs, Birdsuite (mainly consisting of the Birdseye and Canary modules), Birdseye (part of Birdsuite), PennCNV, CGHseg, and DNAcopy from the viewpoint of performance on the Affymetrix platform using HapMap data and other experimental data. Furthermore, we identified CNVs of 180 healthy Japanese individuals using parameters that showed the best performance in the HapMap data and investigated their characteristics. Results The results indicate that Hidden Markov model-based programs PennCNV and Birdseye (part of Birdsuite), or Birdsuite show better detection performance than other programs when the high reproducibility rates of the same individuals and the low Mendelian inconsistencies are considered. Furthermore, when rates of overlap with other experimental results were taken into account, Birdsuite showed the best performance from the view point of sensitivity but was expected to include many false negatives and some false positives. The results of 180 healthy Japanese demonstrate that the ratio containing repeat sequences, not only segmental repeats but also long interspersed nuclear element (LINE) sequences both in the start and end regions of the CNVs, is higher in CNVs that are commonly detected among multiple individuals than that in randomly selected regions, and the conservation score based on primates is lower in these regions than in randomly selected regions. Similar tendencies were observed in HapMap data and other experimental data. Conclusions Our results suggest that not only segmental repeats but also interspersed repeats, especially LINE sequences, are deeply involved in CNVs, particularly in common CNV formations. The detected CNVs are stored in the CNV repository database newly constructed by the "Japanese integrated database project" for sharing data among researchers. http://gwas.lifesciencedb.jp/cgi-bin/cnvdb/cnv_top.cgi
Collapse
Affiliation(s)
- Asako Koike
- Central Research Laboratory, Hitachi Ltd., Tokyo, Japan.
| | | | | | | |
Collapse
|
36
|
Neill NJ, Ballif BC, Lamb AN, Parikh S, Ravnan JB, Schultz RA, Torchia BS, Rosenfeld JA, Shaffer LG. Recurrence, submicroscopic complexity, and potential clinical relevance of copy gains detected by array CGH that are shown to be unbalanced insertions by FISH. Genome Res 2011; 21:535-44. [PMID: 21383316 DOI: 10.1101/gr.114579.110] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Insertions occur when a segment of one chromosome is translocated and inserted into a new region of the same chromosome or a non-homologous chromosome. We report 71 cases with unbalanced insertions identified using array CGH and FISH in 4909 cases referred to our laboratory for array CGH and found to have copy-number abnormalities. Although the majority of insertions were non-recurrent, several recurrent unbalanced insertions were detected, including three der(Y)ins(Y;18)(q?11.2;p11.32p11.32)pat inherited from parents carrying an unbalanced insertion. The clinical significance of these recurrent rearrangements is unclear, although the small size, limited gene content, and inheritance pattern of each suggests that the phenotypic consequences may be benign. Cryptic, submicroscopic duplications were observed at or near the insertion sites in two patients, further confounding the clinical interpretation of these insertions. Using FISH, linear amplification, and array CGH, we identified a 126-kb duplicated region from 19p13.3 inserted into MECP2 at Xq28 in a patient with symptoms of Rett syndrome. Our results demonstrate that although the interpretation of most non-recurrent insertions is unclear without high-resolution insertion site characterization, the potential for an otherwise benign duplication to result in a clinically relevant outcome through the disruption of a gene necessitates the use of FISH to determine whether copy-number gains detected by array CGH represent tandem duplications or unbalanced insertions. Further follow-up testing using techniques such as linear amplification or sequencing should be used to determine gene involvement at the insertion site after FISH has identified the presence of an insertion.
Collapse
|
37
|
Roehl AC, Vogt J, Mussotter T, Zickler AN, Spöti H, Högel J, Chuzhanova NA, Wimmer K, Kluwe L, Mautner VF, Cooper DN, Kehrer-Sawatzki H. Intrachromosomal mitotic nonallelic homologous recombination is the major molecular mechanism underlying type-2 NF1 deletions. Hum Mutat 2011; 31:1163-73. [PMID: 20725927 DOI: 10.1002/humu.21340] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Nonallelic homologous recombination (NAHR) is responsible for the recurrent rearrangements that give rise to genomic disorders. Although meiotic NAHR has been investigated in multiple contexts, much less is known about mitotic NAHR despite its importance for tumorigenesis. Because type-2 NF1 microdeletions frequently result from mitotic NAHR, they represent a good model in which to investigate the features of mitotic NAHR. We have used microsatellite analysis and SNP arrays to distinguish between the various alternative recombinational possibilities, thereby ascertaining that 17 of 18 type-2 NF1 deletions, with breakpoints in the SUZ12 gene and its highly homologous pseudogene, originated via intrachromosomal recombination. This high proportion of intrachromosomal NAHR causing somatic type-2 NF1 deletions contrasts with the interchromosomal origin of germline type-1 NF1 microdeletions, whose breakpoints are located within the NF1-REPs (low-copy repeats located adjacent to the SUZ12 sequences). Further, meiotic NAHR causing type-1 NF1 deletions occurs within recombination hotspots characterized by high GC-content and DNA duplex stability, whereas the type-2 breakpoints associated with the mitotic NAHR events investigated here do not cluster within hotspots and are located within regions of significantly lower GC-content and DNA stability. Our findings therefore point to fundamental mechanistic differences between the determinants of mitotic and meiotic NAHR.
Collapse
|
38
|
Hou A, Lin SP, Ho SY, Chen CFJ, Lin HY, Chen YJ, Huang CY, Chiu HC, Chuang CK, Chen KS. Genetic studies of Prader-Willi patients provide evidence for conservation of genomic architecture in proximal chromosome 15q. Ann Hum Genet 2011; 75:211-21. [PMID: 21198515 DOI: 10.1111/j.1469-1809.2010.00633.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Prader-Willi syndrome (PWS) is a neurogenetic disorder associated with recurrent genomic recombination involving low copy repeats (LCRs) located in the human chromosome 15q11-q13. Previous studies of PWS patients from Asia suggested that there is a higher incidence of deletion and lower incidence of maternal uniparental disomy (mUPD) compared to that of Western populations. In this report, we present genetic etiology of 28 PWS patients from Taiwan. Consistent with the genetic etiology findings from Western populations, the type II deletion appears to be the most common deletion subtype. Furthermore, the ratio of the two most common deletion subtypes and the ratio of the maternal heterodisomy to isodisomy cases observed from this study are in agreement with previous findings from Western populations. In addition, we identified and further mapped the deletion breakpoints in two patients with atypical deletions using array CGH (comparative genomic hybridization). Despite the relatively small numbers of patients in each subgroup, our findings suggest that the genomic architecture responsible for the recurrent recombination in PWS is conserved in Taiwanese of the Han Chinese heritage and Western populations, thereby predisposing chromosome 15q11-q13 to a similar risk of rearrangements.
Collapse
Affiliation(s)
- Aihua Hou
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Ou Z, Stankiewicz P, Xia Z, Breman AM, Dawson B, Wiszniewska J, Szafranski P, Cooper ML, Rao M, Shao L, South ST, Coleman K, Fernhoff PM, Deray MJ, Rosengren S, Roeder ER, Enciso VB, Chinault AC, Patel A, Kang SHL, Shaw CA, Lupski JR, Cheung SW. Observation and prediction of recurrent human translocations mediated by NAHR between nonhomologous chromosomes. Genome Res 2011; 21:33-46. [PMID: 21205869 PMCID: PMC3012924 DOI: 10.1101/gr.111609.110] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Accepted: 10/06/2010] [Indexed: 11/24/2022]
Abstract
Four unrelated families with the same unbalanced translocation der(4)t(4;11)(p16.2;p15.4) were analyzed. Both of the breakpoint regions in 4p16.2 and 11p15.4 were narrowed to large ∼359-kb and ∼215-kb low-copy repeat (LCR) clusters, respectively, by aCGH and SNP array analyses. DNA sequencing enabled mapping the breakpoints of one translocation to 24 bp within interchromosomal paralogous LCRs of ∼130 kb in length and 94.7% DNA sequence identity located in olfactory receptor gene clusters, indicating nonallelic homologous recombination (NAHR) as the mechanism for translocation formation. To investigate the potential involvement of interchromosomal LCRs in recurrent chromosomal translocation formation, we performed computational genome-wide analyses and identified 1143 interchromosomal LCR substrate pairs, >5 kb in size and sharing >94% sequence identity that can potentially mediate chromosomal translocations. Additional evidence for interchromosomal NAHR mediated translocation formation was provided by sequencing the breakpoints of another recurrent translocation, der(8)t(8;12)(p23.1;p13.31). The NAHR sites were mapped within 55 bp in ∼7.8-kb paralogous subunits of 95.3% sequence identity located in the ∼579-kb (chr 8) and ∼287-kb (chr 12) LCR clusters. We demonstrate that NAHR mediates recurrent constitutional translocations t(4;11) and t(8;12) and potentially many other interchromosomal translocations throughout the human genome. Furthermore, we provide a computationally determined genome-wide "recurrent translocation map."
Collapse
Affiliation(s)
- Zhishuo Ou
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Paweł Stankiewicz
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Zhilian Xia
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Amy M. Breman
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Brian Dawson
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Joanna Wiszniewska
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Przemyslaw Szafranski
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - M. Lance Cooper
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Mitchell Rao
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Lina Shao
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Sarah T. South
- Departments of Pediatrics and Pathology, University of Utah, Salt Lake City, Utah 84112, USA
| | - Karlene Coleman
- Children's Healthcare of Atlanta, Atlanta, Georgia 30033, USA
| | | | - Marcel J. Deray
- Department of Neurology, Miami Children's Hospital, Miami, Florida 33155, USA
| | | | | | | | - A. Craig Chinault
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Ankita Patel
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Sung-Hae L. Kang
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Chad A. Shaw
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - James R. Lupski
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas 77030, USA
- Texas Children's Hospital, Houston, Texas 77030, USA
| | - Sau W. Cheung
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
40
|
Chen JM, Férec C, Cooper DN. Gene conversion in human genetic disease. Genes (Basel) 2010; 1:550-63. [PMID: 24710102 PMCID: PMC3966225 DOI: 10.3390/genes1030550] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2010] [Revised: 11/12/2010] [Accepted: 11/17/2010] [Indexed: 01/01/2023] Open
Abstract
Gene conversion is a specific type of homologous recombination that involves the unidirectional transfer of genetic material from a ‘donor’ sequence to a highly homologous ‘acceptor’. We have recently reviewed the molecular mechanisms underlying gene conversion, explored the key part that this process has played in fashioning extant human genes, and performed a meta-analysis of gene-conversion events known to have caused human genetic disease. Here we shall briefly summarize some of the latest developments in the study of pathogenic gene conversion events, including (i) the emerging idea of minimal efficient sequence homology (MESH) for homologous recombination, (ii) the local DNA sequence features that appear to predispose to gene conversion, (iii) a mechanistic comparison of gene conversion and transient hypermutability, and (iv) recently reported examples of pathogenic gene conversion events.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | - Claude Férec
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, UK.
| |
Collapse
|
41
|
Zhang F, Seeman P, Liu P, Weterman MA, Gonzaga-Jauregui C, Towne CF, Batish SD, De Vriendt E, De Jonghe P, Rautenstrauss B, Krause KH, Khajavi M, Posadka J, Vandenberghe A, Palau F, Van Maldergem L, Baas F, Timmerman V, Lupski JR. Mechanisms for nonrecurrent genomic rearrangements associated with CMT1A or HNPP: rare CNVs as a cause for missing heritability. Am J Hum Genet 2010; 86:892-903. [PMID: 20493460 DOI: 10.1016/j.ajhg.2010.05.001] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2010] [Revised: 04/28/2010] [Accepted: 05/03/2010] [Indexed: 12/20/2022] Open
Abstract
Genomic rearrangements involving the peripheral myelin protein gene (PMP22) in human chromosome 17p12 are associated with neuropathy: duplications cause Charcot-Marie-Tooth disease type 1A (CMT1A), whereas deletions lead to hereditary neuropathy with liability to pressure palsies (HNPP). Our previous studies showed that >99% of these rearrangements are recurrent and mediated by nonallelic homologous recombination (NAHR). Rare copy number variations (CNVs) generated by nonrecurrent rearrangements also exist in 17p12, but their underlying mechanisms are not well understood. We investigated 21 subjects with rare CNVs associated with CMT1A or HNPP by oligonucleotide-based comparative genomic hybridization microarrays and breakpoint sequence analyses, and we identified 17 unique CNVs, including two genomic deletions, ten genomic duplications, two complex rearrangements, and three small exonic deletions. Each of these CNVs includes either the entire PMP22 gene, or exon(s) only, or ultraconserved potential regulatory sequences upstream of PMP22, further supporting the contention that PMP22 is the critical gene mediating the neuropathy phenotypes associated with 17p12 rearrangements. Breakpoint sequence analysis reveals that, different from the predominant NAHR mechanism in recurrent rearrangement, various molecular mechanisms, including nonhomologous end joining, Alu-Alu-mediated recombination, and replication-based mechanisms (e.g., FoSTeS and/or MMBIR), can generate nonrecurrent 17p12 rearrangements associated with neuropathy. We document a multitude of ways in which gene function can be altered by CNVs. Given the characteristics, including small size, structural complexity, and location outside of coding regions, of selected rare CNVs, their identification remains a challenge for genome analysis. Rare CNVs may potentially represent an important portion of "missing heritability" for human diseases.
Collapse
|
42
|
Aleshin A, Zhi D. Recombination-associated sequence homogenization of neighboring Alu elements: signature of nonallelic gene conversion. Mol Biol Evol 2010; 27:2300-11. [PMID: 20453015 DOI: 10.1093/molbev/msq116] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Recently, researchers have begun to recognize that, in order to establish neutral models for disease association and evolutionary genomics studies, it is crucial to have a clear understanding of the genomic impact of nonallelic gene conversion. Drawing on previous successes in characterizing this phenomenon over protein-coding gene families, we undertook a computational analysis of neighboring Alu sequences in the genome scale. For this purpose, we developed adjusted comutation rate (aCMR), a novel statistical method measuring the excess number of identical point mutations shared by adjacent Alu sequences, vis-à-vis random pairs. Using aCMR, we uncovered a remarkable genome-wide sequence homogenization of neighboring Alus, with the strongest signal observed in the pseudoautosomal regions of the X and Y chromosomes. The magnitude of sequence homogenization between Alu pairs is greater with shorter interlocus distance, higher sequence identity, and parallel orientation. Moreover, shared substitutions show a strong directionality toward GC nucleotides, with multiple substitutions tending to cluster within the Alu sequence. Taken together, these observed recombination-associated sequence homogenization patterns are best explained by frequent ubiquitous gene conversion events between neighboring Alus. We believe that these observations help to illuminate the nature and impact of the enigmatic phenomenon of gene conversion.
Collapse
Affiliation(s)
- Alexey Aleshin
- Department of Medicine, Division of Hematology, Oncology, David Geffen School of Medicine, University of California, Los Angeles, USA
| | | |
Collapse
|
43
|
Genomic and clinical characteristics of microduplications in chromosome 17. Am J Med Genet A 2010; 152A:1101-10. [DOI: 10.1002/ajmg.a.33248] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
44
|
Dynamics and processes of copy number instability in human gamma-globin genes. Proc Natl Acad Sci U S A 2010; 107:8304-9. [PMID: 20404158 DOI: 10.1073/pnas.1003634107] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Copy number variation in the human genome is prevalent but relatively little is known about the dynamics of DNA rearrangement. We therefore used the duplicated gamma-globin genes as a simple system to explore de novo copy number changes. Rearrangements that changed gene number were seen in both germline and somatic DNA, and mainly arose by unequal sister chromatid exchange between homologous sequences, with evidence from recurrent mosaic rearrangements that many, if not all, of these events in sperm arise before meiosis. Unequal exchange frequencies are apparently controlled primarily by the degree of sequence identity shared by the duplicate genes, leading to substantial variation between haplotypes in copy number instability. Additional, more complex rearrangements generated by mechanisms not involving homologous recombination, and in some cases showing DNA transfer between chromosomes, were also detected but were rare. Sequence changes were also seen in gamma-globin DNA molecules, with strong evidence that some were genuine de novo base substitutions. They were present in sperm at a frequency far higher than predicted from current estimates of germline mutation rates, raising interesting questions about base mutation dynamics in the male germline.
Collapse
|
45
|
Roehl AC, Cooper DN, Kluwe L, Helbrich A, Wimmer K, Högel J, Mautner VF, Kehrer-Sawatzki H. Extended runs of homozygosity at 17q11.2: an association with type-2NF1deletions? Hum Mutat 2010; 31:325-34. [DOI: 10.1002/humu.21191] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
46
|
Genome destabilization by homologous recombination in the germ line. Nat Rev Mol Cell Biol 2010; 11:182-95. [PMID: 20164840 DOI: 10.1038/nrm2849] [Citation(s) in RCA: 159] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Meiotic recombination, which promotes proper homologous chromosome segregation at the first meiotic division, normally occurs between allelic sequences on homologues. However, recombination can also take place between non-allelic DNA segments that share high sequence identity. Such non-allelic homologous recombination (NAHR) can markedly alter genome architecture during gametogenesis by generating chromosomal rearrangements. Indeed, NAHR-mediated deletions, duplications, inversions and other alterations have been implicated in numerous human genetic disorders. Studies in yeast have provided insights into the molecular mechanisms of meiotic NAHR as well as the cellular strategies that limit it.
Collapse
|
47
|
Zepeda-Mendoza CJ, Lemus T, Yáñez O, García D, Valle-García D, Meza-Sosa KF, Gutiérrez-Arcelus M, Márquez-Ortiz Y, Domínguez-Vidaña R, Gonzaga-Jauregui C, Flores M, Palacios R. Identical repeated backbone of the human genome. BMC Genomics 2010; 11:60. [PMID: 20096123 PMCID: PMC2845111 DOI: 10.1186/1471-2164-11-60] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2009] [Accepted: 01/23/2010] [Indexed: 11/10/2022] Open
Abstract
Background Identical sequences with a minimal length of about 300 base pairs (bp) have been involved in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes have been associated with many of these rearrangements. Based on these observations, we identified and integrated all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome. Results The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to ~2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two diploid sequenced genomes. Conclusions In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improve genome annotation.
Collapse
Affiliation(s)
- Cinthya J Zepeda-Mendoza
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, 62210, México.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Kijima TE, Innan H. On the estimation of the insertion time of LTR retrotransposable elements. Mol Biol Evol 2009; 27:896-904. [PMID: 19955475 DOI: 10.1093/molbev/msp295] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
It has been proposed that the insertion time of a long terminal repeat (LTR) retrotransposon can be estimated by the divergence between the two LTRs at the both ends because their sequences were identical at the insertion event. This method is based on the assumption that the two LTRs accumulate point mutations independently; therefore, the divergence reflects the time since the insertion event. However, if gene conversion occurs between LTRs, the nucleotide divergence will be much smaller than expected with the assumption of the independent accumulation of point mutations. To examine this assumption, we investigated the extent of gene conversion between LTRs by applying a comparative genomic approach to primates (humans and rhesus macaques) and rodents (mice and rats). We found that gene conversion plays a significant role in the molecular evolution of LTRs in primates and rodents, but the extent is quite different. In rodents, most LTRs are subject to extensive gene conversion that reduces the divergence, so that the divergence-based method results in a serious underestimation of the insertion time. In primates, this effect is limited to a small proportion of LTRs. The most likely explanation of the difference involves the minimum length of the interacting sequence (minimal efficient processing segment [MEPS]) for interlocus gene conversion. An empirical estimate of MEPS in human is 300-500 bp, which exceeds the length of most of the analyzed LTRs. In contrast, MEPS for mice should be much smaller. Thus, MEPS can be an important factor to determine the susceptibility of LTRs to gene conversion, although there are many other factors involved. It is concluded that the divergence method to estimate the insertion time should be applied with special caution because at least some LTRs undergo gene conversion.
Collapse
Affiliation(s)
- T E Kijima
- Graduate University for Advanced Studies, Hayama, Kanagawa, Japan
| | | |
Collapse
|
49
|
Genotype to phenotype—discovery and characterization of novel genomic disorders in a “genotype-first” era. Genet Med 2009; 11:836-42. [DOI: 10.1097/gim.0b013e3181c175d2] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
|
50
|
Lajoie M, Bertrand D, El-Mabrouk N. Inferring the evolutionary history of gene clusters from phylogenetic and gene order data. Mol Biol Evol 2009; 27:761-72. [PMID: 19903657 DOI: 10.1093/molbev/msp271] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Gene duplication is frequent within gene clusters and plays a fundamental role in evolution by providing a source of new genetic material upon which natural selection can act. Although classical phylogenetic inference methods provide some insight into the evolutionary history of a gene cluster, they are not sufficient alone to differentiate single- from multiple gene duplication events and to answer other questions regarding the nature and size of evolutionary events. In this paper, we present an algorithm allowing to infer a set of optimal evolutionary histories for a gene cluster in a single species, according to a general cost model involving variable length duplications (in tandem or inverted), deletions, and inversions. We applied our algorithm to the human olfactory receptor and protocadherin gene clusters, showing that the duplication size distribution differs significantly between the two gene families. The algorithm is available through a web interface at http://www-lbit.iro.umontreal.ca/DILTAG/.
Collapse
Affiliation(s)
- Mathieu Lajoie
- Département d'informatique et de recherche opérationnelle Université de Montréal, Montréal, Canada.
| | | | | |
Collapse
|