1
|
Zhou B, Purmann C, Guo H, Shin G, Huang Y, Pattni R, Meng Q, Greer SU, Roychowdhury T, Wood RN, Ho M, zu Dohna H, Abyzov A, Hallmayer JF, Wong WH, Ji HP, Urban AE. Resolving the 22q11.2 deletion using CTLR-Seq reveals chromosomal rearrangement mechanisms and individual variance in breakpoints. Proc Natl Acad Sci U S A 2024; 121:e2322834121. [PMID: 39042694 PMCID: PMC11295037 DOI: 10.1073/pnas.2322834121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 06/15/2024] [Indexed: 07/25/2024] Open
Abstract
We developed a generally applicable method, CRISPR/Cas9-targeted long-read sequencing (CTLR-Seq), to resolve, haplotype-specifically, the large and complex regions in the human genome that had been previously impenetrable to sequencing analysis, such as large segmental duplications (SegDups) and their associated genome rearrangements. CTLR-Seq combines in vitro Cas9-mediated cutting of the genome and pulse-field gel electrophoresis to isolate intact large (i.e., up to 2,000 kb) genomic regions that encompass previously unresolvable genomic sequences. These targets are then sequenced (amplification-free) at high on-target coverage using long-read sequencing, allowing for their complete sequence assembly. We applied CTLR-Seq to the SegDup-mediated rearrangements that constitute the boundaries of, and give rise to, the 22q11.2 Deletion Syndrome (22q11DS), the most common human microdeletion disorder. We then performed de novo assembly to resolve, at base-pair resolution, the full sequence rearrangements and exact chromosomal breakpoints of 22q11.2DS (including all common subtypes). Across multiple patients, we found a high degree of variability for both the rearranged SegDup sequences and the exact chromosomal breakpoint locations, which coincide with various transposons within the 22q11.2 SegDups, suggesting that 22q11DS can be driven by transposon-mediated genome recombination. Guided by CTLR-Seq results from two 22q11DS patients, we performed three-dimensional chromosomal folding analysis for the 22q11.2 SegDups from patient-derived neurons and astrocytes and found chromosome interactions anchored within the SegDups to be both cell type-specific and patient-specific. Lastly, we demonstrated that CTLR-Seq enables cell-type specific analysis of DNA methylation patterns within the deletion haplotype of 22q11DS.
Collapse
Affiliation(s)
- Bo Zhou
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Stanford Maternal and Child Health Research Institute, Stanford University School of Medicine, Stanford, CA94305
| | - Carolin Purmann
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Stanford Maternal and Child Health Research Institute, Stanford University School of Medicine, Stanford, CA94305
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
| | - Hanmin Guo
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Stanford Maternal and Child Health Research Institute, Stanford University School of Medicine, Stanford, CA94305
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
- Department of Statistics, Stanford University, Stanford, CA94305
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA 94305
| | - GiWon Shin
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA94305
| | - Yiling Huang
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
| | - Reenal Pattni
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
| | - Qingxi Meng
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA94305
| | - Stephanie U. Greer
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA94305
| | - Tanmoy Roychowdhury
- Division of Computational Biology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN55905
| | - Raegan N. Wood
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA94305
| | - Marcus Ho
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
| | - Heinrich zu Dohna
- Department of Biology, American University of Beirut, Beirut1107 2020, Lebanon
| | - Alexej Abyzov
- Division of Computational Biology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN55905
| | - Joachim F. Hallmayer
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
| | - Wing H. Wong
- Department of Statistics, Stanford University, Stanford, CA94305
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA 94305
| | - Hanlee P. Ji
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA94305
| | - Alexander E. Urban
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA94305
- Stanford Maternal and Child Health Research Institute, Stanford University School of Medicine, Stanford, CA94305
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
- Program on Genetics of Brain Function, Stanford Center for Genomics and Personalized Medicine, Stanford University School of Medicine, Stanford, CA94305
| |
Collapse
|
2
|
Hyder M, Lodhi AM, Wang Z, Bukero A, Gao J, Mao R. Wolbachia Interactions with Diverse Insect Hosts: From Reproductive Modulations to Sustainable Pest Management Strategies. BIOLOGY 2024; 13:151. [PMID: 38534421 DOI: 10.3390/biology13030151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 02/14/2024] [Accepted: 02/16/2024] [Indexed: 03/28/2024]
Abstract
Effective in a variety of insect orders, including dipteran, lepidopteran, and hemipteran, Wolbachia-based control tactics are investigated, noting the importance of sterile and incompatible insect techniques. Encouraging approaches for controlling Aedes mosquitoes are necessary, as demonstrated by the evaluation of a new SIT/IIT combination and the incorporation of SIT into Drosophila suzukii management. For example, Wolbachia may protect plants from rice pests, demonstrating its potential for agricultural biological vector management. Maternal transmission and cytoplasmic incompatibility dynamics are explored, while Wolbachia phenotypic impacts on mosquito and rice pest management are examined. The importance of host evolutionary distance is emphasised in recent scale insect research that addresses host-shifting. Using greater information, a suggested method for comprehending Wolbachia host variations in various contexts emphasises ecological connectivity. Endosymbionts passed on maternally in nematodes and arthropods, Wolbachia are widely distributed around the world and have evolved both mutualistic and parasitic traits. Wolbachia is positioned as a paradigm for microbial symbiosis due to advancements in multiomics, gene functional assays, and its effect on human health. The challenges and opportunities facing Wolbachia research include scale issues, ecological implications, ethical conundrums, and the possibility of customising strains through genetic engineering. It is thought that cooperative efforts are required to include Wolbachia-based therapies into pest management techniques while ensuring responsible and sustainable ways.
Collapse
Affiliation(s)
- Moazam Hyder
- Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Guangdong Engineering Research Center for Mineral Oil Pesticides, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou 510260, China
| | - Abdul Mubeen Lodhi
- Department Plant Protection, Sindh Agriculture University, Tandojam 70080, Pakistan
| | - Zhaohong Wang
- Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Guangdong Engineering Research Center for Mineral Oil Pesticides, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou 510260, China
| | - Aslam Bukero
- Department of Entomology, Sindh Agriculture University, Tandojam 70080, Pakistan
| | - Jing Gao
- Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Guangdong Engineering Research Center for Mineral Oil Pesticides, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou 510260, China
| | - Runqian Mao
- Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Guangdong Engineering Research Center for Mineral Oil Pesticides, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou 510260, China
| |
Collapse
|
3
|
Kouprina N, Larionov V. Transformation-associated recombination (TAR) cloning and its applications for gene function; genome architecture and evolution; biotechnology and biomedicine. Oncotarget 2023; 14:1009-1033. [PMID: 38147065 PMCID: PMC10750837 DOI: 10.18632/oncotarget.28546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 11/27/2023] [Indexed: 12/27/2023] Open
Abstract
Transformation-associated recombination (TAR) cloning represents a unique tool to selectively and efficiently recover a given chromosomal segment up to several hundred kb in length from complex genomes (such as animals and plants) and simple genomes (such as bacteria and viruses). The technique exploits a high level of homologous recombination in the yeast Sacharomyces cerevisiae. In this review, we summarize multiple applications of the pioneering TAR cloning technique, developed previously for complex genomes, for functional, evolutionary, and structural studies, and extended the modified TAR versions to isolate biosynthetic gene clusters (BGCs) from microbes, which are the major source of pharmacological agents and industrial compounds, and to engineer synthetic viruses with novel properties to design a new generation of vaccines. TAR cloning was adapted as a reliable method for the assembly of synthetic microbe genomes for fundamental research. In this review, we also discuss how the TAR cloning in combination with HAC (human artificial chromosome)- and CRISPR-based technologies may contribute to the future.
Collapse
Affiliation(s)
- Natalay Kouprina
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
| | - Vladimir Larionov
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
| |
Collapse
|
4
|
Hook PW, Timp W. Beyond assembly: the increasing flexibility of single-molecule sequencing technology. Nat Rev Genet 2023; 24:627-641. [PMID: 37161088 PMCID: PMC10169143 DOI: 10.1038/s41576-023-00600-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/30/2023] [Indexed: 05/11/2023]
Abstract
The maturation of high-throughput short-read sequencing technology over the past two decades has shaped the way genomes are studied. Recently, single-molecule, long-read sequencing has emerged as an essential tool in deciphering genome structure and function, including filling gaps in the human reference genome, measuring the epigenome and characterizing splicing variants in the transcriptome. With recent technological developments, these single-molecule technologies have moved beyond genome assembly and are being used in a variety of ways, including to selectively sequence specific loci with long reads, measure chromatin state and protein-DNA binding in order to investigate the dynamics of gene regulation, and rapidly determine copy number variation. These increasingly flexible uses of single-molecule technologies highlight a young and fast-moving part of the field that is leading to a more accessible era of nucleic acid sequencing.
Collapse
Affiliation(s)
- Paul W Hook
- Department of Biomedical Engineering, Molecular Biology and Genetics, and Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Winston Timp
- Department of Biomedical Engineering, Molecular Biology and Genetics, and Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
5
|
Mastrorosa FK, Miller DE, Eichler EE. Applications of long-read sequencing to Mendelian genetics. Genome Med 2023; 15:42. [PMID: 37316925 DOI: 10.1186/s13073-023-01194-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 05/18/2023] [Indexed: 06/16/2023] Open
Abstract
Advances in clinical genetic testing, including the introduction of exome sequencing, have uncovered the molecular etiology for many rare and previously unsolved genetic disorders, yet more than half of individuals with a suspected genetic disorder remain unsolved after complete clinical evaluation. A precise genetic diagnosis may guide clinical treatment plans, allow families to make informed care decisions, and permit individuals to participate in N-of-1 trials; thus, there is high interest in developing new tools and techniques to increase the solve rate. Long-read sequencing (LRS) is a promising technology for both increasing the solve rate and decreasing the amount of time required to make a precise genetic diagnosis. Here, we summarize current LRS technologies, give examples of how they have been used to evaluate complex genetic variation and identify missing variants, and discuss future clinical applications of LRS. As costs continue to decrease, LRS will find additional utility in the clinical space fundamentally changing how pathological variants are discovered and eventually acting as a single-data source that can be interrogated multiple times for clinical service.
Collapse
Affiliation(s)
| | - Danny E Miller
- Division of Genetic Medicine, Department of Pediatrics, University of Washington and Seattle Children's Hospital, Seattle, WA, 98195, USA
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA, 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.
| |
Collapse
|
6
|
Tamura T, Yamamoto Shimojima K, Okamoto N, Yagasaki H, Morioka I, Kanno H, Minakuchi Y, Toyoda A, Yamamoto T. Long-read sequence analysis for clustered genomic copy number aberrations revealed architectures of intricately intertwined rearrangements. Am J Med Genet A 2023; 191:112-119. [PMID: 36282026 DOI: 10.1002/ajmg.a.62997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Revised: 07/13/2022] [Accepted: 08/24/2022] [Indexed: 12/14/2022]
Abstract
Most chromosomal aberrations revealed by chromosomal microarray testing (CMA) are simple; however, very complex chromosomal structural rearrangements can also be found. Although the mechanism of structural rearrangements has been gradually revealed, not all mechanisms have been elucidated. We analyzed the breakpoint-junctions (BJs) of two or more clustered copy number variations (CNVs) in the same chromosome arms to understand their conformation and the mechanism of complex structural rearrangements. Combining CMA with long-read whole-genome sequencing (WGS) analysis, we successfully determined all BJs for the clustered CNVs identified in four patients. Multiple CNVs were intricately intertwined with each other, and clustered CNVs in four patients were involved in global complex chromosomal rearrangements. The BJs of two clustered deletions identified in two patients showed microhomologies, and their characteristics were explained by chromothripsis. In contrast, the BJs in the other two patients, who showed clustered deletions and duplications, consisted of blunt-end and nontemplated insertions. These findings could be explained only by alternative nonhomologous end-joining, a mechanism related to polymerase theta. All the patients had at least one inverted segment. Three patients showed cryptic aberrations involving a disruption and a deletion/duplication, which were not detected by CMA but were first identified by WGS. This result suggested that complex rearrangements should be considered if clustered CNVs are observed in the same chromosome arms. Because CMA has potential limitations in genotype-phenotype correlation analysis, a more detailed analysis by whole genome examination is recommended in cases of suspected complex structural aberrations.
Collapse
Affiliation(s)
- Takeaki Tamura
- Department of Pediatrics and Child Health, Nihon University School of Medicine, Tokyo, Japan.,Division of Gene Medicine, Graduate School of Medical Science, Tokyo Women's Medical University, Tokyo, Japan.,Department of Transfusion Medicine and Cell Processing, Tokyo Women's Medical University, Tokyo, Japan
| | - Keiko Yamamoto Shimojima
- Department of Transfusion Medicine and Cell Processing, Tokyo Women's Medical University, Tokyo, Japan.,Institute of Medical Genetics, Tokyo Women's Medical University, Tokyo, Japan
| | - Nobuhiko Okamoto
- Department of Medical Genetics, Osaka Women's and Children's Hospital, Izumi, Japan
| | - Hiroshi Yagasaki
- Department of Pediatrics and Child Health, Nihon University School of Medicine, Tokyo, Japan
| | - Ichiro Morioka
- Department of Pediatrics and Child Health, Nihon University School of Medicine, Tokyo, Japan
| | - Hitoshi Kanno
- Department of Transfusion Medicine and Cell Processing, Tokyo Women's Medical University, Tokyo, Japan.,Institute of Medical Genetics, Tokyo Women's Medical University, Tokyo, Japan
| | - Yohei Minakuchi
- Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Shizuoka, Japan
| | - Atsushi Toyoda
- Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Shizuoka, Japan
| | - Toshiyuki Yamamoto
- Division of Gene Medicine, Graduate School of Medical Science, Tokyo Women's Medical University, Tokyo, Japan.,Institute of Medical Genetics, Tokyo Women's Medical University, Tokyo, Japan
| |
Collapse
|
7
|
Skowronek D, Pilz RA, Bonde L, Schamuhn OJ, Feldmann JL, Hoffjan S, Much CD, Felbor U, Rath M. Cas9-Mediated Nanopore Sequencing Enables Precise Characterization of Structural Variants in CCM Genes. Int J Mol Sci 2022; 23:ijms232415639. [PMID: 36555281 PMCID: PMC9779250 DOI: 10.3390/ijms232415639] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 12/05/2022] [Accepted: 12/08/2022] [Indexed: 12/14/2022] Open
Abstract
Deletions in the CCM1, CCM2, and CCM3 genes are a common cause of familial cerebral cavernous malformations (CCMs). In current molecular genetic laboratories, targeted next-generation sequencing or multiplex ligation-dependent probe amplification are mostly used to identify copy number variants (CNVs). However, both techniques are limited in their ability to specify the breakpoints of CNVs and identify complex structural variants (SVs). To overcome these constraints, we established a targeted Cas9-mediated nanopore sequencing approach for CNV detection with single nucleotide resolution. Using a MinION device, we achieved complete coverage for the CCM genes and determined the exact size of CNVs in positive controls. Long-read sequencing for a CCM1 and CCM2 CNV revealed that the adjacent ANKIB1 and NACAD genes were also partially or completely deleted. In addition, an interchromosomal insertion and an inversion in CCM2 were reliably re-identified by long-read sequencing. The refinement of CNV breakpoints by long-read sequencing enabled fast and inexpensive PCR-based variant confirmation, which is highly desirable to reduce costs in subsequent family analyses. In conclusion, Cas9-mediated nanopore sequencing is a cost-effective and flexible tool for molecular genetic diagnostics which can be easily adapted to various target regions.
Collapse
Affiliation(s)
- Dariush Skowronek
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Robin A. Pilz
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Loisa Bonde
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Ole J. Schamuhn
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Janne L. Feldmann
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Sabine Hoffjan
- Department of Human Genetics, Ruhr-University, 44801 Bochum, Germany
| | - Christiane D. Much
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Ute Felbor
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
| | - Matthias Rath
- Department of Human Genetics, University Medicine Greifswald and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, 17475 Greifswald, Germany
- Department of Human Medicine and Institute for Molecular Medicine, MSH Medical School Hamburg, 20457 Hamburg, Germany
- Correspondence:
| |
Collapse
|
8
|
Tandem repeats structure of gel-forming mucin domains could be revealed by SMRT sequencing data. Sci Rep 2022; 12:20652. [PMID: 36450890 PMCID: PMC9712336 DOI: 10.1038/s41598-022-25262-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 11/28/2022] [Indexed: 12/05/2022] Open
Abstract
Mucins are large glycoproteins that cover and protect epithelial surface of the body. Mucin domains of gel-forming mucins are rich in proline, threonine, and serine that are heavily glycosylated. These domains show great complexity with tandem repeats, thus make it difficult to study the sequences. With the coming of single molecule real-time (SMRT) sequencing technologies, we manage to present sequence structure of mucin domains via SMRT long reads for gel-forming mucins MUC2, MUC5AC, MUC5B and MUC6. Our study shows that for different individuals, single nucleotide polymorphisms could be found in mucin domains of MUC2, MUC5AC, MUC5B and MUC6, while different number of tandem repeats could be found in mucin domains of MUC2 and MUC6. Furthermore, we get the sequence of MUC2, MUC5AC, and MUC5B mucin domain in a Chinese individual for each nucleotide at accuracy of possibly 99.98-99.99%, 99.93-99.99%, and 99.76-99.99%, respectively. We report a new method to obtain DNA sequence of gel-forming mucin domains. This method will provided new insights on getting the sequence for Tandem Repeat parts which locate in coding region. With the sequences we obtained through this method, we can give more information for people to study the sequences of gel-forming mucin domains.
Collapse
|
9
|
Lei YQ, Xu LP, Cao H, Wang XR. A method of large DNA fragment enrichment for nanopore sequencing in region 22q11.2. Front Genet 2022; 13:959883. [DOI: 10.3389/fgene.2022.959883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 10/11/2022] [Indexed: 11/13/2022] Open
Abstract
Background: 22q11.2 deletion syndrome (22q11.2DS) is a disorder caused when a small part of chromosome 22 is missing. Diagnosis is currently established by the identification of a heterozygous deletion at chromosome 22q11.2 through chromosomal microarray analysis or other genomic analyses. However, more accurate identification of the breakpoint contributes to a clearer understanding of the 22q11.2 deletion syndrome.Methods: In this study, we present a feasible nanopore sequencing method of 22q11.2 deletion. This DNA enrichment method—region-specific amplification (RSA)—is able to analyze the 22q11.2 deletion by specific amplification of an approximately 1-Mb region where the breakpoint might exist. RSA introduces universal primers into the target region DNA by a Y-shaped adaptor ligation and a single primer extension. The enriched products, completed by amplification with universal primers, are then processed by standard ONT ligation sequencing protocols.Results: RSA is able to deliver adequate coverage (>98%) and comparable long reads (average length >1 Kb) throughout the 22q11.2 region. The long nanopore sequencing reads, derived from three umbilical cord blood samples, have facilitated the identification of the breakpoint of the 22q11.2 deletion, as well as by Sanger sequencing.Conclusion: The Oxford Nanopore MinION sequencer can use RSA to sequence the target region 22q11.2; this method could also be used for other hard-to-sequence parts of the genome.
Collapse
|
10
|
Steiert TA, Fuß J, Juzenas S, Wittig M, Hoeppner M, Vollstedt M, Varkalaite G, ElAbd H, Brockmann C, Görg S, Gassner C, Forster M, Franke A. High-throughput method for the hybridisation-based targeted enrichment of long genomic fragments for PacBio third-generation sequencing. NAR Genom Bioinform 2022; 4:lqac051. [PMID: 35855323 PMCID: PMC9278042 DOI: 10.1093/nargab/lqac051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 06/08/2022] [Accepted: 06/29/2022] [Indexed: 11/16/2022] Open
Abstract
Hybridisation-based targeted enrichment is a widely used and well-established technique in high-throughput second-generation short-read sequencing. Despite the high potential to genetically resolve highly repetitive and variable genomic sequences by, for example PacBio third-generation sequencing, targeted enrichment for long fragments has not yet established the same high-throughput due to currently existing complex workflows and technological dependencies. We here describe a scalable targeted enrichment protocol for fragment sizes of >7 kb. For demonstration purposes we developed a custom blood group panel of challenging loci. Test results achieved > 65% on-target rate, good coverage (142.7×) and sufficient coverage evenness for both non-paralogous and paralogous targets, and sufficient non-duplicate read counts (83.5%) per sample for a highly multiplexed enrichment pool of 16 samples. We genotyped the blood groups of nine patients employing highly accurate phased assemblies at an allelic resolution that match reference blood group allele calls determined by SNP array and NGS genotyping. Seven Genome-in-a-Bottle reference samples achieved high recall (96%) and precision (99%) rates. Mendelian error rates were 0.04% and 0.13% for the included Ashkenazim and Han Chinese trios, respectively. In summary, we provide a protocol and first example for accurate targeted long-read sequencing that can be used in a high-throughput fashion.
Collapse
Affiliation(s)
- Tim Alexander Steiert
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Janina Fuß
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Simonas Juzenas
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
- Institute of Biotechnology, Life Science Centre, Vilnius University, Vilnius 02241, Lithuania
| | - Michael Wittig
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Marc Patrick Hoeppner
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Melanie Vollstedt
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Greta Varkalaite
- Institute for Digestive Research, Lithuanian University of Health Sciences, Kaunas 44307, Lithuania
| | - Hesham ElAbd
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Christian Brockmann
- Institute of Transfusion Medicine, University Hospital of Schleswig-Holstein, Kiel 24105, Germany
| | - Siegfried Görg
- Institute of Transfusion Medicine, University Hospital of Schleswig-Holstein, Kiel 24105, Germany
| | - Christoph Gassner
- Institute of Translational Medicine, Private University in the Principality of Liechtenstein, Triesen 9495, Liechtenstein
| | - Michael Forster
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel 24105, Germany
| |
Collapse
|
11
|
Laine P, Rowell WJ, Paulin L, Kujawa S, Raterman D, Mayhew G, Wendt J, Burgess DL, Partonen T, Paunio T, Auvinen P, Ekholm JM. Alu element in the RNA binding motif protein, X-linked 2 (RBMX2) gene found to be linked to bipolar disorder. PLoS One 2021; 16:e0261170. [PMID: 34914762 PMCID: PMC8675739 DOI: 10.1371/journal.pone.0261170] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 11/24/2021] [Indexed: 11/23/2022] Open
Abstract
Objective We have used long-read single molecule, real-time (SMRT) sequencing to fully characterize a ~12Mb genomic region on chromosome Xq24-q27, significantly linked to bipolar disorder (BD) in an extended family from a genetic sub-isolate. This family segregates BD in at least four generations with 24 affected individuals. Methods We selected 16 family members for targeted sequencing. The selected individuals either carried the disease haplotype, were non-carriers of the disease haplotype, or served as married-in controls. We designed hybrid capture probes enriching for 5-9Kb fragments spanning the entire 12Mb region that were then sequenced to screen for candidate structural variants (SVs) that could explain the increased risk for BD in this extended family. Results Altogether, 201 variants were detected in the critically linked region. Although most of these represented common variants, three variants emerged that showed near-perfect segregation among all BD type I affected individuals. Two of the SVs were identified in or near genes belonging to the RNA Binding Motif Protein, X-Linked (RBMX) gene family—a 330bp Alu (subfamily AluYa5) deletion in intron 3 of the RBMX2 gene and an intergenic 27bp tandem repeat deletion between the RBMX and G protein-coupled receptor 101 (GPR101) genes. The third SV was a 50bp tandem repeat insertion in intron 1 of the Coagulation Factor IX (F9) gene. Conclusions Among the three genetically linked SVs, additional evidence supported the Alu element deletion in RBMX2 as the leading candidate for contributing directly to the disease development of BD type I in this extended family.
Collapse
Affiliation(s)
- Pia Laine
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | | | - Lars Paulin
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Steve Kujawa
- Pacific Biosciences, Menlo Park, CA, United States of America
| | - Denise Raterman
- Roche Sequencing Solutions, Madison, WI, United States of America
| | - George Mayhew
- Roche Sequencing Solutions, Madison, WI, United States of America
| | - Jennifer Wendt
- Roche Sequencing Solutions, Madison, WI, United States of America
| | | | - Timo Partonen
- Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland
| | - Tiina Paunio
- Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland
- Department of Psychiatry, University of Helsinki, Helsinki, Finland
| | - Petri Auvinen
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Jenny M. Ekholm
- Pacific Biosciences, Menlo Park, CA, United States of America
- * E-mail:
| |
Collapse
|
12
|
Sun X, Song L, Yang W, Zhang L, Liu M, Li X, Tian G, Wang W. Nanopore Sequencing and Its Clinical Applications. Methods Mol Biol 2021; 2204:13-32. [PMID: 32710311 DOI: 10.1007/978-1-0716-0904-0_2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Nanopore sequencing is a method for determining the order and modifications of DNA/RNA nucleotides by detecting the electric current variations when DNA/RNA oligonucleotides pass through the nanometer-sized hole (nanopore). Nanopore-based DNA analysis techniques have been commercialized by Oxford Nanopore Technologies, NabSys, and Sequenom, and widely used in scientific researches recently including human genomics, cancer, metagenomics, plant sciences, etc., moreover, it also has potential applications in the field of healthcare due to its fast turn-around time, portable and real-time data analysis. Those features make it a promising technology for the point-of-care testing (POCT) and its potential clinical applications are briefly discussed in this chapter.
Collapse
Affiliation(s)
- Xue Sun
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Lei Song
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Wenjuan Yang
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Lili Zhang
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Meng Liu
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Xiaoshuang Li
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Geng Tian
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China
| | - Weiwei Wang
- Geneis (Beijing) Co., Ltd., Beijing, People's Republic of China.
| |
Collapse
|
13
|
Blondal T, Gamba C, Møller Jagd L, Su L, Demirov D, Guo S, Johnston CM, Riising EM, Wu X, Mikkelsen MJ, Szabova L, Mouritzen P. Verification of CRISPR editing and finding transgenic inserts by Xdrop indirect sequence capture followed by short- and long-read sequencing. Methods 2021; 191:68-77. [PMID: 33582298 DOI: 10.1016/j.ymeth.2021.02.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 11/12/2020] [Accepted: 02/02/2021] [Indexed: 01/02/2023] Open
Abstract
Validation of CRISPR-Cas9 editing typically explores the immediate vicinity of the gene editing site and distal off-target sequences, which has led to the conclusion that CRISPR-Cas9 editing is very specific. However, an increasing number of studies suggest that on-target unintended editing events like deletions and insertions are relatively frequent but unfortunately often missed in the validation of CRISPR-Cas9 editing. The deletions may be several kilobases-long and only affect one allele. The gold standard in molecular validation of gene editing is direct sequencing of relatively short PCR amplicons. This approach allows the detection of small editing events but fails in detecting large rearrangements, in particular when only one allele is affected. Detection of large rearrangements requires that an extended region is analyzed and the characterization of events may benefit from long-read sequencing. Here we implemented Xdrop™, a new microfluidic technology that allows targeted enrichment of long regions (~100 kb) using just a single standard PCR primer set. Sequencing of the enriched CRISPR-Cas9 gene-edited region in four cell lines on long- and short-read sequencing platforms unravelled unknown and unintended genome editing events. The analysis revealed accidental kilobases-large insertions in three of the cell lines, which remained undetected using standard procedures. We also applied the targeted enrichment approach to identify the integration site of a transgene in a mouse line. The results demonstrate the potential of this technology in gene editing validation as well as in more classic transgenics.
Collapse
Affiliation(s)
| | | | | | - Ling Su
- Cancer Research Technology Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Dimiter Demirov
- Cancer Research Technology Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Shuang Guo
- Cancer Research Technology Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | | | | | - Xiaolin Wu
- Cancer Research Technology Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | | | - Ludmila Szabova
- Center for Advanced Preclinical Research, Frederick National Laboratory for Cancer Research at the National Cancer Institute-Frederick, Frederick, MD, USA
| | | |
Collapse
|
14
|
Alquezar‐Planas DE, Löber U, Cui P, Quedenau C, Chen W, Greenwood AD. DNA sonication inverse PCR for genome scale analysis of uncharacterized flanking sequences. Methods Ecol Evol 2020. [DOI: 10.1111/2041-210x.13497] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- David E. Alquezar‐Planas
- Department of Wildlife Diseases Leibniz Institute for Zoo and Wildlife Research Berlin Germany
- Australian Museum Research InstituteAustralian Museum Sydney NSW Australia
| | - Ulrike Löber
- Department of Wildlife Diseases Leibniz Institute for Zoo and Wildlife Research Berlin Germany
- The Berlin Center for Genomics in Biodiversity Research Berlin Germany
- Experimental and Clinical Research Center A Cooperation of Charité – Universitätsmedizin Berlin and Max Delbruck Center for Molecular Medicine Berlin Germany
| | - Pin Cui
- Department of Wildlife Diseases Leibniz Institute for Zoo and Wildlife Research Berlin Germany
| | - Claudia Quedenau
- Genomics Max Delbrück Center for Molecular Medicine Berlin Germany
| | - Wei Chen
- Berlin Institute for Medical Systems BiologyMax‐Delbrück Center for Molecular Medicine Berlin Germany
| | - Alex D. Greenwood
- Department of Wildlife Diseases Leibniz Institute for Zoo and Wildlife Research Berlin Germany
- Department of Veterinary Medicine Freie Universität Berlin Berlin Germany
| |
Collapse
|
15
|
Madsen EB, Höijer I, Kvist T, Ameur A, Mikkelsen MJ. Xdrop: Targeted sequencing of long DNA molecules from low input samples using droplet sorting. Hum Mutat 2020; 41:1671-1679. [PMID: 32516842 PMCID: PMC7496172 DOI: 10.1002/humu.24063] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 04/29/2020] [Accepted: 06/05/2020] [Indexed: 11/07/2022]
Abstract
Long-read sequencing can resolve regions of the genome that are inaccessible to short reads, and therefore are ideal for genome-gap closure, solving structural rearrangements and sequencing through repetitive elements. Here we introduce the Xdrop technology: a novel microfluidic-based system that allows for targeted enrichment of long DNA molecules starting from only a few nanograms of DNA. Xdrop is based on the isolation of long DNA fragments in millions of droplets, where the droplets containing a target sequence of interest are fluorescently labeled and sorted using flow cytometry. The final product from the Xdrop procedure is an enriched population of long DNA molecules that can be investigated by sequencing. To demonstrate the capability of Xdrop, we performed enrichment of the human papilloma virus 18 integrated into the genome of human HeLa cells. Analysis of the sequencing reads resolved three HPV18-chr8 integrations at base-pair resolution, and the captured fragments extended up to 30 kb into the human genome at the integration sites. Further, we enriched the complete TP53 locus in a leukemia cell line and could successfully phase coexisting mutations using PacBio sequencing. In summary, our results show that Xdrop is an efficient enrichment technology for studying complex genomic regions.
Collapse
Affiliation(s)
| | - Ida Höijer
- Science for Life Laboratory, Department of Immunology, Genetics and PathologyUppsala UniversityUppsalaSweden
| | | | - Adam Ameur
- Science for Life Laboratory, Department of Immunology, Genetics and PathologyUppsala UniversityUppsalaSweden
- Department of Epidemiology and Preventive MedicineMonash UniversityMelbourneAustralia
| | | |
Collapse
|
16
|
Dinh HX, Singh D, Periyannan S, Park RF, Pourkheirandish M. Molecular genetics of leaf rust resistance in wheat and barley. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:2035-2050. [PMID: 32128617 DOI: 10.1007/s00122-020-03570-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 02/18/2020] [Indexed: 06/10/2023]
Abstract
The demand for cereal grains as a main source of energy continues to increase due to the rapid increase in world population. The leaf rust diseases of cereals cause significant yield losses, posing challenges for global food security. The deployment of resistance genes has long been considered as the most effective and sustainable way to control cereal leaf rust diseases. While genetic resistance has reduced the impact of these diseases in agriculture, losses still occur due to the ability of the respective rust pathogens to change and render resistance genes ineffective plus the slow pace at which resistance genes are discovered and characterized. This article highlights novel recently developed strategies based on advances in genome sequencing that have accelerated gene isolation by overcoming the complexity of cereal genomes. The leaf rust resistance genes cloned so far from wheat and barley belong to various protein families, including nucleotide binding site/leucine-rich repeat receptors and transporters. We review recent studies that are beginning to reveal the defense mechanisms conferred by the leaf rust resistance genes identified to date in cereals and their roles in either pattern-triggered immunity or effector-triggered immunity.
Collapse
Affiliation(s)
- Hoan X Dinh
- Plant Breeding Institute, Faculty of Science, The University of Sydney, Cobbitty, NSW, 2570, Australia
| | - Davinder Singh
- Plant Breeding Institute, Faculty of Science, The University of Sydney, Cobbitty, NSW, 2570, Australia
| | - Sambasivam Periyannan
- CSIRO Agriculture and Food, Box 1700, Clunies Ross Street, Canberra, 2601, Australia
| | - Robert F Park
- Plant Breeding Institute, Faculty of Science, The University of Sydney, Cobbitty, NSW, 2570, Australia.
| | | |
Collapse
|
17
|
Mateos M, Martinez Montoya H, Lanzavecchia SB, Conte C, Guillén K, Morán-Aceves BM, Toledo J, Liedo P, Asimakis ED, Doudoumis V, Kyritsis GA, Papadopoulos NT, Augustinos AA, Segura DF, Tsiamis G. Wolbachia pipientis Associated With Tephritid Fruit Fly Pests: From Basic Research to Applications. Front Microbiol 2020; 11:1080. [PMID: 32582067 PMCID: PMC7283806 DOI: 10.3389/fmicb.2020.01080] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 04/30/2020] [Indexed: 12/19/2022] Open
Abstract
Members of the true fruit flies (family Tephritidae) are among the most serious agricultural pests worldwide, whose control and management demands large and costly international efforts. The need for cost-effective and environmentally friendly integrated pest management (IPM) has led to the development and implementation of autocidal control strategies. These approaches include the widely used sterile insect technique and the incompatible insect technique (IIT). IIT relies on maternally transmitted bacteria (namely Wolbachia) to cause a conditional sterility in crosses between released mass-reared Wolbachia-infected males and wild females, which are either uninfected or infected with a different Wolbachia strain (i.e., cytoplasmic incompatibility; CI). Herein, we review the current state of knowledge on Wolbachia-tephritid interactions including infection prevalence in wild populations, phenotypic consequences, and their impact on life history traits. Numerous pest tephritid species are reported to harbor Wolbachia infections, with a subset exhibiting high prevalence. The phenotypic effects of Wolbachia have been assessed in very few tephritid species, due in part to the difficulty of manipulating Wolbachia infection (removal or transinfection). Based on recent methodological advances (high-throughput DNA sequencing) and breakthroughs concerning the mechanistic basis of CI, we suggest research avenues that could accelerate generation of necessary knowledge for the potential use of Wolbachia-based IIT in area-wide integrated pest management (AW-IPM) strategies for the population control of tephritid pests.
Collapse
Affiliation(s)
- Mariana Mateos
- Departments of Ecology and Conservation Biology, and Wildlife and Fisheries Sciences, Texas A&M University, College Station, TX, United States
| | - Humberto Martinez Montoya
- Laboratorio de Genética y Genómica Comparativa, Unidad Académica Multidisciplinaria Reynosa Aztlan, Universidad Autónoma de Tamaulipas, Ciudad Victoria, Mexico
| | - Silvia B Lanzavecchia
- Instituto de Genética 'Ewald A. Favret' - GV IABIMO (INTA-CONICET) Hurlingham, Buenos Aires, Argentina
| | - Claudia Conte
- Instituto de Genética 'Ewald A. Favret' - GV IABIMO (INTA-CONICET) Hurlingham, Buenos Aires, Argentina
| | | | | | - Jorge Toledo
- El Colegio de la Frontera Sur, Tapachula, Mexico
| | - Pablo Liedo
- El Colegio de la Frontera Sur, Tapachula, Mexico
| | - Elias D Asimakis
- Department of Environmental Engineering, University of Patras, Agrinio, Greece
| | - Vangelis Doudoumis
- Department of Environmental Engineering, University of Patras, Agrinio, Greece
| | - Georgios A Kyritsis
- Laboratory of Entomology and Agricultural Zoology, Department of Agriculture Crop Production and Rural Environment, University of Thessaly, Larissa, Greece
| | - Nikos T Papadopoulos
- Laboratory of Entomology and Agricultural Zoology, Department of Agriculture Crop Production and Rural Environment, University of Thessaly, Larissa, Greece
| | - Antonios A Augustinos
- Department of Plant Protection, Institute of Industrial and Forage Crops, Hellenic Agricultural Organization - DEMETER, Patras, Greece
| | - Diego F Segura
- Instituto de Genética 'Ewald A. Favret' - GV IABIMO (INTA-CONICET) Hurlingham, Buenos Aires, Argentina
| | - George Tsiamis
- Department of Environmental Engineering, University of Patras, Agrinio, Greece
| |
Collapse
|
18
|
Balachandran P, Beck CR. Structural variant identification and characterization. Chromosome Res 2020; 28:31-47. [PMID: 31907725 PMCID: PMC7131885 DOI: 10.1007/s10577-019-09623-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 10/15/2019] [Accepted: 11/24/2019] [Indexed: 01/06/2023]
Abstract
Structural variant (SV) differences between human genomes can cause germline and mosaic disease as well as inter-individual variation. De-regulation of accurate DNA repair and genomic surveillance mechanisms results in a large number of SVs in cancer. Analysis of the DNA sequences at SV breakpoints can help identify pathways of mutagenesis and regions of the genome that are more susceptible to rearrangement. Large-scale SV analyses have been enabled by high-throughput genome-level sequencing on humans in the past decade. These studies have shed light on the mechanisms and prevalence of complex genomic rearrangements. Recent advancements in both sequencing and other mapping technologies as well as calling algorithms for detection of genomic rearrangements have helped propel SV detection into population-scale studies, and have begun to elucidate previously inaccessible regions of the genome. Here, we discuss the genomic organization of simple and complex SVs, the molecular mechanisms of their formation, and various ways to detect them. We also introduce methods for characterizing SVs and their consequences on human genomes.
Collapse
Affiliation(s)
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA.
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, 06030, USA.
| |
Collapse
|
19
|
Andermann T, Torres Jiménez MF, Matos-Maraví P, Batista R, Blanco-Pastor JL, Gustafsson ALS, Kistler L, Liberal IM, Oxelman B, Bacon CD, Antonelli A. A Guide to Carrying Out a Phylogenomic Target Sequence Capture Project. Front Genet 2020; 10:1407. [PMID: 32153629 PMCID: PMC7047930 DOI: 10.3389/fgene.2019.01407] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 12/24/2019] [Indexed: 12/17/2022] Open
Abstract
High-throughput DNA sequencing techniques enable time- and cost-effective sequencing of large portions of the genome. Instead of sequencing and annotating whole genomes, many phylogenetic studies focus sequencing effort on large sets of pre-selected loci, which further reduces costs and bioinformatic challenges while increasing coverage. One common approach that enriches loci before sequencing is often referred to as target sequence capture. This technique has been shown to be applicable to phylogenetic studies of greatly varying evolutionary depth. Moreover, it has proven to produce powerful, large multi-locus DNA sequence datasets suitable for phylogenetic analyses. However, target capture requires careful considerations, which may greatly affect the success of experiments. Here we provide a simple flowchart for designing phylogenomic target capture experiments. We discuss necessary decisions from the identification of target loci to the final bioinformatic processing of sequence data. We outline challenges and solutions related to the taxonomic scope, sample quality, and available genomic resources of target capture projects. We hope this review will serve as a useful roadmap for designing and carrying out successful phylogenetic target capture studies.
Collapse
Affiliation(s)
- Tobias Andermann
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Maria Fernanda Torres Jiménez
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Pável Matos-Maraví
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Institute of Entomology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czechia
| | - Romina Batista
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Programa de Pós-Graduação em Genética, Conservação e Biologia Evolutiva, PPG GCBEv–Instituto Nacional de Pesquisas da Amazônia—INPA Campus II, Manaus, Brazil
- Coordenação de Zoologia, Museu Paraense Emílio Goeldi, Belém, Brazil
| | - José L. Blanco-Pastor
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- INRAE, Centre Nouvelle-Aquitaine-Poitiers, Lusignan, France
| | | | - Logan Kistler
- Department of Anthropology, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States
| | - Isabel M. Liberal
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
| | - Bengt Oxelman
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Christine D. Bacon
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Alexandre Antonelli
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Royal Botanic Gardens, Kew, Richmond-Surrey, United Kingdom
| |
Collapse
|
20
|
Sedlazeck FJ, Lee H, Darby CA, Schatz MC. Piercing the dark matter: bioinformatics of long-range sequencing and mapping. Nat Rev Genet 2019; 19:329-346. [PMID: 29599501 DOI: 10.1038/s41576-018-0003-4] [Citation(s) in RCA: 284] [Impact Index Per Article: 56.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Several new genomics technologies have become available that offer long-read sequencing or long-range mapping with higher throughput and higher resolution analysis than ever before. These long-range technologies are rapidly advancing the field with improved reference genomes, more comprehensive variant identification and more complete views of transcriptomes and epigenomes. However, they also require new bioinformatics approaches to take full advantage of their unique characteristics while overcoming their complex errors and modalities. Here, we discuss several of the most important applications of the new technologies, focusing on both the currently available bioinformatics tools and opportunities for future research.
Collapse
Affiliation(s)
- Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Hayan Lee
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Charlotte A Darby
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. .,Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
21
|
Mantere T, Kersten S, Hoischen A. Long-Read Sequencing Emerging in Medical Genetics. Front Genet 2019; 10:426. [PMID: 31134132 PMCID: PMC6514244 DOI: 10.3389/fgene.2019.00426] [Citation(s) in RCA: 226] [Impact Index Per Article: 45.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 04/18/2019] [Indexed: 12/12/2022] Open
Abstract
The wide implementation of next-generation sequencing (NGS) technologies has revolutionized the field of medical genetics. However, the short read lengths of currently used sequencing approaches pose a limitation for the identification of structural variants, sequencing repetitive regions, phasing of alleles and distinguishing highly homologous genomic regions. These limitations may significantly contribute to the diagnostic gap in patients with genetic disorders who have undergone standard NGS, like whole exome or even genome sequencing. Now, the emerging long-read sequencing (LRS) technologies may offer improvements in the characterization of genetic variation and regions that are difficult to assess with the prevailing NGS approaches. LRS has so far mainly been used to investigate genetic disorders with previously known or strongly suspected disease loci. While these targeted approaches already show the potential of LRS, it remains to be seen whether LRS technologies can soon enable true whole genome sequencing routinely. Ultimately, this could allow the de novo assembly of individual whole genomes used as a generic test for genetic disorders. In this article, we summarize the current LRS-based research on human genetic disorders and discuss the potential of these technologies to facilitate the next major advancements in medical genetics.
Collapse
Affiliation(s)
- Tuomo Mantere
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit and Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Simone Kersten
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| |
Collapse
|
22
|
Posey JE, O'Donnell-Luria AH, Chong JX, Harel T, Jhangiani SN, Coban Akdemir ZH, Buyske S, Pehlivan D, Carvalho CMB, Baxter S, Sobreira N, Liu P, Wu N, Rosenfeld JA, Kumar S, Avramopoulos D, White JJ, Doheny KF, Witmer PD, Boehm C, Sutton VR, Muzny DM, Boerwinkle E, Günel M, Nickerson DA, Mane S, MacArthur DG, Gibbs RA, Hamosh A, Lifton RP, Matise TC, Rehm HL, Gerstein M, Bamshad MJ, Valle D, Lupski JR. Insights into genetics, human biology and disease gleaned from family based genomic studies. Genet Med 2019; 21:798-812. [PMID: 30655598 PMCID: PMC6691975 DOI: 10.1038/s41436-018-0408-7] [Citation(s) in RCA: 122] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 12/05/2018] [Indexed: 12/16/2022] Open
Abstract
Identifying genes and variants contributing to rare disease phenotypes and Mendelian conditions informs biology and medicine, yet potential phenotypic consequences for variation of >75% of the ~20,000 annotated genes in the human genome are lacking. Technical advances to assess rare variation genome-wide, particularly exome sequencing (ES), enabled establishment in the United States of the National Institutes of Health (NIH)-supported Centers for Mendelian Genomics (CMGs) and have facilitated collaborative studies resulting in novel "disease gene" discoveries. Pedigree-based genomic studies and rare variant analyses in families with suspected Mendelian conditions have led to the elucidation of hundreds of novel disease genes and highlighted the impact of de novo mutational events, somatic variation underlying nononcologic traits, incompletely penetrant alleles, phenotypes with high locus heterogeneity, and multilocus pathogenic variation. Herein, we highlight CMG collaborative discoveries that have contributed to understanding both rare and common diseases and discuss opportunities for future discovery in single-locus Mendelian disorder genomics. Phenotypic annotation of all human genes; development of bioinformatic tools and analytic methods; exploration of non-Mendelian modes of inheritance including reduced penetrance, multilocus variation, and oligogenic inheritance; construction of allelic series at a locus; enhanced data sharing worldwide; and integration with clinical genomics are explored. Realizing the full contribution of rare disease research to functional annotation of the human genome, and further illuminating human biology and health, will lay the foundation for the Precision Medicine Initiative.
Collapse
Affiliation(s)
- Jennifer E Posey
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| | - Anne H O'Donnell-Luria
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Boston Children's Hospital, Boston, MA, USA
| | - Jessica X Chong
- Department of Pediatrics, University of Washington, Seattle, WA, USA
| | - Tamar Harel
- Department of Genetic and Metabolic Diseases, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| | - Shalini N Jhangiani
- The Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Zeynep H Coban Akdemir
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Steven Buyske
- Department of Genetics, Rutgers University, Piscataway, NJ, USA
- Department of Statistics, Rutgers University, Piscataway, NJ, USA
| | - Davut Pehlivan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Claudia M B Carvalho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Samantha Baxter
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nara Sobreira
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Baylor Genetics Laboratory, Houston, TX, USA
| | - Nan Wu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, China
| | - Jill A Rosenfeld
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Sushant Kumar
- Computational Biology and Bioinformatics Program, Yale University Medical School, New Haven, CT, USA
| | - Dimitri Avramopoulos
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Janson J White
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Department of Pediatrics, University of Washington, Seattle, WA, USA
| | - Kimberly F Doheny
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
- Center for Inherited Disease Research, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - P Dane Witmer
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
- Center for Inherited Disease Research, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Corinne Boehm
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - V Reid Sutton
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Donna M Muzny
- The Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Eric Boerwinkle
- The Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Human Genetics Center, University of Texas Health Science Center, Houston, TX, USA
| | - Murat Günel
- Department of Neurosurgery, Yale School of Medicine, New Haven, CT, USA
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | | | - Shrikant Mane
- Yale Center for Genome Analysis, Yale School of Medicine, Yale University, New Haven, CT, USA
| | - Daniel G MacArthur
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Richard A Gibbs
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- The Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Ada Hamosh
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Richard P Lifton
- Department of Neurosurgery, Yale School of Medicine, New Haven, CT, USA
- Department of Internal Medicine, Yale University School of Medicine, New Haven, CT, USA
- Laboratory of Human Genetics and Genomics, The Rockefeller University, New York, NY, USA
| | - Tara C Matise
- Department of Genetics, Rutgers University, Piscataway, NJ, USA
| | - Heidi L Rehm
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Mark Gerstein
- Computational Biology and Bioinformatics Program, Yale University Medical School, New Haven, CT, USA
| | - Michael J Bamshad
- Department of Pediatrics, University of Washington, Seattle, WA, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - David Valle
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
- The Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA.
- Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
23
|
Beck CR, Carvalho CMB, Akdemir ZC, Sedlazeck FJ, Song X, Meng Q, Hu J, Doddapaneni H, Chong Z, Chen ES, Thornton PC, Liu P, Yuan B, Withers M, Jhangiani SN, Kalra D, Walker K, English AC, Han Y, Chen K, Muzny DM, Ira G, Shaw CA, Gibbs RA, Hastings PJ, Lupski JR. Megabase Length Hypermutation Accompanies Human Structural Variation at 17p11.2. Cell 2019; 176:1310-1324.e10. [PMID: 30827684 PMCID: PMC6438178 DOI: 10.1016/j.cell.2019.01.045] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 11/06/2018] [Accepted: 01/25/2019] [Indexed: 01/16/2023]
Abstract
DNA rearrangements resulting in human genome structural variants (SVs) are caused by diverse mutational mechanisms. We used long- and short-read sequencing technologies to investigate end products of de novo chromosome 17p11.2 rearrangements and query the molecular mechanisms underlying both recurrent and non-recurrent events. Evidence for an increased rate of clustered single-nucleotide variant (SNV) mutation in cis with non-recurrent rearrangements was found. Indel and SNV formation are associated with both copy-number gains and losses of 17p11.2, occur up to ∼1 Mb away from the breakpoint junctions, and favor C > G transversion substitutions; results suggest that single-stranded DNA is formed during the genesis of the SV and provide compelling support for a microhomology-mediated break-induced replication (MMBIR) mechanism for SV formation. Our data show an additional mutational burden of MMBIR consisting of hypermutation confined to the locus and manifesting as SNVs and indels predominantly within genes.
Collapse
Affiliation(s)
- Christine R Beck
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | | | - Zeynep C Akdemir
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | | | - Xiaofei Song
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Qingchang Meng
- Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | - Jianhong Hu
- Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | | | - Zechen Chong
- Department of Genetics and the Informatics Institute, the University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Edward S Chen
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Philip C Thornton
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Pengfei Liu
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Bo Yuan
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Marjorie Withers
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | | | - Divya Kalra
- Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | | | - Adam C English
- Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | - Yi Han
- Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | - Ken Chen
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | - Grzegorz Ira
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Chad A Shaw
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA
| | - Richard A Gibbs
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA; Human Genome Sequencing Center, BCM, Houston, TX 77030, USA
| | - P J Hastings
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA; Dan L. Duncan Comprehensive Cancer Center, BCM, Houston, TX 77030, USA.
| | - James R Lupski
- Department of Molecular and Human Genetics, BCM, Houston, TX 77030, USA; Human Genome Sequencing Center, BCM, Houston, TX 77030, USA; Department of Pediatrics, BCM, Houston, TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA; Dan L. Duncan Comprehensive Cancer Center, BCM, Houston, TX 77030, USA.
| |
Collapse
|
24
|
Single-Molecule Sequencing: Towards Clinical Applications. Trends Biotechnol 2019; 37:72-85. [DOI: 10.1016/j.tibtech.2018.07.013] [Citation(s) in RCA: 112] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 07/16/2018] [Accepted: 07/18/2018] [Indexed: 12/31/2022]
|
25
|
Hoff SNK, Baalsrud HT, Tooming-Klunderud A, Skage M, Richmond T, Obernosterer G, Shirzadi R, Tørresen OK, Jakobsen KS, Jentoft S. Long-read sequence capture of the haemoglobin gene clusters across codfish species. Mol Ecol Resour 2018; 19:245-259. [PMID: 30329222 PMCID: PMC7379720 DOI: 10.1111/1755-0998.12955] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 10/05/2018] [Accepted: 10/09/2018] [Indexed: 11/30/2022]
Abstract
Combining high-throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short-read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long-read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom-made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage-specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long-read capture as a versatile approach for comparative genomic studies by generation of a cross-species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes.
Collapse
Affiliation(s)
- Siv Nam Khang Hoff
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Helle T Baalsrud
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Ave Tooming-Klunderud
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Morten Skage
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | | | | | | | - Ole Kristian Tørresen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Kjetill S Jakobsen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Sissel Jentoft
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| |
Collapse
|
26
|
Höijer I, Tsai Y, Clark TA, Kotturi P, Dahl N, Stattin E, Bondeson M, Feuk L, Gyllensten U, Ameur A. Detailed analysis of HTT repeat elements in human blood using targeted amplification-free long-read sequencing. Hum Mutat 2018; 39:1262-1272. [PMID: 29932473 PMCID: PMC6175010 DOI: 10.1002/humu.23580] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Revised: 05/30/2018] [Accepted: 06/20/2018] [Indexed: 12/24/2022]
Abstract
Amplification of DNA is required as a mandatory step during library preparation in most targeted sequencing protocols. This can be a critical limitation when targeting regions that are highly repetitive or with extreme guanine-cytosine (GC) content, including repeat expansions associated with human disease. Here, we used an amplification-free protocol for targeted enrichment utilizing the CRISPR/Cas9 system (No-Amp Targeted sequencing) in combination with single molecule, real-time (SMRT) sequencing for studying repeat elements in the huntingtin (HTT) gene, where an expanded CAG repeat is causative for Huntington disease. We also developed a robust data analysis pipeline for repeat element analysis that is independent of alignment of reads to a reference genome. The method was applied to 11 diagnostic blood samples, and for all 22 alleles the resulting CAG repeat count agreed with previous results based on fragment analysis. The amplification-free protocol also allowed for studying somatic variability of repeat elements in our samples, without the interference of PCR stutter. In summary, with No-Amp Targeted sequencing in combination with our analysis pipeline, we could accurately study repeat elements that are difficult to investigate using PCR-based methods.
Collapse
Affiliation(s)
- Ida Höijer
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
| | | | | | | | - Niklas Dahl
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
| | - Eva‐Lena Stattin
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
| | - Marie‐Louise Bondeson
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
| | - Lars Feuk
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
| | - Ulf Gyllensten
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
| | - Adam Ameur
- Science for Life LaboratoryDepartment of ImmunologyGenetics and PathologyUppsala UniversityUppsalaSweden
- School of Public Health and Preventive MedicineMonash UniversityMelbourneVictoriaAustralia
| |
Collapse
|
27
|
Schwessinger B, Sperschneider J, Cuddy WS, Garnica DP, Miller ME, Taylor JM, Dodds PN, Figueroa M, Park RF, Rathjen JP. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity. mBio 2018; 9:e02275-17. [PMID: 29463659 PMCID: PMC5821087 DOI: 10.1128/mbio.02275-17] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 01/09/2018] [Indexed: 01/01/2023] Open
Abstract
A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies.IMPORTANCE Current representations of eukaryotic microbial genomes are haploid, hiding the genomic diversity intrinsic to diploid and polyploid life forms. This hidden diversity contributes to the organism's evolutionary potential and ability to adapt to stress conditions. Yet, it is challenging to provide haplotype-specific information at a whole-genome level. Here, we take advantage of long-read DNA sequencing technology and a tailored-assembly algorithm to disentangle the two haploid genomes of a dikaryotic pathogenic wheat rust fungus. The two genomes display high levels of nucleotide and structural variations, which lead to allelic variation and the presence of genes lacking allelic counterparts. Nonallelic candidate effector genes, which likely encode important pathogenicity factors, display distinct genome localization patterns and are less likely to be evolutionary conserved than those which are present as allelic pairs. This genomic diversity may promote rapid host adaptation and/or be related to the age of the sequenced isolate since last meiosis.
Collapse
Affiliation(s)
- Benjamin Schwessinger
- Research School of Biology, the Australian National University, Acton, ACT, Australia
| | - Jana Sperschneider
- Centre for Environment and Life Sciences, CSIRO Agriculture and Food, Perth, WA, Australia
| | - William S Cuddy
- Plant Breeding Institute, Faculty of Agriculture and Environment, the University of Sydney, Narellan, NSW, Australia
- NSW Department of Primary Industries, Elizabeth Macarthur Agricultural Institute, Menangle, NSW, Australia
| | - Diana P Garnica
- Research School of Biology, the Australian National University, Acton, ACT, Australia
| | - Marisa E Miller
- Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota, USA
| | - Jennifer M Taylor
- Black Mountain Laboratories, CSIRO Agriculture and Food, Canberra, ACT, Australia
| | - Peter N Dodds
- Black Mountain Laboratories, CSIRO Agriculture and Food, Canberra, ACT, Australia
| | - Melania Figueroa
- Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota, USA
- Stakman-Borlaug Center for Sustainable Plant Health, University of Minnesota, St. Paul, Minnesota, USA
| | - Robert F Park
- Plant Breeding Institute, Faculty of Agriculture and Environment, the University of Sydney, Narellan, NSW, Australia
| | - John P Rathjen
- Research School of Biology, the Australian National University, Acton, ACT, Australia
| |
Collapse
|
28
|
Editing out five Serpina1 paralogs to create a mouse model of genetic emphysema. Proc Natl Acad Sci U S A 2018; 115:2788-2793. [PMID: 29453277 PMCID: PMC5856518 DOI: 10.1073/pnas.1713689115] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Chronic obstructive pulmonary disease affects 10% of the worldwide population, and the leading genetic cause is a genetic disease, α-1 antitrypsin (AAT) deficiency. Humans have only one gene that codes for the AAT protein, but mice have up to six, which made it impossible for decades to create a mouse model of the disease. Here we succeeded in creating this mouse model using CRISPR technology to target all of the mouse genes at once. Importantly, this mouse model spontaneously develops lung disease and recapitulates many aspects of the human disease. We anticipate that this model will be highly relevant not only to the preclinical development of therapeutics for AAT deficiency, but also to emphysema and smoking research. Chronic obstructive pulmonary disease affects 10% of the worldwide population, and the leading genetic cause is α-1 antitrypsin (AAT) deficiency. Due to the complexity of the murine locus, which includes up to six Serpina1 paralogs, no genetic animal model of the disease has been successfully generated until now. Here we create a quintuple Serpina1a–e knockout using CRISPR/Cas9-mediated genome editing. The phenotype recapitulates the human disease phenotype, i.e., absence of hepatic and circulating AAT translates functionally to a reduced capacity to inhibit neutrophil elastase. With age, Serpina1 null mice develop emphysema spontaneously, which can be induced in younger mice by a lipopolysaccharide challenge. This mouse models not only AAT deficiency but also emphysema and is a relevant genetic model and not one based on developmental impairment of alveolarization or elastase administration. We anticipate that this unique model will be highly relevant not only to the preclinical development of therapeutics for AAT deficiency, but also to emphysema and smoking research.
Collapse
|
29
|
Ezpeleta J, Krsticevic FJ, Bulacio P, Tapia E. Designing robust watermark barcodes for multiplex long-read sequencing. Bioinformatics 2017; 33:807-813. [PMID: 27259539 DOI: 10.1093/bioinformatics/btw322] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 05/16/2016] [Indexed: 11/12/2022] Open
Abstract
Motivation To attain acceptable sample misassignment rates, current approaches to multiplex single-molecule real-time sequencing require upstream quality improvement, which is obtained from multiple passes over the sequenced insert and significantly reduces the effective read length. In order to fully exploit the raw read length on multiplex applications, robust barcodes capable of dealing with the full single-pass error rates are needed. Results We present a method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11%. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10-7 under the above conditions, and are designed to be compatible with chemical constraints imposed by the sequencing process. Availability and Implementation Software tools for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark . Contact ezpeleta@cifasis-conicet.gov.ar.
Collapse
Affiliation(s)
- Joaquín Ezpeleta
- Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas, Rosario, Argentina.,Facultad de Ciencias Exactas, Ingeniería y Agrimensura, Universidad Nacional de Rosario, Rosario, Argentina
| | - Flavia J Krsticevic
- Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas, Rosario, Argentina
| | - Pilar Bulacio
- Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas, Rosario, Argentina.,Facultad de Ciencias Exactas, Ingeniería y Agrimensura, Universidad Nacional de Rosario, Rosario, Argentina
| | - Elizabeth Tapia
- Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas, Rosario, Argentina.,Facultad de Ciencias Exactas, Ingeniería y Agrimensura, Universidad Nacional de Rosario, Rosario, Argentina
| |
Collapse
|
30
|
McGinty RJ, Rubinstein RG, Neil AJ, Dominska M, Kiktev D, Petes TD, Mirkin SM. Nanopore sequencing of complex genomic rearrangements in yeast reveals mechanisms of repeat-mediated double-strand break repair. Genome Res 2017; 27:2072-2082. [PMID: 29113982 PMCID: PMC5741057 DOI: 10.1101/gr.228148.117] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Accepted: 10/26/2017] [Indexed: 01/25/2023]
Abstract
Improper DNA double-strand break (DSB) repair results in complex genomic rearrangements (CGRs) in many cancers and various congenital disorders in humans. Trinucleotide repeat sequences, such as (GAA)n repeats in Friedreich's ataxia, (CTG)n repeats in myotonic dystrophy, and (CGG)n repeats in fragile X syndrome, are also subject to double-strand breaks within the repetitive tract followed by DNA repair. Mapping the outcomes of CGRs is important for understanding their causes and potential phenotypic effects. However, high-resolution mapping of CGRs has traditionally been a laborious and highly skilled process. Recent advances in long-read DNA sequencing technologies, specifically Nanopore sequencing, have made possible the rapid identification of CGRs with single base pair resolution. Here, we have used whole-genome Nanopore sequencing to characterize several CGRs that originated from naturally occurring DSBs at (GAA)n microsatellites in Saccharomyces cerevisiae. These data gave us important insights into the mechanisms of DSB repair leading to CGRs.
Collapse
Affiliation(s)
- Ryan J McGinty
- Department of Biology, Tufts University, Medford, Massachusetts 02155, USA
| | | | - Alexander J Neil
- Department of Biology, Tufts University, Medford, Massachusetts 02155, USA
| | - Margaret Dominska
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Denis Kiktev
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Thomas D Petes
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Sergei M Mirkin
- Department of Biology, Tufts University, Medford, Massachusetts 02155, USA
| |
Collapse
|
31
|
Huang M, Tu J, Lu Z. Recent Advances in Experimental Whole Genome Haplotyping Methods. Int J Mol Sci 2017; 18:E1944. [PMID: 28891974 PMCID: PMC5618593 DOI: 10.3390/ijms18091944] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 09/01/2017] [Accepted: 09/05/2017] [Indexed: 01/06/2023] Open
Abstract
Haplotype plays a vital role in diverse fields; however, the sequencing technologies cannot resolve haplotype directly. Pioneers demonstrated several approaches to resolve haplotype in the early years, which was extensively reviewed. Since then, numerous methods have been developed recently that have significantly improved phasing performance. Here, we review experimental methods that have emerged mainly over the past five years, and categorize them into five classes according to their maximum scale of contiguity: (i) encapsulation, (ii) 3D structure capture and construction, (iii) compartmentalization, (iv) fluorography, (v) long-read sequencing. Several subsections of certain methods are attached to each class as instances. We also discuss the relative advantages and disadvantages of different classes and make comparisons among representative methods of each class.
Collapse
Affiliation(s)
- Mengting Huang
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
| | - Jing Tu
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
| | - Zuhong Lu
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
| |
Collapse
|
32
|
Ravinet M, Faria R, Butlin RK, Galindo J, Bierne N, Rafajlović M, Noor MAF, Mehlig B, Westram AM. Interpreting the genomic landscape of speciation: a road map for finding barriers to gene flow. J Evol Biol 2017; 30:1450-1477. [DOI: 10.1111/jeb.13047] [Citation(s) in RCA: 306] [Impact Index Per Article: 43.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 01/31/2017] [Accepted: 02/01/2017] [Indexed: 12/14/2022]
Affiliation(s)
- M. Ravinet
- Centre for Ecological and Evolutionary Synthesis; University of Oslo; Oslo Norway
- National Institute of Genetics; Mishima Shizuoka Japan
| | - R. Faria
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos; InBIO, Laboratório Associado; Universidade do Porto; Vairão Portugal
- Department of Experimental and Health Sciences; IBE, Institute of Evolutionary Biology (CSIC-UPF); Pompeu Fabra University; Barcelona Spain
- Department of Animal and Plant Sciences; University of Sheffield; Sheffield UK
| | - R. K. Butlin
- Department of Animal and Plant Sciences; University of Sheffield; Sheffield UK
- Department of Marine Sciences; Centre for Marine Evolutionary Biology; University of Gothenburg; Gothenburg Sweden
| | - J. Galindo
- Department of Biochemistry, Genetics and Immunology; University of Vigo; Vigo Spain
| | - N. Bierne
- CNRS; Université Montpellier; ISEM; Station Marine Sète France
| | - M. Rafajlović
- Department of Physics; University of Gothenburg; Gothenburg Sweden
| | | | - B. Mehlig
- Department of Physics; University of Gothenburg; Gothenburg Sweden
| | - A. M. Westram
- Department of Animal and Plant Sciences; University of Sheffield; Sheffield UK
| |
Collapse
|
33
|
Giolai M, Paajanen P, Verweij W, Witek K, Jones JDG, Clark MD. Comparative analysis of targeted long read sequencing approaches for characterization of a plant's immune receptor repertoire. BMC Genomics 2017; 18:564. [PMID: 28747151 PMCID: PMC5530509 DOI: 10.1186/s12864-017-3936-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2017] [Accepted: 07/10/2017] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND The Oxford Nanopore Technologies MinION™ sequencer is a small, portable, low cost device that is accessible to labs of all sizes and attractive for in-the-field sequencing experiments. Selective breeding of crops has led to a reduction in genetic diversity, and wild relatives are a key source of new genetic resistance to pathogens, usually via NLR immune receptor-encoding genes. Recent studies have demonstrated how crop NLR repertoires can be targeted for sequencing on Illumina or PacBio (RenSeq) and the specific gene conveying pathogen resistance identified. RESULTS Sequence yields per MinION run are lower than Illumina, making targeted resequencing an efficient approach. While MinION generates long reads similar to PacBio it doesn't generate the highly accurate multipass consensus reads, which presents downstream bioinformatics challenges. Here we demonstrate how MinION data can be used for RenSeq achieving similar results to the PacBio and how novel NLR gene fusions can be identified via a Nanopore RenSeq pipeline. CONCLUSION The described library preparation and bioinformatics methods should be applicable to other gene families or any targeted long DNA fragment nanopore sequencing project.
Collapse
Affiliation(s)
- Michael Giolai
- Earlham Institute (EI), Norwich Research Park, Norwich, NR4 7UZ, UK
| | - Pirita Paajanen
- Earlham Institute (EI), Norwich Research Park, Norwich, NR4 7UZ, UK
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK
| | - Walter Verweij
- Earlham Institute (EI), Norwich Research Park, Norwich, NR4 7UZ, UK
| | - Kamil Witek
- The Sainsbury Laboratory, Norwich Research Park, Norwich, NR4 7UH, UK
| | | | - Matthew D Clark
- Earlham Institute (EI), Norwich Research Park, Norwich, NR4 7UZ, UK.
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK.
| |
Collapse
|
34
|
Zhang L, Wang J, Zhang C, Li D, Carvalho CM, Ji H, Xiao J, Wu Y, Zhou W, Wang H, Jin L, Luo Y, Wu X, Lupski JR, Zhang F, Jiang Y. Efficient CNV breakpoint analysis reveals unexpected structural complexity and correlation of dosage-sensitive genes with clinical severity in genomic disorders. Hum Mol Genet 2017; 26:1927-1941. [PMID: 28334874 PMCID: PMC6075079 DOI: 10.1093/hmg/ddx102] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Revised: 03/08/2017] [Accepted: 03/10/2017] [Indexed: 01/13/2023] Open
Abstract
Genomic disorders are the clinical conditions manifested by submicroscopic genomic rearrangements including copy number variants (CNVs). The CNVs can be identified by array-based comparative genomic hybridization (aCGH), the most commonly used technology for molecular diagnostics of genomic disorders. However, clinical aCGH only informs CNVs in the probe-interrogated regions. Neither orientational information nor the resulting genomic rearrangement structure is provided, which is a key to uncovering mutational and pathogenic mechanisms underlying genomic disorders. Long-range polymerase chain reaction (PCR) is a traditional approach to obtain CNV breakpoint junction, but this method is inefficient when challenged by structural complexity such as often found at the PLP1 locus in association with Pelizaeus-Merzbacher disease (PMD). Here we introduced 'capture and single-molecule real-time sequencing' (cap-SMRT-seq) and newly developed 'asymmetry linker-mediated nested PCR walking' (ALN-walking) for CNV breakpoint sequencing in 49 subjects with PMD-associated CNVs. Remarkably, 29 (94%) of the 31 CNV breakpoint junctions unobtainable by conventional long-range PCR were resolved by cap-SMRT-seq and ALN-walking. Notably, unexpected CNV complexities, including inter-chromosomal rearrangements that cannot be resolved by aCGH, were revealed by efficient breakpoint sequencing. These sequence-based structures of PMD-associated CNVs further support the role of DNA replicative mechanisms in CNV mutagenesis, and facilitate genotype-phenotype correlation studies. Intriguingly, the lengths of gained segments by CNVs are strongly correlated with clinical severity in PMD, potentially reflecting the functional contribution of other dosage-sensitive genes besides PLP1. Our study provides new efficient experimental approaches (especially ALN-walking) for CNV breakpoint sequencing and highlights their importance in uncovering CNV mutagenesis and pathogenesis in genomic disorders.
Collapse
Affiliation(s)
- Ling Zhang
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
- Key Laboratory of Reproduction Regulation of NPFPC, Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Female Reproductive Endocrine Related Diseases, Shanghai 200011, China
| | - Jingmin Wang
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China
| | - Cheng Zhang
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
| | - Dongxiao Li
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China
| | - Claudia M.B. Carvalho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Haoran Ji
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China
| | - Jianqiu Xiao
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
| | - Ye Wu
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China
| | - Weichen Zhou
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
| | - Hongyan Wang
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
- Key Laboratory of Reproduction Regulation of NPFPC, Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Female Reproductive Endocrine Related Diseases, Shanghai 200011, China
| | - Li Jin
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
- Key Laboratory of Reproduction Regulation of NPFPC, Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200032, China
| | - Yang Luo
- MOE Key Laboratory of Medical Cell Biology, The Research Center for Medical Genomics, College of Basic Medical Science, China Medical University, Shenyang 110001, China
| | - Xiru Wu
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China
| | - James R. Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
- Texas Children’s Hospital, Houston, TX 77030, USA
| | - Feng Zhang
- Obstetrics and Gynecology Hospital, State Key Laboratory of Genetic Engineering at School of Life Sciences, Institute of Reproduction and Development, Fudan University, Shanghai 200011, China
- Key Laboratory of Reproduction Regulation of NPFPC, Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Female Reproductive Endocrine Related Diseases, Shanghai 200011, China
| | - Yuwu Jiang
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China
| |
Collapse
|
35
|
Gasc C, Peyret P. Revealing large metagenomic regions through long DNA fragment hybridization capture. MICROBIOME 2017; 5:33. [PMID: 28292322 PMCID: PMC5351058 DOI: 10.1186/s40168-017-0251-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Accepted: 03/05/2017] [Indexed: 05/07/2023]
Abstract
BACKGROUND High-throughput DNA sequencing technologies have revolutionized genomic analysis, including the de novo assembly of whole genomes from single organisms or metagenomic samples. However, due to the limited capacity of short-read sequence data to assemble complex or low coverage regions, genomes are typically fragmented, leading to draft genomes with numerous underexplored large genomic regions. Revealing these missing sequences is a major goal to resolve concerns in numerous biological studies. METHODS To overcome these limitations, we developed an innovative target enrichment method for the reconstruction of large unknown genomic regions. Based on a hybridization capture strategy, this approach enables the enrichment of large genomic regions allowing the reconstruction of tens of kilobase pairs flanking a short, targeted DNA sequence. RESULTS Applied to a metagenomic soil sample targeting the linA gene, the biomarker of hexachlorocyclohexane (HCH) degradation, our method permitted the enrichment of the gene and its flanking regions leading to the reconstruction of several contigs and complete plasmids exceeding tens of kilobase pairs surrounding linA. Thus, through gene association and genome reconstruction, we identified microbial species involved in HCH degradation which constitute targets to improve biostimulation treatments. CONCLUSIONS This new hybridization capture strategy makes surveying and deconvoluting complex genomic regions possible through large genomic regions enrichment and allows the efficient exploration of metagenomic diversity. Indeed, this approach enables to assign identity and function to microorganisms in natural environments, one of the ultimate goals of microbial ecology.
Collapse
Affiliation(s)
- Cyrielle Gasc
- Université Clermont Auvergne, INRA, MEDIS, 63000 Clermont-Ferrand, France
| | - Pierre Peyret
- Université Clermont Auvergne, INRA, MEDIS, 63000 Clermont-Ferrand, France
| |
Collapse
|
36
|
Schwessinger B, Rathjen JP. Extraction of High Molecular Weight DNA from Fungal Rust Spores for Long Read Sequencing. Methods Mol Biol 2017; 1659:49-57. [PMID: 28856640 DOI: 10.1007/978-1-4939-7249-4_5] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Wheat rust fungi are complex organisms with a complete life cycle that involves two different host plants and five different spore types. During the asexual infection cycle on wheat, rusts produce massive amounts of dikaryotic urediniospores. These spores are dikaryotic (two nuclei) with each nucleus containing one haploid genome. This dikaryotic state is likely to contribute to their evolutionary success, making them some of the major wheat pathogens globally. Despite this, most published wheat rust genomes are highly fragmented and contain very little haplotype-specific sequence information. Current long-read sequencing technologies hold great promise to provide more contiguous and haplotype-phased genome assemblies. Long reads are able to span repetitive regions and phase structural differences between the haplomes. This increased genome resolution enables the identification of complex loci and the study of genome evolution beyond simple nucleotide polymorphisms. Long-read technologies require pure high molecular weight DNA as an input for sequencing. Here, we describe a DNA extraction protocol for rust spores that yields pure double-stranded DNA molecules with molecular weight of >50 kilo-base pairs (kbp). The isolated DNA is of sufficient purity for PacBio long-read sequencing, but may require additional purification for other sequencing technologies such as Nanopore and 10× Genomics.
Collapse
Affiliation(s)
- Benjamin Schwessinger
- Research School of Biology, Australian National University, Linnaeus Way, Canberra, CT, Australia.
| | - John P Rathjen
- Research School of Biology, Australian National University, Linnaeus Way, Canberra, CT, Australia
| |
Collapse
|
37
|
da Fonseca RR, Albrechtsen A, Themudo GE, Ramos-Madrigal J, Sibbesen JA, Maretty L, Zepeda-Mendoza ML, Campos PF, Heller R, Pereira RJ. Next-generation biology: Sequencing and data analysis approaches for non-model organisms. Mar Genomics 2016; 30:3-13. [DOI: 10.1016/j.margen.2016.04.012] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Revised: 03/23/2016] [Accepted: 04/26/2016] [Indexed: 10/21/2022]
|
38
|
Giolai M, Paajanen P, Verweij W, Percival-Alwyn L, Baker D, Witek K, Jupe F, Bryan G, Hein I, Jones JDG, Clark MD. Targeted capture and sequencing of gene-sized DNA molecules. Biotechniques 2016; 61:315-322. [PMID: 27938323 DOI: 10.2144/000114484] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 10/06/2016] [Indexed: 11/23/2022] Open
Abstract
Targeted capture provides an efficient and sensitive means for sequencing specific genomic regions in a high-throughput manner. To date, this method has mostly been used to capture exons from the genome (the exome) using short insert libraries and short-read sequencing technology, enabling the identification of genetic variants or new members of large gene families. Sequencing larger molecules results in the capture of whole genes, including intronic and intergenic sequences that are typically more polymorphic and allow the resolution of the gene structure of homologous genes, which are often clustered together on the chromosome. Here, we describe an improved method for the capture and single-molecule sequencing of DNA molecules as large as 7 kb by means of size selection and optimized PCR conditions. Our approach can be used to capture, sequence, and distinguish between similar members of the NB-LRR gene family-key genes in plant immune systems.
Collapse
Affiliation(s)
- Michael Giolai
- Earlham Institute (EI), Norwich Research Park, Norwich, UK
| | | | - Walter Verweij
- Earlham Institute (EI), Norwich Research Park, Norwich, UK
| | | | - David Baker
- Earlham Institute (EI), Norwich Research Park, Norwich, UK
| | - Kamil Witek
- The Sainsbury Laboratory, Norwich Research Park, Norwich, UK
| | - Florian Jupe
- The Sainsbury Laboratory, Norwich Research Park, Norwich, UK.,Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA
| | | | - Ingo Hein
- The James Hutton Institute, Dundee, UK
| | | | - Matthew D Clark
- Earlham Institute (EI), Norwich Research Park, Norwich, UK.,School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, UK
| |
Collapse
|
39
|
Dapprich J, Ferriola D, Mackiewicz K, Clark PM, Rappaport E, D’Arcy M, Sasson A, Gai X, Schug J, Kaestner KH, Monos D. The next generation of target capture technologies - large DNA fragment enrichment and sequencing determines regional genomic variation of high complexity. BMC Genomics 2016; 17:486. [PMID: 27393338 PMCID: PMC4938946 DOI: 10.1186/s12864-016-2836-6] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Accepted: 06/15/2016] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND The ability to capture and sequence large contiguous DNA fragments represents a significant advancement towards the comprehensive characterization of complex genomic regions. While emerging sequencing platforms are capable of producing several kilobases-long reads, the fragment sizes generated by current DNA target enrichment technologies remain a limiting factor, producing DNA fragments generally shorter than 1 kbp. The DNA enrichment methodology described herein, Region-Specific Extraction (RSE), produces DNA segments in excess of 20 kbp in length. Coupling this enrichment method to appropriate sequencing platforms will significantly enhance the ability to generate complete and accurate sequence characterization of any genomic region without the need for reference-based assembly. RESULTS RSE is a long-range DNA target capture methodology that relies on the specific hybridization of short (20-25 base) oligonucleotide primers to selected sequence motifs within the DNA target region. These capture primers are then enzymatically extended on the 3'-end, incorporating biotinylated nucleotides into the DNA. Streptavidin-coated beads are subsequently used to pull-down the original, long DNA template molecules via the newly synthesized, biotinylated DNA that is bound to them. We demonstrate the accuracy, simplicity and utility of the RSE method by capturing and sequencing a 4 Mbp stretch of the major histocompatibility complex (MHC). Our results show an average depth of coverage of 164X for the entire MHC. This depth of coverage contributes significantly to a 99.94 % total coverage of the targeted region and to an accuracy that is over 99.99 %. CONCLUSIONS RSE represents a cost-effective target enrichment method capable of producing sequencing templates in excess of 20 kbp in length. The utility of our method has been proven to generate superior coverage across the MHC as compared to other commercially available methodologies, with the added advantage of producing longer sequencing templates amenable to DNA sequencing on recently developed platforms. Although our demonstration of the method does not utilize these DNA sequencing platforms directly, our results indicate that the capture of long DNA fragments produce superior coverage of the targeted region.
Collapse
Affiliation(s)
| | - Deborah Ferriola
- />Generation Biotech, Lawrenceville, NJ 08648 USA
- />Department of Pathology and Laboratory Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Kate Mackiewicz
- />Department of Pathology and Laboratory Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Peter M. Clark
- />Department of Pathology and Laboratory Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Eric Rappaport
- />Nucleic Acids & Protein Core Facility, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Monica D’Arcy
- />The Center for Biomedical Informatics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Ariella Sasson
- />The Center for Biomedical Informatics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Xiaowu Gai
- />The Center for Biomedical Informatics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Jonathan Schug
- />Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104 USA
| | - Klaus H. Kaestner
- />Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104 USA
| | - Dimitri Monos
- />Department of Pathology and Laboratory Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
- />The Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104 USA
| |
Collapse
|
40
|
Yuan B, Neira J, Gu S, Harel T, Liu P, Briceño I, Elsea SH, Gómez A, Potocki L, Lupski JR. Nonrecurrent PMP22-RAI1 contiguous gene deletions arise from replication-based mechanisms and result in Smith-Magenis syndrome with evident peripheral neuropathy. Hum Genet 2016; 135:1161-74. [PMID: 27386852 DOI: 10.1007/s00439-016-1703-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 06/21/2016] [Indexed: 11/29/2022]
Abstract
Hereditary neuropathy with liability to pressure palsies (HNPP) and Smith-Magenis syndrome (SMS) are genomic disorders associated with deletion copy number variants involving chromosome 17p12 and 17p11.2, respectively. Nonallelic homologous recombination (NAHR)-mediated recurrent deletions are responsible for the majority of HNPP and SMS cases; the rearrangement products encompass the key dosage-sensitive genes PMP22 and RAI1, respectively, and result in haploinsufficiency for these genes. Less frequently, nonrecurrent genomic rearrangements occur at this locus. Contiguous gene duplications encompassing both PMP22 and RAI1, i.e., PMP22-RAI1 duplications, have been investigated, and replication-based mechanisms rather than NAHR have been proposed for these rearrangements. In the current study, we report molecular and clinical characterizations of six subjects with the reciprocal phenomenon of deletions spanning both genes, i.e., PMP22-RAI1 deletions. Molecular studies utilizing high-resolution array comparative genomic hybridization and breakpoint junction sequencing identified mutational signatures that were suggestive of replication-based mechanisms. Systematic clinical studies revealed features consistent with SMS, including features of intellectual disability, speech and gross motor delays, behavioral problems and ocular abnormalities. Five out of six subjects presented clinical signs and/or objective electrophysiologic studies of peripheral neuropathy. Clinical profiling may improve the clinical management of this unique group of subjects, as the peripheral neuropathy can be more severe or of earlier onset as compared to SMS patients having the common recurrent deletion. Moreover, the current study, in combination with the previous report of PMP22-RAI1 duplications, contributes to the understanding of rare complex phenotypes involving multiple dosage-sensitive genes from a genetic mechanistic standpoint.
Collapse
Affiliation(s)
- Bo Yuan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Juanita Neira
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Shen Gu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Tamar Harel
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Ignacio Briceño
- Instituto de Genética Humana, Facultad de Medicina, Pontificia Universidad Javeriana, Bogotá, Colombia
- Instituto de Referencia Andino, Bogotá, Colombia
- Facultad de Medicina, Universidad de La Sabana, Chía, Colombia
| | - Sarah H Elsea
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Alberto Gómez
- Instituto de Genética Humana, Facultad de Medicina, Pontificia Universidad Javeriana, Bogotá, Colombia
- Instituto de Referencia Andino, Bogotá, Colombia
| | - Lorraine Potocki
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
- Texas Children's Hospital, Houston, TX, 77030, USA
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA.
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA.
- Texas Children's Hospital, Houston, TX, 77030, USA.
| |
Collapse
|
41
|
Gasc C, Peyretaillade E, Peyret P. Sequence capture by hybridization to explore modern and ancient genomic diversity in model and nonmodel organisms. Nucleic Acids Res 2016; 44:4504-18. [PMID: 27105841 PMCID: PMC4889952 DOI: 10.1093/nar/gkw309] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Revised: 04/07/2016] [Accepted: 04/12/2016] [Indexed: 12/25/2022] Open
Abstract
The recent expansion of next-generation sequencing has significantly improved biological research. Nevertheless, deep exploration of genomes or metagenomic samples remains difficult because of the sequencing depth and the associated costs required. Therefore, different partitioning strategies have been developed to sequence informative subsets of studied genomes. Among these strategies, hybridization capture has proven to be an innovative and efficient tool for targeting and enriching specific biomarkers in complex DNA mixtures. It has been successfully applied in numerous areas of biology, such as exome resequencing for the identification of mutations underlying Mendelian or complex diseases and cancers, and its usefulness has been demonstrated in the agronomic field through the linking of genetic variants to agricultural phenotypic traits of interest. Moreover, hybridization capture has provided access to underexplored, but relevant fractions of genomes through its ability to enrich defined targets and their flanking regions. Finally, on the basis of restricted genomic information, this method has also allowed the expansion of knowledge of nonreference species and ancient genomes and provided a better understanding of metagenomic samples. In this review, we present the major advances and discoveries permitted by hybridization capture and highlight the potency of this approach in all areas of biology.
Collapse
Affiliation(s)
- Cyrielle Gasc
- EA 4678 CIDAM, Université d'Auvergne, Clermont-Ferrand, 63001, France
| | | | - Pierre Peyret
- EA 4678 CIDAM, Université d'Auvergne, Clermont-Ferrand, 63001, France
| |
Collapse
|
42
|
Athanasio CG, Chipman JK, Viant MR, Mirbahai L. Optimisation of DNA extraction from the crustacean Daphnia. PeerJ 2016; 4:e2004. [PMID: 27190714 PMCID: PMC4867708 DOI: 10.7717/peerj.2004] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Accepted: 04/12/2016] [Indexed: 11/20/2022] Open
Abstract
Daphnia are key model organisms for mechanistic studies of phenotypic plasticity, adaptation and microevolution, which have led to an increasing demand for genomics resources. A key step in any genomics analysis, such as high-throughput sequencing, is the availability of sufficient and high quality DNA. Although commercial kits exist to extract genomic DNA from several species, preparation of high quality DNA from Daphnia spp. and other chitinous species can be challenging. Here, we optimise methods for tissue homogenisation, DNA extraction and quantification customised for different downstream analyses (e.g., LC-MS/MS, Hiseq, mate pair sequencing or Nanopore). We demonstrate that if Daphnia magna are homogenised as whole animals (including the carapace), absorbance-based DNA quantification methods significantly over-estimate the amount of DNA, resulting in using insufficient starting material for experiments, such as preparation of sequencing libraries. This is attributed to the high refractive index of chitin in Daphnia's carapace at 260 nm. Therefore, unless the carapace is removed by overnight proteinase digestion, the extracted DNA should be quantified with fluorescence-based methods. However, overnight proteinase digestion will result in partial fragmentation of DNA therefore the prepared DNA is not suitable for downstream methods that require high molecular weight DNA, such as PacBio, mate pair sequencing and Nanopore. In conclusion, we found that the MasterPure DNA purification kit, coupled with grinding of frozen tissue, is the best method for extraction of high molecular weight DNA as long as the extracted DNA is quantified with fluorescence-based methods. This method generated high yield and high molecular weight DNA (3.10 ± 0.63 ng/µg dry mass, fragments >60 kb), free of organic contaminants (phenol, chloroform) and is suitable for large number of downstream analyses.
Collapse
Affiliation(s)
| | - James K Chipman
- School of Biosciences, University of Birmingham , Birmingham , United Kingdom
| | - Mark R Viant
- School of Biosciences, University of Birmingham , Birmingham , United Kingdom
| | - Leda Mirbahai
- School of Biosciences, University of Birmingham , Birmingham , United Kingdom
| |
Collapse
|
43
|
Transformation-associated recombination (TAR) cloning for genomics studies and synthetic biology. Chromosoma 2016; 125:621-32. [PMID: 27116033 DOI: 10.1007/s00412-016-0588-3] [Citation(s) in RCA: 75] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Revised: 03/22/2016] [Accepted: 03/29/2016] [Indexed: 12/25/2022]
Abstract
Transformation-associated recombination (TAR) cloning represents a unique tool for isolation and manipulation of large DNA molecules. The technique exploits a high level of homologous recombination in the yeast Sacharomyces cerevisiae. So far, TAR cloning is the only method available to selectively recover chromosomal segments up to 300 kb in length from complex and simple genomes. In addition, TAR cloning allows the assembly and cloning of entire microbe genomes up to several Mb as well as engineering of large metabolic pathways. In this review, we summarize applications of TAR cloning for functional/structural genomics and synthetic biology.
Collapse
|
44
|
Accelerated cloning of a potato late blight-resistance gene using RenSeq and SMRT sequencing. Nat Biotechnol 2016; 34:656-60. [PMID: 27111721 DOI: 10.1038/nbt.3540] [Citation(s) in RCA: 149] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 03/15/2016] [Indexed: 01/18/2023]
Abstract
Global yields of potato and tomato crops have fallen owing to potato late blight disease, which is caused by Phytophthora infestans. Although most commercial potato varieties are susceptible to blight, many wild potato relatives show variation for resistance and are therefore a potential source of Resistance to P. infestans (Rpi) genes. Resistance breeding has exploited Rpi genes from closely related tuber-bearing potato relatives, but is laborious and slow. Here we report that the wild, diploid non-tuber-bearing Solanum americanum harbors multiple Rpi genes. We combine resistance (R) gene sequence capture (RenSeq) with single-molecule real-time (SMRT) sequencing (SMRT RenSeq) to clone Rpi-amr3i. This technology should enable de novo assembly of complete nucleotide-binding, leucine-rich repeat receptor (NLR) genes, their regulatory elements and complex multi-NLR loci from uncharacterized germplasm. SMRT RenSeq can be applied to rapidly clone multiple R genes for engineering pathogen-resistant crops.
Collapse
|
45
|
Gagic D, Ciric M, Wen WX, Ng F, Rakonjac J. Exploring the Secretomes of Microbes and Microbial Communities Using Filamentous Phage Display. Front Microbiol 2016; 7:429. [PMID: 27092113 PMCID: PMC4823517 DOI: 10.3389/fmicb.2016.00429] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 03/17/2016] [Indexed: 01/12/2023] Open
Abstract
Microbial surface and secreted proteins (the secretome) contain a large number of proteins that interact with other microbes, host and/or environment. These proteins are exported by the coordinated activities of the protein secretion machinery present in the cell. A group of bacteriophage, called filamentous phage, have the ability to hijack bacterial protein secretion machinery in order to amplify and assemble via a secretion-like process. This ability has been harnessed in the use of filamentous phage of Escherichia coli in biotechnology applications, including screening large libraries of variants for binding to “bait” of interest, from tissues in vivo to pure proteins or even inorganic substrates. In this review we discuss the roles of secretome proteins in pathogenic and non-pathogenic bacteria and corresponding secretion pathways. We describe the basics of phage display technology and its variants applied to discovery of bacterial proteins that are implicated in colonization of host tissues and pathogenesis, as well as vaccine candidates through filamentous phage display library screening. Secretome selection aided by next-generation sequence analysis was successfully applied for selective display of the secretome at a microbial community scale, the latter revealing the richness of secretome functions of interest and surprising versatility in filamentous phage display of secretome proteins from large number of Gram-negative as well as Gram-positive bacteria and archaea.
Collapse
Affiliation(s)
- Dragana Gagic
- Institute of Fundamental Sciences, Massey UniversityPalmerston North, New Zealand; Animal Science, Grasslands Research Centre, AgResearch Ltd, Palmerston NorthNew Zealand
| | - Milica Ciric
- Institute of Fundamental Sciences, Massey UniversityPalmerston North, New Zealand; Animal Science, Grasslands Research Centre, AgResearch Ltd, Palmerston NorthNew Zealand
| | - Wesley X Wen
- Institute of Fundamental Sciences, Massey University Palmerston North, New Zealand
| | - Filomena Ng
- Animal Science, Grasslands Research Centre, AgResearch Ltd, Palmerston North New Zealand
| | - Jasna Rakonjac
- Institute of Fundamental Sciences, Massey University Palmerston North, New Zealand
| |
Collapse
|
46
|
Park SJ, Saito-Adachi M, Komiyama Y, Nakai K. Advances, practice, and clinical perspectives in high-throughput sequencing. Oral Dis 2016; 22:353-64. [DOI: 10.1111/odi.12403] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Revised: 11/16/2015] [Accepted: 11/16/2015] [Indexed: 01/06/2023]
Affiliation(s)
- S-J Park
- Human Genome Center; The Institute of Medical Science; The University of Tokyo; Tokyo Japan
| | - M Saito-Adachi
- Division of Cancer Genomics; National Cancer Center Research Institute; Tokyo Japan
| | - Y Komiyama
- Human Genome Center; The Institute of Medical Science; The University of Tokyo; Tokyo Japan
| | - K Nakai
- Human Genome Center; The Institute of Medical Science; The University of Tokyo; Tokyo Japan
| |
Collapse
|
47
|
Yuan B, Harel T, Gu S, Liu P, Burglen L, Chantot-Bastaraud S, Gelowani V, Beck C, Carvalho C, Cheung S, Coe A, Malan V, Munnich A, Magoulas P, Potocki L, Lupski J. Nonrecurrent 17p11.2p12 Rearrangement Events that Result in Two Concomitant Genomic Disorders: The PMP22-RAI1 Contiguous Gene Duplication Syndrome. Am J Hum Genet 2015; 97:691-707. [PMID: 26544804 DOI: 10.1016/j.ajhg.2015.10.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 10/05/2015] [Indexed: 12/31/2022] Open
Abstract
The genomic duplication associated with Potocki-Lupski syndrome (PTLS) maps in close proximity to the duplication associated with Charcot-Marie-Tooth disease type 1A (CMT1A). PTLS is characterized by hypotonia, failure to thrive, reduced body weight, intellectual disability, and autistic features. CMT1A is a common autosomal dominant distal symmetric peripheral polyneuropathy. The key dosage-sensitive genes RAI1 and PMP22 are respectively associated with PTLS and CMT1A. Recurrent duplications accounting for the majority of subjects with these conditions are mediated by nonallelic homologous recombination between distinct low-copy repeat (LCR) substrates. The LCRs flanking a contiguous genomic interval encompassing both RAI1 and PMP22 do not share extensive homology; thus, duplications encompassing both loci are rare and potentially generated by a different mutational mechanism. We characterized genomic rearrangements that simultaneously duplicate PMP22 and RAI1, including nine potential complex genomic rearrangements, in 23 subjects by high-resolution array comparative genomic hybridization and breakpoint junction sequencing. Insertions and microhomologies were found at the breakpoint junctions, suggesting potential replicative mechanisms for rearrangement formation. At the breakpoint junctions of these nonrecurrent rearrangements, enrichment of repetitive DNA sequences was observed, indicating that they might predispose to genomic instability and rearrangement. Clinical evaluation revealed blended PTLS and CMT1A phenotypes with a potential earlier onset of neuropathy. Moreover, additional clinical findings might be observed due to the extra duplicated material included in the rearrangements. Our genomic analysis suggests replicative mechanisms as a predominant mechanism underlying PMP22-RAI1 contiguous gene duplications and provides further evidence supporting the role of complex genomic architecture in genomic instability.
Collapse
|
48
|
Hargreaves CE, Rose-Zerilli MJJ, Machado LR, Iriyama C, Hollox EJ, Cragg MS, Strefford JC. Fcγ receptors: genetic variation, function, and disease. Immunol Rev 2015; 268:6-24. [DOI: 10.1111/imr.12341] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Chantal E. Hargreaves
- Cancer Genomics Group; Cancer Sciences; Faculty of Medicine; University of Southampton; Southampton UK
- Antibody and Vaccine Group; Cancer Sciences; Faculty of Medicine; University of Southampton; Southampton UK
| | | | - Lee R. Machado
- Department of Genetics; University of Leicester; Leicester UK
- School of Health; University of Northampton; Northampton UK
| | - Chisako Iriyama
- Department of Hematology and Oncology; Nagoya University Graduate School of Medicine; Nagoya Japan
| | | | - Mark S. Cragg
- Antibody and Vaccine Group; Cancer Sciences; Faculty of Medicine; University of Southampton; Southampton UK
| | - Jonathan C. Strefford
- Cancer Genomics Group; Cancer Sciences; Faculty of Medicine; University of Southampton; Southampton UK
| |
Collapse
|
49
|
Analysis of Pre-Analytic Factors Affecting the Success of Clinical Next-Generation Sequencing of Solid Organ Malignancies. Cancers (Basel) 2015; 7:1699-715. [PMID: 26343728 PMCID: PMC4586792 DOI: 10.3390/cancers7030859] [Citation(s) in RCA: 92] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2015] [Revised: 08/20/2015] [Accepted: 08/21/2015] [Indexed: 01/28/2023] Open
Abstract
Application of next-generation sequencing (NGS) technology to routine clinical practice has enabled characterization of personalized cancer genomes to identify patients likely to have a response to targeted therapy. The proper selection of tumor sample for downstream NGS based mutational analysis is critical to generate accurate results and to guide therapeutic intervention. However, multiple pre-analytic factors come into play in determining the success of NGS testing. In this review, we discuss pre-analytic requirements for AmpliSeq PCR-based sequencing using Ion Torrent Personal Genome Machine (PGM) (Life Technologies), a NGS sequencing platform that is often used by clinical laboratories for sequencing solid tumors because of its low input DNA requirement from formalin fixed and paraffin embedded tissue. The success of NGS mutational analysis is affected not only by the input DNA quantity but also by several other factors, including the specimen type, the DNA quality, and the tumor cellularity. Here, we review tissue requirements for solid tumor NGS based mutational analysis, including procedure types, tissue types, tumor volume and fraction, decalcification, and treatment effects.
Collapse
|
50
|
Hehir-Kwa JY, Pfundt R, Veltman JA. Exome sequencing and whole genome sequencing for the detection of copy number variation. Expert Rev Mol Diagn 2015; 15:1023-32. [PMID: 26088785 DOI: 10.1586/14737159.2015.1053467] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Many laboratories now use genomic microarrays as their first-tier diagnostic test for copy number variation (CNV) detection. In addition, whole exome sequencing is increasingly being offered as a diagnostic test for heterogeneous disorders. Although mostly used for the detection of point mutations and small insertion-deletions, exome sequencing can also be used to call CNVs, allowing combined small and large variant analysis. Whole genome sequencing in addition to these advantages also offers the potential to characterize CNVs to unprecedented levels of accuracy, providing position and orientation information. In this review, we discuss the clinical potential of CNV identification in whole exome sequencing and whole genome sequencing data and the implications this has on diagnostic laboratories.
Collapse
Affiliation(s)
- Jayne Y Hehir-Kwa
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | | | | |
Collapse
|