1
|
Shim Y, Koo YK, Shin S, Lee ST, Lee KA, Choi JR. Comparison of Optical Genome Mapping With Conventional Diagnostic Methods for Structural Variant Detection in Hematologic Malignancies. Ann Lab Med 2024; 44:324-334. [PMID: 38433573 PMCID: PMC10961627 DOI: 10.3343/alm.2023.0339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/21/2023] [Accepted: 02/13/2024] [Indexed: 03/05/2024] Open
Abstract
Background Structural variants (SVs) are currently analyzed using a combination of conventional methods; however, this approach has limitations. Optical genome mapping (OGM), an emerging technology for detecting SVs using a single-molecule strategy, has the potential to replace conventional methods. We compared OGM with conventional diagnostic methods for detecting SVs in various hematologic malignancies. Methods Residual bone marrow aspirates from 27 patients with hematologic malignancies in whom SVs were observed using conventional methods (chromosomal banding analysis, FISH, an RNA fusion panel, and reverse transcription PCR) were analyzed using OGM. The concordance between the OGM and conventional method results was evaluated. Results OGM showed concordance in 63% (17/27) and partial concordance in 37% (10/27) of samples. OGM detected 76% (52/68) of the total SVs correctly (concordance rate for each type of SVs: aneuploidies, 83% [15/18]; balanced translocation, 80% [12/15] unbalanced translocation, 54% [7/13] deletions, 81% [13/16]; duplications, 100% [2/2] inversion 100% [1/1]; insertion, 100% [1/1]; marker chromosome, 0% [0/1]; isochromosome, 100% [1/1]). Sixteen discordant results were attributed to the involvement of centromeric/telomeric regions, detection sensitivity, and a low mapping rate and coverage. OGM identified additional SVs, including submicroscopic SVs and novel fusions, in five cases. Conclusions OGM shows a high level of concordance with conventional diagnostic methods for the detection of SVs and can identify novel variants, suggesting its potential utility in enabling more comprehensive SV analysis in routine diagnostics of hematologic malignancies, although further studies and improvements are required.
Collapse
Affiliation(s)
- Yeeun Shim
- Brain Korea 21 PLUS Project for Medical Science, Yonsei University, Seoul, Korea
- MDxK (Molecular Diagnostics Korea), Inc., Gwacheon, Korea
| | - Yu-Kyung Koo
- Department of Laboratory Medicine, Yonsei University College of Medicine, Seoul, Korea
| | - Saeam Shin
- Department of Laboratory Medicine, Yonsei University College of Medicine, Seoul, Korea
| | - Seung-Tae Lee
- Department of Laboratory Medicine, Yonsei University College of Medicine, Seoul, Korea
- Dxome Co., Ltd., Seongnam, Korea
| | - Kyung-A Lee
- Department of Laboratory Medicine, Yonsei University College of Medicine, Seoul, Korea
| | - Jong Rak Choi
- Department of Laboratory Medicine, Yonsei University College of Medicine, Seoul, Korea
- Dxome Co., Ltd., Seongnam, Korea
| |
Collapse
|
2
|
Lestringant V, Guermouche-Flament H, Jimenez-Pocquet M, Gaillard JB, Penther D. Cytogenetics in the management of hematological malignancies: An overview of alternative technologies for cytogenetic characterization. Curr Res Transl Med 2024; 72:103440. [PMID: 38447270 DOI: 10.1016/j.retram.2024.103440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 12/22/2023] [Accepted: 01/11/2024] [Indexed: 03/08/2024]
Abstract
Genomic characterization is an essential part of the clinical management of hematological malignancies for diagnostic, prognostic and therapeutic purposes. Although CBA and FISH are still the gold standard in hematology for the detection of CNA and SV, some alternative technologies are intended to complement their deficiencies or even replace them in the more or less near future. In this article, we provide a technological overview of these alternatives. CMA is the historical and well established technique for the high-resolution detection of CNA. For SV detection, there are emerging techniques based on the study of chromatin conformation and more established ones such as RTMLPA for the detection of fusion transcripts and RNA-seq to reveal the molecular consequences of SV. Comprehensive techniques that detect both CNA and SV are the most interesting because they provide all the information in a single examination. Among these, OGM is a promising emerging higher-solution technique that offers a complete solution at a contained cost, at the expense of a relatively low throughput per machine. WGS remains the most adaptable solution, with long-read approaches enabling very high-resolution detection of CAs, but requiring a heavy bioinformatics installation and at a still high cost. However, the development of high-resolution genome-wide detection techniques for CAs allows for a much better description of chromoanagenesis. Therefore, we have included in this review an update on the various existing mechanisms and their consequences and implications, especially prognostic, in hematological malignancies.
Collapse
Affiliation(s)
| | | | | | - Jean-Baptiste Gaillard
- Unité de Génétique Chromosomique, Service de Génétique moléculaire et cytogénomique, CHU Montpellier, Montpellier, France
| | | |
Collapse
|
3
|
Yu S, Liu Z, Li M, Zhou D, Hua P, Cheng H, Fan W, Xu Y, Liu D, Liang S, Zhang Y, Xie M, Tang J, Jiang Y, Hou S, Zhou Z. Resequencing of a Pekin duck breeding population provides insights into the genomic response to short-term artificial selection. Gigascience 2023; 12:giad016. [PMID: 36971291 PMCID: PMC10041536 DOI: 10.1093/gigascience/giad016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 02/04/2023] [Accepted: 02/27/2023] [Indexed: 03/29/2023] Open
Abstract
BACKGROUND Short-term, intense artificial selection drives fast phenotypic changes in domestic animals and leaves imprints on their genomes. However, the genetic basis of this selection response is poorly understood. To better address this, we employed the Pekin duck Z2 pure line, in which the breast muscle weight was increased nearly 3-fold after 10 generations of breeding. We denovo assembled a high-quality reference genome of a female Pekin duck of this line (GCA_003850225.1) and identified 8.60 million genetic variants in 119 individuals among 10 generations of the breeding population. RESULTS We identified 53 selected regions between the first and tenth generations, and 93.8% of the identified variations were enriched in regulatory and noncoding regions. Integrating the selection signatures and genome-wide association approach, we found that 2 regions covering 0.36 Mb containing UTP25 and FBRSL1 were most likely to contribute to breast muscle weight improvement. The major allele frequencies of these 2 loci increased gradually with each generation following the same trend. Additionally, we found that a copy number variation region containing the entire EXOC4 gene could explain 1.9% of the variance in breast muscle weight, indicating that the nervous system may play a role in economic trait improvement. CONCLUSIONS Our study not only provides insights into genomic dynamics under intense artificial selection but also provides resources for genomics-enabled improvements in duck breeding.
Collapse
Affiliation(s)
- Simeng Yu
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Zihua Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | - Ming Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | - Dongke Zhou
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | - Ping Hua
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | - Hong Cheng
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | - Wenlei Fan
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Yaxi Xu
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Dapeng Liu
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Suyun Liang
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Yunsheng Zhang
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Ming Xie
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Jing Tang
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | - Shuisheng Hou
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Zhengkui Zhou
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs; Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| |
Collapse
|
4
|
Rehel DA, Polson JM. Equilibrium behaviour of two cavity-confined polymers: effects of polymer width and system asymmetries. SOFT MATTER 2023; 19:1092-1108. [PMID: 36625101 DOI: 10.1039/d2sm01413k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Experiments using nanofluidic devices have proven effective in characterizing the physical properties of polymers confined to small cavities. Two recent studies using such methods examined the organization and dynamics of two DNA molecules in box-like cavities with strong confinement in one direction and with square and elliptical cross sections in the lateral plane. Motivated by these experiments, we employ Monte Carlo and Brownian dynamics simulations to study the physical behaviour of two polymers confined to small cavities with shapes comparable to those used in the experiments. We quantify the effects of varying the following polymer properties and confinement dimensions on the organization and dynamics of the polymers: the polymer width, the polymer contour length ratio, the cavity cross-sectional area, and the degree of cavity elongation for cavities with rectangular and elliptical cross sections. We find that the tendency for polymers to segregate is enhanced by increasing polymer width. For sufficiently small cavities, increasing cavity elongation promotes segregation and localization of identical polymers to opposite sides of the cavity along its long axis. A free-energy barrier controls the rate of polymers swapping positions, and the observed dynamics are roughly in accord with predictions of a simple theoretical model. Increasing the contour length difference between polymers significantly affects their organization in the cavity. In the case of a large linear polymer co-trapped with a small ring polymer in an elliptical cavity, the small polymer tends to lie near the lateral confining walls, and especially at the cavity poles for highly elongated ellipses.
Collapse
Affiliation(s)
- Desiree A Rehel
- Department of Physics, University of Prince Edward Island, 550 University Ave., Charlottetown, Prince Edward Island, C1A 4P3, Canada.
| | - James M Polson
- Department of Physics, University of Prince Edward Island, 550 University Ave., Charlottetown, Prince Edward Island, C1A 4P3, Canada.
| |
Collapse
|
5
|
Maestri S, Gambino G, Lopatriello G, Minio A, Perrone I, Cosentino E, Giovannone B, Marcolungo L, Alfano M, Rombauts S, Cantu D, Rossato M, Delledonne M, Calderón L. 'Nebbiolo' genome assembly allows surveying the occurrence and functional implications of genomic structural variations in grapevines (Vitis vinifera L.). BMC Genomics 2022; 23:159. [PMID: 35209840 PMCID: PMC8867635 DOI: 10.1186/s12864-022-08389-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 02/15/2022] [Indexed: 12/18/2022] Open
Abstract
Background ‘Nebbiolo’ is a grapevine cultivar typical of north-western Italy, appreciated for producing high-quality red wines. Grapevine cultivars are characterized by possessing highly heterozygous genomes, including a great incidence of genomic rearrangements larger than 50 bp, so called structural variations (SVs). Even though abundant, SVs are an under-explored source of genetic variation mainly due to methodological limitations at their detection. Results We employed a multiple platform approach to produce long-range genomic data for two different ‘Nebbiolo’ clones, namely: optical mapping, long-reads and linked-reads. We performed a haplotype-resolved de novo assembly for cultivar ‘Nebbiolo’ (clone CVT 71) and used an ab-initio strategy to annotate it. The annotated assembly enhanced our ability to detect SVs, enabling the study of genomic regions not present in the grapevines’ reference genome and accounting for their functional implications. We performed variant calling analyses at three different organizational levels: i) between haplotypes of clone CVT 71 (primary assembly vs haplotigs), ii) between ‘Nebbiolo’ and ‘Cabernet Sauvignon’ assemblies and iii) between clones CVT 71 and CVT 185, representing different ‘Nebbiolo’ biotypes. The cumulative size of non-redundant merged SVs indicated a total of 79.6 Mbp for the first comparison and 136.1 Mbp for the second one, while no SVs were detected for the third comparison. Interestingly, SVs differentiating cultivars and haplotypes affected similar numbers of coding genes. Conclusions Our results suggest that SVs accumulation rate and their functional implications in ‘Nebbiolo’ genome are highly-dependent on the organizational level under study. SVs are abundant when comparing ‘Nebbiolo’ to a different cultivar or the two haplotypes of the same individual, while they turned absent between the two analysed clones. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08389-9.
Collapse
Affiliation(s)
- Simone Maestri
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Giorgio Gambino
- Institute for Sustainable Plant Protection, National Research Council (IPSP-CNR), Strada delle Cacce 73, 10135, Torino, Italy
| | - Giulia Lopatriello
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Andrea Minio
- Department of Viticulture & Enology, University of California Davis, 595 Hilgard Lane, Davis, CA, 95616, USA
| | - Irene Perrone
- Institute for Sustainable Plant Protection, National Research Council (IPSP-CNR), Strada delle Cacce 73, 10135, Torino, Italy
| | - Emanuela Cosentino
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Barbara Giovannone
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Luca Marcolungo
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Massimiliano Alfano
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Stephane Rombauts
- Department of Bioinformatics and Systems Biology, Ghent University, Technologiepark 927, B-9052, Gent, Belgium.,VIB Center for Plant Systems Biology, 9052, Gent, Belgium
| | - Dario Cantu
- Department of Viticulture & Enology, University of California Davis, 595 Hilgard Lane, Davis, CA, 95616, USA
| | - Marzia Rossato
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy
| | - Massimo Delledonne
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134, Verona, Italy.
| | - Luciano Calderón
- Instituto de Biología Agrícola de Mendoza (IBAM, CONICET-UNCuyo), Almirante Brown 500, M5528AHB. Chacras de Coria, Mendoza, Argentina.
| |
Collapse
|
6
|
Wang L, Zhu T, Rodriguez JC, Deal KR, Dubcovsky J, McGuire PE, Lux T, Spannagl M, Mayer KFX, Baldrich P, Meyers BC, Huo N, Gu YQ, Zhou H, Devos KM, Bennetzen JL, Unver T, Budak H, Gulick PJ, Galiba G, Kalapos B, Nelson DR, Li P, You FM, Luo MC, Dvorak J. Aegilops tauschii genome assembly Aet v5.0 features greater sequence contiguity and improved annotation. G3-GENES GENOMES GENETICS 2021; 11:6369516. [PMID: 34515796 PMCID: PMC8664484 DOI: 10.1093/g3journal/jkab325] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Accepted: 08/31/2021] [Indexed: 01/01/2023]
Abstract
Aegilops tauschii is the donor of the D subgenome of hexaploid wheat and an important genetic resource. The reference-quality genome sequence Aet v4.0 for Ae. tauschii acc. AL8/78 was therefore an important milestone for wheat biology and breeding. Further advances in sequencing acc. AL8/78 and release of the Aet v5.0 sequence assembly are reported here. Two new optical maps were constructed and used in the revision of pseudomolecules. Gaps were closed with Pacific Biosciences long-read contigs, decreasing the gap number by 38,899. Transposable elements and protein-coding genes were reannotated. The number of annotated high-confidence genes was reduced from 39,635 in Aet v4.0 to 32,885 in Aet v5.0. A total of 2245 biologically important genes, including those affecting plant phenology, grain quality, and tolerance of abiotic stresses in wheat, was manually annotated and disease-resistance genes were annotated by a dedicated pipeline. Disease-resistance genes encoding nucleotide-binding site domains, receptor-like protein kinases, and receptor-like proteins were preferentially located in distal chromosome regions, whereas those encoding transmembrane coiled-coil proteins were dispersed more evenly along the chromosomes. Discovery, annotation, and expression analyses of microRNA (miRNA) precursors, mature miRNAs, and phasiRNAs are reported, including miRNA target genes. Other small RNAs, such as hc-siRNAs and tRFs, were characterized. These advances enhance the utility of the Ae. tauschii genome sequence for wheat genetics, biotechnology, and breeding.
Collapse
Affiliation(s)
- Le Wang
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Juan C Rodriguez
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Karin R Deal
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Jorge Dubcovsky
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Patrick E McGuire
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Thomas Lux
- Plant Genome and Systems Biology, Helmholtz Zentrum München, Munich 85764, Germany
| | - Manuel Spannagl
- Plant Genome and Systems Biology, Helmholtz Zentrum München, Munich 85764, Germany
| | - Klaus F X Mayer
- Plant Genome and Systems Biology, Helmholtz Zentrum München, Munich 85764, Germany
| | - Patricia Baldrich
- Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA
| | - Blake C Meyers
- Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA.,University of Missouri, Columbia, Division of Plant Sciences, Columbia, Missouri 65211, USA
| | - Naxin Huo
- Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, California 94710, USA
| | - Yong Q Gu
- Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, California 94710, USA
| | - Hongye Zhou
- Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, USA
| | - Katrien M Devos
- Institute of Plant Breeding, Genetics and Genomics (Dept. of Crop & Soil Sciences) and Dept. of Plant Biology, University of Georgia, Athens, Georgia 30602, USA
| | | | - Turgay Unver
- Ficus Biotechnology, Ostim Teknopark, Ankara 06374, Turkey
| | - Hikmet Budak
- Montana BioAg Inc., Missoula, Montana 59801, USA
| | - Patrick J Gulick
- Department of Biology, Concordia University, Montreal, Quebec H3G 1M8, Canada
| | - Gabor Galiba
- Department of Biological Resources, Centre for Agricultural Research, Eötvös Loránd Research Network, H-2462 Martonvásár, Hungary.,Department of Environmental Sustainability, IES, Hungarian University of Agriculture and Life Sciences, H-8360 Keszthely, Hungary
| | - Balázs Kalapos
- Department of Biological Resources, Centre for Agricultural Research, Eötvös Loránd Research Network, H-2462 Martonvásár, Hungary
| | - David R Nelson
- University of Tennessee Health Science Center, Memphis, Tennessee 38163, USA
| | - Pingchuan Li
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario K1A 0C5, Canada
| | - Frank M You
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario K1A 0C5, Canada
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| | - Jan Dvorak
- Department of Plant Sciences, University of California, Davis, Davis, California 95616, USA
| |
Collapse
|
7
|
Zhu T, Wang L, Rimbert H, Rodriguez JC, Deal KR, De Oliveira R, Choulet F, Keeble‐Gagnère G, Tibbits J, Rogers J, Eversole K, Appels R, Gu YQ, Mascher M, Dvorak J, Luo M. Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 107:303-314. [PMID: 33893684 PMCID: PMC8360199 DOI: 10.1111/tpj.15289] [Citation(s) in RCA: 196] [Impact Index Per Article: 65.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 04/12/2021] [Accepted: 04/19/2021] [Indexed: 05/09/2023]
Abstract
Until recently, achieving a reference-quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5-Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high-confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.
Collapse
Affiliation(s)
- Tingting Zhu
- Department of Plant SciencesUniversity of CaliforniaDavisCA95616USA
| | - Le Wang
- Department of Plant SciencesUniversity of CaliforniaDavisCA95616USA
| | - Hélène Rimbert
- GDECUniversité Clermont AuvergneINRAEClermont‐Ferrand63000France
| | | | - Karin R. Deal
- Department of Plant SciencesUniversity of CaliforniaDavisCA95616USA
| | | | - Frédéric Choulet
- GDECUniversité Clermont AuvergneINRAEClermont‐Ferrand63000France
| | | | - Josquin Tibbits
- Centre for AgriBioscienceAgriculture VictoriaAgriBioBundooraVIC3083Australia
| | - Jane Rogers
- International Wheat Genome Sequencing ConsortiumEau ClaireWI54701USA
| | - Kellye Eversole
- International Wheat Genome Sequencing ConsortiumEau ClaireWI54701USA
| | - Rudi Appels
- Centre for AgriBioscienceAgriculture VictoriaAgriBioBundooraVIC3083Australia
- International Wheat Genome Sequencing ConsortiumEau ClaireWI54701USA
| | - Yong Q. Gu
- Crop Improvement and Genetics Research UnitUSDA‐ARSAlbanyCA94710USA
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)SeelandGermany
| | - Jan Dvorak
- Department of Plant SciencesUniversity of CaliforniaDavisCA95616USA
| | - Ming‐Cheng Luo
- Department of Plant SciencesUniversity of CaliforniaDavisCA95616USA
| |
Collapse
|
8
|
Polson JM, Rehel DA. Equilibrium organization, conformation, and dynamics of two polymers under box-like confinement. SOFT MATTER 2021; 17:5792-5805. [PMID: 34028486 DOI: 10.1039/d1sm00308a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Motivated by recent nanofluidics experiments, we use Brownian dynamics and Monte Carlo simulations to study the conformation, organization and dynamics of two polymer chains confined to a single box-like cavity. The polymers are modeled as flexible bead-spring chains, and the box has a square cross-section of side length L and a height that is small enough to compress the polymers in that dimension. For sufficiently large L, the system behaviour approaches that of an isolated polymer in a slit. However, the combined effects of crowding and confinement on the polymer organization, conformation and equilibrium dynamics become significant when where is the transverse radius of gyration for a slit geometry. In this regime, the centre-of-mass probability distribution in the transverse plane exhibits a depletion zone near the centre of the cavity (except at very small L) and a 4-fold symmetry with quasi-discrete positions. Reduction in polymer size with decreasing L arises principally from confinement rather than inter-polymer crowding. By contrast, polymer diffusion and internal motion are strongly affected by inter-polymer crowding. The two polymers tend to occupy opposite positions relative to the box centre, about which they diffuse relatively freely. Qualitatively, this static and dynamical behaviour differs significantly from that previously observed for confinement of two polymers to a narrow channel. The simulation results for a suitably chosen box width are qualitatively consistent with results from a recent experimental study of two λ-DNA chains confined to a nanofluidic cavity.
Collapse
Affiliation(s)
- James M Polson
- Department of Physics, University of Prince Edward Island, 550 University Ave., Charlottetown, Prince Edward Island C1A 4P3, Canada.
| | - Desiree A Rehel
- Department of Physics, University of Prince Edward Island, 550 University Ave., Charlottetown, Prince Edward Island C1A 4P3, Canada.
| |
Collapse
|
9
|
Jeffet J, Margalit S, Michaeli Y, Ebenstein Y. Single-molecule optical genome mapping in nanochannels: multidisciplinarity at the nanoscale. Essays Biochem 2021; 65:51-66. [PMID: 33739394 PMCID: PMC8056043 DOI: 10.1042/ebc20200021] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 02/24/2021] [Accepted: 02/26/2021] [Indexed: 12/12/2022]
Abstract
The human genome contains multiple layers of information that extend beyond the genetic sequence. In fact, identical genetics do not necessarily yield identical phenotypes as evident for the case of two different cell types in the human body. The great variation in structure and function displayed by cells with identical genetic background is attributed to additional genomic information content. This includes large-scale genetic aberrations, as well as diverse epigenetic patterns that are crucial for regulating specific cell functions. These genetic and epigenetic patterns operate in concert in order to maintain specific cellular functions in health and disease. Single-molecule optical genome mapping is a high-throughput genome analysis method that is based on imaging long chromosomal fragments stretched in nanochannel arrays. The access to long DNA molecules coupled with fluorescent tagging of various genomic information presents a unique opportunity to study genetic and epigenetic patterns in the genome at a single-molecule level over large genomic distances. Optical mapping entwines synergistically chemical, physical, and computational advancements, to uncover invaluable biological insights, inaccessible by sequencing technologies. Here we describe the method's basic principles of operation, and review the various available mechanisms to fluorescently tag genomic information. We present some of the recent biological and clinical impact enabled by optical mapping and present recent approaches for increasing the method's resolution and accuracy. Finally, we discuss how multiple layers of genomic information may be mapped simultaneously on the same DNA molecule, thus paving the way for characterizing multiple genomic observables on individual DNA molecules.
Collapse
Affiliation(s)
- Jonathan Jeffet
- Raymond and Beverly Sackler Faculty of Exact Sciences, Center for Nanoscience and Nanotechnology, Center for Light Matter Interaction, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Sapir Margalit
- Raymond and Beverly Sackler Faculty of Exact Sciences, Center for Nanoscience and Nanotechnology, Center for Light Matter Interaction, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Yael Michaeli
- Raymond and Beverly Sackler Faculty of Exact Sciences, Center for Nanoscience and Nanotechnology, Center for Light Matter Interaction, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Yuval Ebenstein
- Raymond and Beverly Sackler Faculty of Exact Sciences, Center for Nanoscience and Nanotechnology, Center for Light Matter Interaction, Tel Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
10
|
Hall ND, Patel JD, McElroy JS, Goertzen LR. Detection of subgenome bias using an anchored syntenic approach in Eleusine coracana (finger millet). BMC Genomics 2021; 22:175. [PMID: 33706694 PMCID: PMC7953713 DOI: 10.1186/s12864-021-07447-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 02/16/2021] [Indexed: 01/15/2023] Open
Abstract
Background Finger millet (Eleusine coracana 2n = 4x = 36) is a hardy, nutraceutical, climate change tolerant, orphan crop that is consumed throughout eastern Africa and India. Its genome has been sequenced multiple times, but A and B subgenomes could not be separated because no published genome for E. indica existed. The classification of A and B subgenomes is important for understanding the evolution of this crop and provide a means to improve current and future breeding programs. Results We produced subgenome calls for 704 syntenic blocks and inferred A or B subgenomic identity for 59,377 genes 81% of the annotated genes. Phylogenetic analysis of a super matrix containing 455 genes shows high support for A and B divergence within the Eleusine genus. Synonymous substitution rates between A and B genes support A and B calls. The repetitive content on highly supported B contigs is higher than that on similar A contigs. Analysis of syntenic singletons showed evidence of biased fractionation showed a pattern of A genome dominance, with 61% A, 37% B and 1% unassigned, and was further supported by the pattern of loss observed among cyto-nuclear interacting genes. Conclusion The evidence of individual gene calls within each syntenic block, provides a powerful tool for inference for subgenome classification. Our results show the utility of a draft genome in resolving A and B subgenomes calls, primarily it allows for the proper polarization of A and B syntenic blocks. There have been multiple calls for the use of phylogenetic inference in subgenome classification, our use of synteny is a practical application in a system that has only one parental genome available. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07447-y.
Collapse
Affiliation(s)
- Nathan D Hall
- Department of Crop, Soil and Environmental Science Auburn University, Auburn, AL, USA.
| | - Jinesh D Patel
- Department of Crop, Soil and Environmental Science Auburn University, Auburn, AL, USA
| | - J Scott McElroy
- Department of Crop, Soil and Environmental Science Auburn University, Auburn, AL, USA
| | - Leslie R Goertzen
- Department of Biological Sciences, Auburn University, Auburn, AL, USA
| |
Collapse
|
11
|
Goldrich DY, LaBarge B, Chartrand S, Zhang L, Sadowski HB, Zhang Y, Pham K, Way H, Lai CYJ, Pang AWC, Clifford B, Hastie AR, Oldakowski M, Goldenberg D, Broach JR. Identification of Somatic Structural Variants in Solid Tumors by Optical Genome Mapping. J Pers Med 2021; 11:142. [PMID: 33670576 PMCID: PMC7921992 DOI: 10.3390/jpm11020142] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 02/12/2021] [Accepted: 02/15/2021] [Indexed: 12/12/2022] Open
Abstract
Genomic structural variants comprise a significant fraction of somatic mutations driving cancer onset and progression. However, such variants are not readily revealed by standard next-generation sequencing. Optical genome mapping (OGM) surpasses short-read sequencing in detecting large (>500 bp) and complex structural variants (SVs) but requires isolation of ultra-high-molecular-weight DNA from the tissue of interest. We have successfully applied a protocol involving a paramagnetic nanobind disc to a wide range of solid tumors. Using as little as 6.5 mg of input tumor tissue, we show successful extraction of high-molecular-weight genomic DNA that provides a high genomic map rate and effective coverage by optical mapping. We demonstrate the system's utility in identifying somatic SVs affecting functional and cancer-related genes for each sample. Duplicate/triplicate analysis of select samples shows intra-sample reliability but also intra-sample heterogeneity. We also demonstrate that simply filtering SVs based on a GRCh38 human control database provides high positive and negative predictive values for true somatic variants. Our results indicate that the solid tissue DNA extraction protocol, OGM and SV analysis can be applied to a wide variety of solid tumors to capture SVs across the entire genome with functional importance in cancer prognosis and treatment.
Collapse
Affiliation(s)
- David Y. Goldrich
- Department of Otolaryngology—Head and Neck Surgery, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA; (D.Y.G.); (B.L.); (D.G.)
| | - Brandon LaBarge
- Department of Otolaryngology—Head and Neck Surgery, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA; (D.Y.G.); (B.L.); (D.G.)
| | - Scott Chartrand
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA; (S.C.); (L.Z.)
| | - Lijun Zhang
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA; (S.C.); (L.Z.)
| | - Henry B. Sadowski
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Yang Zhang
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Khoa Pham
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Hannah Way
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Chi-Yu Jill Lai
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Andy Wing Chun Pang
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Benjamin Clifford
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Alex R. Hastie
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - Mark Oldakowski
- Bionano Genomics, San Diego, CA 92121, USA; (H.B.S.); (Y.Z.); (K.P.); (H.W.); (C.-Y.J.L.); (A.W.C.P.); (B.C.); (A.R.H.); (M.O.)
| | - David Goldenberg
- Department of Otolaryngology—Head and Neck Surgery, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA; (D.Y.G.); (B.L.); (D.G.)
| | - James R. Broach
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA; (S.C.); (L.Z.)
| |
Collapse
|
12
|
Polson JM, Hastie CG. Free energy of a knotted polymer confined to narrow cylindrical and conical channels. Phys Rev E 2020; 102:052502. [PMID: 33327190 DOI: 10.1103/physreve.102.052502] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 10/16/2020] [Indexed: 06/12/2023]
Abstract
Monte Carlo simulations are used to study the conformational behavior of a semiflexible polymer confined to cylindrical and conical channels. The channels are sufficiently narrow that the conditions for the Odijk regime are marginally satisfied. For cylindrical confinement, we examine polymers with a single knot of topology 3_{1}, 4_{1}, or 5_{1}, as well as unknotted polymers that are capable of forming S loops. We measure the variation of the free energy F with the end-to-end polymer extension length X and examine the effect of varying the polymer topology, persistence length P, and cylinder diameter D on the free-energy functions. Similarly, we characterize the behavior of the knot span along the channel. We find that increasing the knot complexity increases the typical size of the knot. In the regime of low X, where the knot/S-loop size is large, the conformational behavior is independent of polymer topology. In addition, the scaling properties of the free energy and knot span are in agreement with predictions from a theoretical model constructed using known properties of interacting polymers in the Odijk regime. We also examine the variation of F with the position of a knot in conical channels for various values of the cone angle α. The free energy decreases as the knot moves in a direction where the cone widens, and it also decreases with increasing α and with increasing knot complexity. The behavior is in agreement with predictions from a theoretical model in which the dominant contribution to the change in F is the change in the size of the hairpins as the knot moves to the wider region of the channel.
Collapse
Affiliation(s)
- James M Polson
- Department of Physics, University of Prince Edward Island, 550 University Ave., Charlottetown, Prince Edward Island, C1A 4P3, Canada
| | - Cameron G Hastie
- Department of Physics, University of Prince Edward Island, 550 University Ave., Charlottetown, Prince Edward Island, C1A 4P3, Canada
| |
Collapse
|
13
|
Etherington GJ, Heavens D, Baker D, Lister A, McNelly R, Garcia G, Clavijo B, Macaulay I, Haerty W, Di Palma F. Sequencing smart: De novo sequencing and assembly approaches for a non-model mammal. Gigascience 2020; 9:5836134. [PMID: 32396200 PMCID: PMC7216774 DOI: 10.1093/gigascience/giaa045] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 02/28/2020] [Accepted: 04/15/2020] [Indexed: 01/05/2023] Open
Abstract
Background Whilst much sequencing effort has focused on key mammalian model organisms such as mouse and human, little is known about the relationship between genome sequencing techniques for non-model mammals and genome assembly quality. This is especially relevant to non-model mammals, where the samples to be sequenced are often degraded and of low quality. A key aspect when planning a genome project is the choice of sequencing data to generate. This decision is driven by several factors, including the biological questions being asked, the quality of DNA available, and the availability of funds. Cutting-edge sequencing technologies now make it possible to achieve highly contiguous, chromosome-level genome assemblies, but rely on high-quality high molecular weight DNA. However, funding is often insufficient for many independent research groups to use these techniques. Here we use a range of different genomic technologies generated from a roadkill European polecat (Mustela putorius) to assess various assembly techniques on this low-quality sample. We evaluated different approaches for de novo assemblies and discuss their value in relation to biological analyses. Results Generally, assemblies containing more data types achieved better scores in our ranking system. However, when accounting for misassemblies, this was not always the case for Bionano and low-coverage 10x Genomics (for scaffolding only). We also find that the extra cost associated with combining multiple data types is not necessarily associated with better genome assemblies. Conclusions The high degree of variability between each de novo assembly method (assessed from the 7 key metrics) highlights the importance of carefully devising the sequencing strategy to be able to carry out the desired analysis. Adding more data to genome assemblies does not always result in better assemblies, so it is important to understand the nuances of genomic data integration explained here, in order to obtain cost-effective value for money when sequencing genomes.
Collapse
Affiliation(s)
| | - Darren Heavens
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - David Baker
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Ashleigh Lister
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Rose McNelly
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Gonzalo Garcia
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Bernardo Clavijo
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Iain Macaulay
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Wilfried Haerty
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | | |
Collapse
|
14
|
Crumbaker M, Chan EKF, Gong T, Corcoran N, Jaratlerdsiri W, Lyons RJ, Haynes AM, Kulidjian AA, Kalsbeek AMF, Petersen DC, Stricker PD, Jamieson CAM, Croucher PI, Hovens CM, Joshua AM, Hayes VM. The Impact of Whole Genome Data on Therapeutic Decision-Making in Metastatic Prostate Cancer: A Retrospective Analysis. Cancers (Basel) 2020; 12:E1178. [PMID: 32392735 PMCID: PMC7280976 DOI: 10.3390/cancers12051178] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 04/21/2020] [Accepted: 04/28/2020] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND While critical insights have been gained from evaluating the genomic landscape of metastatic prostate cancer, utilizing this information to inform personalized treatment is in its infancy. We performed a retrospective pilot study to assess the current impact of precision medicine for locally advanced and metastatic prostate adenocarcinoma and evaluate how genomic data could be harnessed to individualize treatment. METHODS Deep whole genome-sequencing was performed on 16 tumour-blood pairs from 13 prostate cancer patients; whole genome optical mapping was performed in a subset of 9 patients to further identify large structural variants. Tumour samples were derived from prostate, lymph nodes, bone and brain. RESULTS Most samples had acquired genomic alterations in multiple therapeutically relevant pathways, including DNA damage response (11/13 cases), PI3K (7/13), MAPK (10/13) and Wnt (9/13). Five patients had somatic copy number losses in genes that may indicate sensitivity to immunotherapy (LRP1B, CDK12, MLH1) and one patient had germline and somatic BRCA2 alterations. CONCLUSIONS Most cases, whether primary or metastatic, harboured therapeutically relevant alterations, including those associated with PARP inhibitor sensitivity, immunotherapy sensitivity and resistance to androgen pathway targeting agents. The observed intra-patient heterogeneity and presence of genomic alterations in multiple growth pathways in individual cases suggests that a precision medicine model in prostate cancer needs to simultaneously incorporate multiple pathway-targeting agents. Our whole genome approach allowed for structural variant assessment in addition to the ability to rapidly reassess an individual's molecular landscape as knowledge of relevant biomarkers evolve. This retrospective oncological assessment highlights the genomic complexity of prostate cancer and the potential impact of assessing genomic data for an individual at any stage of the disease.
Collapse
Affiliation(s)
- Megan Crumbaker
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
- St. Vincent’s Clinical School, University of New South Wales, Sydney, Randwick, NSW 2031, Australia
- Kinghorn Cancer Centre, Department of Medical Oncology, St. Vincent’s Hospital, Darlinghurst, NSW 2010, Australia
| | - Eva K. F. Chan
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
- St. Vincent’s Clinical School, University of New South Wales, Sydney, Randwick, NSW 2031, Australia
| | - Tingting Gong
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
- Central Clinical School, University of Sydney, Sydney, Camperdown, NSW 2050, Australia
| | - Niall Corcoran
- Australian Prostate Cancer Research Centre Epworth, Richmond, VIC 3121, Australia;
- Department of Surgery, University of Melbourne, Melbourne, VIC 3010, Australia
- Division of Urology, Royal Melbourne Hospital, Melbourne, VIC 3050, Australia
| | - Weerachai Jaratlerdsiri
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
| | - Ruth J. Lyons
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
| | - Anne-Maree Haynes
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
| | - Anna A. Kulidjian
- Department of Orthopedic Surgery, Scripps Clinic, La Jolla, CA 92037, USA.;
- Orthopedic Oncology Program, Scripps MD Anderson Cancer Center, La Jolla, CA 92037, USA
| | - Anton M. F. Kalsbeek
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
| | - Desiree C. Petersen
- The Centre for Proteomic and Genomic Research, Cape Town 7925, South Africa;
| | - Phillip D. Stricker
- Department of Urology, St. Vincent’s Hospital, Darlinghurst, NSW 2010, Australia;
| | - Christina A. M. Jamieson
- Department of Urology, Moores Cancer Center, University of California, San Diego, La Jolla, CA 92037, USA;
| | - Peter I. Croucher
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Randwick, NSW 2031, Australia
| | - Christopher M. Hovens
- Australian Prostate Cancer Research Centre Epworth, Richmond, VIC 3121, Australia;
- Department of Surgery, University of Melbourne, Melbourne, VIC 3010, Australia
| | - Anthony M. Joshua
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
- St. Vincent’s Clinical School, University of New South Wales, Sydney, Randwick, NSW 2031, Australia
- Kinghorn Cancer Centre, Department of Medical Oncology, St. Vincent’s Hospital, Darlinghurst, NSW 2010, Australia
| | - Vanessa M. Hayes
- Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia; (M.C.); (E.K.F.C.); (T.G.); (W.J.); (R.J.L.); (A.-M.H.); (A.M.F.K.); (P.I.C.)
- St. Vincent’s Clinical School, University of New South Wales, Sydney, Randwick, NSW 2031, Australia
- Central Clinical School, University of Sydney, Sydney, Camperdown, NSW 2050, Australia
| |
Collapse
|
15
|
Luo J, Chai J, Wen Y, Tao M, Lin G, Liu X, Ren L, Chen Z, Wu S, Li S, Wang Y, Qin Q, Wang S, Gao Y, Huang F, Wang L, Ai C, Wang X, Li L, Ye C, Yang H, Luo M, Chen J, Hu H, Yuan L, Zhong L, Wang J, Xu J, Du Z, Ma Z(S, Murphy RW, Meyer A, Gui J, Xu P, Ruan J, Chen ZJ, Liu S, Lu X, Zhang YP. From asymmetrical to balanced genomic diversification during rediploidization: Subgenomic evolution in allotetraploid fish. SCIENCE ADVANCES 2020; 6:eaaz7677. [PMID: 32766441 PMCID: PMC7385415 DOI: 10.1126/sciadv.aaz7677] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 03/20/2020] [Indexed: 05/27/2023]
Abstract
A persistent enigma is the rarity of polyploidy in animals, compared to its prevalence in plants. Although animal polyploids are thought to experience deleterious genomic chaos during initial polyploidization and subsequent rediploidization processes, this hypothesis has not been tested. We provide an improved reference-quality de novo genome for allotetraploid goldfish whose origin dates to ~15 million years ago. Comprehensive analyses identify changes in subgenomic evolution from asymmetrical oscillation in goldfish and common carp to diverse stabilization and balanced gene expression during continuous rediploidization. The homoeologs are coexpressed in most pathways, and their expression dominance shifts temporally during embryogenesis. Homoeolog expression correlates negatively with alternation of DNA methylation. The results show that allotetraploid cyprinids have a unique strategy for balancing subgenomic stabilization and diversification. Rediploidization process in these fishes provides intriguing insights into genome evolution and function in allopolyploid vertebrates.
Collapse
Affiliation(s)
- Jing Luo
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Jing Chai
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
| | - Yanling Wen
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Min Tao
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Guoliang Lin
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Xiaochuan Liu
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Li Ren
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Zeyu Chen
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Shigang Wu
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Shengnan Li
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Yude Wang
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Qinbo Qin
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Shi Wang
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Yun Gao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
| | - Feng Huang
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Lu Wang
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Cheng Ai
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Xiaobo Wang
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Lianwei Li
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
| | - Chengxi Ye
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
- Department of Computer Science, University of Maryland, College Park, MD 20742, USA
| | - Huimin Yang
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Mi Luo
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Jie Chen
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Hong Hu
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Liujiao Yuan
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Li Zhong
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Jing Wang
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
| | - Jian Xu
- Centre for Applied Aquatic Genomics, Chinese Academy of Fishery Sciences, Beijing 100141, China
| | - Zhenglin Du
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhanshan (Sam) Ma
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
| | - Robert W. Murphy
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, Toronto, ON M5S 2C6, Canada
| | - Axel Meyer
- Department of Biology, University of Konstanz, Konstanz 78457, Germany
| | - Jianfang Gui
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Innovation Academy for Seed Design, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Wuhan 430072, China
| | - Peng Xu
- College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102 Fujian, China
| | - Jue Ruan
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- Peng Cheng Laboratory, Shenzhen 518052, China
| | - Z. Jeffrey Chen
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095 Jiangsu, China
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712-0159, USA
| | - Shaojun Liu
- State Key Laboratory of Developmental Biology of Freshwater Fish and College of Life Sciences, Hunan Normal University, Changsha, 410081 Hunan, China
| | - Xuemei Lu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- CAS Center for Excellence in Animal Evolution and Genetics, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ya-ping Zhang
- State Key Laboratory for Conservation and Utilization of Bio-resource and School of Life Sciences, Yunnan University, Kunming, 650091 Yunnan, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223 Yunnan, China
| |
Collapse
|
16
|
Field MA, Rosen BD, Dudchenko O, Chan EKF, Minoche AE, Edwards RJ, Barton K, Lyons RJ, Tuipulotu DE, Hayes VM, D. Omer A, Colaric Z, Keilwagen J, Skvortsova K, Bogdanovic O, Smith MA, Aiden EL, Smith TPL, Zammit RA, Ballard JWO. Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C. Gigascience 2020; 9:giaa027. [PMID: 32236524 PMCID: PMC7111595 DOI: 10.1093/gigascience/giaa027] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 01/29/2020] [Accepted: 02/20/2020] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. FINDINGS Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. CONCLUSIONS GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology.
Collapse
Affiliation(s)
- Matt A Field
- Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Smithfield Road, Cairns, QLD 4878, Australia
- John Curtin School of Medical Research, Australian National University, Garran Rd, Canberra, ACT 2600, Australia
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, Agricultural Research Service USDA, Baltimore Ave, Beltsville, MD 20705, USA
| | - Olga Dudchenko
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Baylor Plaza, Houston, TX 77030, USA
- Department of Computer Science, Rice University, Main St, Houston, TX 77005, USA
- Center for Theoretical and Biological Physics, Rice University, Main St, Houston, TX 77005, USA
| | - Eva K F Chan
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
- Faculty of Medicine, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| | - Andre E Minoche
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
- St Vincent’s Clinical School, University of New South Wales Sydney, Victoria Street, Darlinghurst NSW 2010, Australia
| | - Richard J Edwards
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| | - Kirston Barton
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
- Faculty of Medicine, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| | - Ruth J Lyons
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
| | - Daniel Enosi Tuipulotu
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| | - Vanessa M Hayes
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
- Faculty of Medicine, UNSW Sydney, High St, Kensington, NSW 2052, Australia
- Central Clinical School, University of Sydney, Parramatta Road, Camperdown, NSW 2050, Australia
| | - Arina D. Omer
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Baylor Plaza, Houston, TX 77030, USA
- Department of Computer Science, Rice University, Main St, Houston, TX 77005, USA
| | - Zane Colaric
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Baylor Plaza, Houston, TX 77030, USA
- Department of Computer Science, Rice University, Main St, Houston, TX 77005, USA
| | - Jens Keilwagen
- Julius Kühn-Institut, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany
| | - Ksenia Skvortsova
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
| | - Ozren Bogdanovic
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| | - Martin A Smith
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia
- Faculty of Medicine, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| | - Erez Lieberman Aiden
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Baylor Plaza, Houston, TX 77030, USA
- Department of Computer Science, Rice University, Main St, Houston, TX 77005, USA
- Center for Theoretical and Biological Physics, Rice University, Main St, Houston, TX 77005, USA
- Broad Institute of MIT and Harvard, Main St, Cambridge, MA 02142, USA
- Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, ShanghaiTech University, Huaxia Middle Rd, Pudong 201210, China
| | - Timothy P L Smith
- US Meat Animal Research Center, Agricultural Research Service USDA, Rd 313, Clay Center, NE 68933, USA
| | - Robert A Zammit
- Vineyard Veterinary Hospital, Windsor Rd, Vineyard, NSW 2765, Australia
| | - J William O Ballard
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, High St, Kensington, NSW 2052, Australia
| |
Collapse
|
17
|
Schotzinger RM, Menard LD, Ramsey JM. Single-Molecule DNA Extension in Rectangular and Square Profile Nanochannels in the Extended de Gennes Regime. Macromolecules 2020. [DOI: 10.1021/acs.macromol.9b02249] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- R. Michael Schotzinger
- Department of Biomedical Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| | | | - J. Michael Ramsey
- Department of Biomedical Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| |
Collapse
|
18
|
Luo X, Zhou Y, Zhang B, Zhang Y, Wang X, Feng T, Li Z, Cui K, Wang Z, Luo C, Li H, Deng Y, Lu F, Han J, Miao Y, Mao H, Yi X, Ai C, Wu S, Li A, Wu Z, Zhuo Z, Da Giang D, Mitra B, Vahidi MF, Mansoor S, Al-Bayatti SA, Sari EM, Gorkhali NA, Prastowo S, Shafique L, Ye G, Qian Q, Chen B, Shi D, Ruan J, Liu Q. Understanding divergent domestication traits from the whole-genome sequencing of swamp- and river-buffalo populations. Natl Sci Rev 2020; 7:686-701. [PMID: 34692087 PMCID: PMC8289072 DOI: 10.1093/nsr/nwaa024] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 12/27/2019] [Accepted: 02/12/2020] [Indexed: 01/01/2023] Open
Abstract
Abstract
Domesticated buffaloes have been integral to rice-paddy agro-ecosystems for millennia, yet relatively little is known about the buffalo genomics. Here, we sequenced and assembled reference genomes for both swamp and river buffaloes and we re-sequenced 230 individuals (132 swamp buffaloes and 98 river buffaloes) sampled from across Asia and Europe. Beyond the many actionable insights that our study revealed about the domestication, basic physiology and breeding of buffalo, we made the striking discovery that the divergent domestication traits between swamp and river buffaloes can be explained with recent selections of genes on social behavior, digestion metabolism, strengths and milk production.
Collapse
Affiliation(s)
- Xier Luo
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Yu Zhou
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Bing Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yi Zhang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100083, China
| | - Xiaobo Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Tong Feng
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Zhipeng Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Kuiqing Cui
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Zhiqiang Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Chan Luo
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Hui Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Yanfei Deng
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Fenghua Lu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Jianlin Han
- CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
- International Livestock Research Institute, Nairobi 00100, Kenya
| | - Yongwang Miao
- Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Huaming Mao
- Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Xiaoyan Yi
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Cheng Ai
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Shigang Wu
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Alun Li
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Zhichao Wu
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Zijun Zhuo
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Do Da Giang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
- Bacgiang Agriculture and Forestry University, Bacgiang 230000, Vietnam
| | - Bikash Mitra
- Cellular Immunology Lab, Department of Zoology, University of North Bengal, Siligun 734013, India
| | - Mohammad Farhad Vahidi
- Animal Biotechnology Department, Agricultural Biotechnology Research Institute of Iran-North Region, Agricultural Research, Education and Extension Organization, Rasht 999067, Iran
| | - Shahid Mansoor
- National Institute for Biotechnology and Genetic Engineering, Faisalabad 999010, Pakistan
| | - Sahar Ahmed Al-Bayatti
- Animal Genetic Sources Department, Directorate of Animal Resources, Ministry of Agriculture, Baghdad 19207, Iraq
| | - Eka Meutia Sari
- Department of Animal Science, Faculty of Agriculture, Syiah Kuala University, Darussalam-Banda Aceh 23111, Indonesia
| | - Neena Amatya Gorkhali
- Animal Breeding Division, National Animal Science Research Institute, Nepal Agriculture Research Council, Khumaltar 999098, Nepal
| | - Sigit Prastowo
- Animal Science Department Universitas Sebelas Maret, Surakarta 999006, Indonesia
| | - Laiba Shafique
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Guoyou Ye
- International Rice Research Institute, Manila 999005, Philippines
| | - Qian Qian
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Baoshan Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Deshun Shi
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Jue Ruan
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Qingyou Liu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| |
Collapse
|
19
|
Young E, Abid HZ, Kwok PY, Riethman H, Xiao M. Comprehensive Analysis of Human Subtelomeres by Whole Genome Mapping. PLoS Genet 2020; 16:e1008347. [PMID: 31986135 PMCID: PMC7004388 DOI: 10.1371/journal.pgen.1008347] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 02/06/2020] [Accepted: 10/15/2019] [Indexed: 12/03/2022] Open
Abstract
Detailed comprehensive knowledge of the structures of individual long-range telomere-terminal haplotypes are needed to understand their impact on telomere function, and to delineate the population structure and evolution of subtelomere regions. However, the abundance of large evolutionarily recent segmental duplications and high levels of large structural variations have complicated both the mapping and sequence characterization of human subtelomere regions. Here, we use high throughput optical mapping of large single DNA molecules in nanochannel arrays for 154 human genomes from 26 populations to present a comprehensive look at human subtelomere structure and variation. The results catalog many novel long-range subtelomere haplotypes and determine the frequencies and contexts of specific subtelomeric duplicons on each chromosome arm, helping to clarify the currently ambiguous nature of many specific subtelomere structures as represented in the current reference sequence (HG38). The organization and content of some duplicons in subtelomeres appear to show both chromosome arm and population-specific trends. Based upon these trends we estimate a timeline for the spread of these duplication blocks.
Collapse
Affiliation(s)
- Eleanor Young
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Heba Z. Abid
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, University of California–San Francisco, San Francisco, CA, United States of America
- Department of Dermatology, University of California–San Francisco, San Francisco, CA, United States of America
- Institute for Human Genetics, University of California–San Francisco, San Francisco, CA, United States of America
| | - Harold Riethman
- Medical Diagnostic & Translational Sciences, Old Dominium University, Norfolk, VA, United States of America
| | - Ming Xiao
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
- Institute of Molecular Medicine and Infectious Disease in the School of Medicine, Drexel University, Philadelphia, PA, United States of America
| |
Collapse
|
20
|
Li M, Wang J. Stretching Wormlike Chains in Narrow Tubes of Arbitrary Cross-Sections. Polymers (Basel) 2019; 11:E2050. [PMID: 31835594 PMCID: PMC6960511 DOI: 10.3390/polym11122050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Accepted: 12/06/2019] [Indexed: 12/03/2022] Open
Abstract
We considered the stretching of semiflexible polymer chains confined in narrow tubes with arbitrary cross-sections. Based on the wormlike chain model and technique of normal mode decomposition in statistical physics, we derived a compact analytical expression on the force-confinement-extension relation of the chains. This single formula was generalized to be valid for tube confinements with arbitrary cross-sections. In addition, we extended the generalized bead-rod model for Brownian dynamics simulations of confined polymer chains subjected to force stretching, so that the confinement effects to the chains applied by the tubes with arbitrary cross-sections can be quantitatively taken into account through numerical simulations. Extensive simulation examples on the wormlike chains confined in tubes of various shapes quantitatively justified the theoretically derived generalized formula on the force-confinement-extension relation of the chains.
Collapse
Affiliation(s)
| | - Jizeng Wang
- Key Laboratory of Mechanics on Disaster and Environment in Western China, Ministry of Education, College of Civil Engineering and Mechanics, Lanzhou University, Lanzhou 730000, China;
| |
Collapse
|
21
|
Müller V, Dvirnas A, Andersson J, Singh V, Kk S, Johansson P, Ebenstein Y, Ambjörnsson T, Westerlund F. Enzyme-free optical DNA mapping of the human genome using competitive binding. Nucleic Acids Res 2019; 47:e89. [PMID: 31165870 PMCID: PMC6735870 DOI: 10.1093/nar/gkz489] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/03/2019] [Accepted: 05/22/2019] [Indexed: 01/24/2023] Open
Abstract
Optical DNA mapping (ODM) allows visualization of long-range sequence information along single DNA molecules. The data can for example be used for detecting long range structural variations, for aiding DNA sequence assembly of complex genomes and for mapping epigenetic marks and DNA damage across the genome. ODM traditionally utilizes sequence specific marks based on nicking enzymes, combined with a DNA stain, YOYO-1, for detection of the DNA contour. Here we use a competitive binding approach, based on YOYO-1 and netropsin, which highlights the contour of the DNA molecules, while simultaneously creating a continuous sequence specific pattern, based on the AT/GC variation along the detected molecule. We demonstrate and validate competitive-binding-based ODM using bacterial artificial chromosomes (BACs) derived from the human genome and then turn to DNA extracted from white blood cells. We generalize our findings with in-silico simulations that show that we can map a vast majority of the human genome. Finally, we demonstrate the possibility of combining competitive binding with enzymatic labeling by mapping DNA damage sites induced by the cytotoxic drug etoposide to the human genome. Overall, we demonstrate that competitive-binding-based ODM has the potential to be used both as a standalone assay for studies of the human genome, as well as in combination with enzymatic approaches, some of which are already commercialized.
Collapse
Affiliation(s)
- Vilhelm Müller
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Albertas Dvirnas
- Department of Astronomy and Theoretical Physics, Lund University, Lund, Sweden
| | - John Andersson
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Vandana Singh
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Sriram Kk
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Pegah Johansson
- Clinical Chemistry, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Yuval Ebenstein
- School of Chemistry, Center for Nanoscience and Nanotechnology, Center for Light-Matter Interaction, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Tobias Ambjörnsson
- Department of Astronomy and Theoretical Physics, Lund University, Lund, Sweden
| | - Fredrik Westerlund
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| |
Collapse
|
22
|
Giani AM, Gallo GR, Gianfranceschi L, Formenti G. Long walk to genomics: History and current approaches to genome sequencing and assembly. Comput Struct Biotechnol J 2019; 18:9-19. [PMID: 31890139 PMCID: PMC6926122 DOI: 10.1016/j.csbj.2019.11.002] [Citation(s) in RCA: 109] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Revised: 11/03/2019] [Accepted: 11/06/2019] [Indexed: 12/13/2022] Open
Abstract
Genomes represent the starting point of genetic studies. Since the discovery of DNA structure, scientists have devoted great efforts to determine their sequence in an exact way. In this review we provide a comprehensive historical background of the improvements in DNA sequencing technologies that have accompanied the major milestones in genome sequencing and assembly, ranging from early sequencing methods to Next-Generation Sequencing platforms. We then focus on the advantages and challenges of the current technologies and approaches, collectively known as Third Generation Sequencing. As these technical advancements have been accompanied by progress in analytical methods, we also review the bioinformatic tools currently employed in de novo genome assembly, as well as some applications of Third Generation Sequencing technologies and high-quality reference genomes.
Collapse
Key Words
- BAC, Bacterial Artificial Chromosome
- Bioinformatics
- Genome assembly
- HGP, Human Genome Project
- HMW, high molecular weight
- HapMap, haplotype map
- NGS, Next Generation Sequencing
- Next-generation
- OLC, Overlap-Layout-Consensus
- QV, Quality Value (QV)
- Reference
- SBS, Sequencing by Synthesis
- SMRT, Single Molecule Real-Time
- SNPs, Single Nucleotide Polymorphisms
- SRA, Short Read Archive
- SV, Structural Variant
- Sequencing
- TGS, Third Generation Sequencing
- Third-generation
- WGS, Whole Genome Sequencing
- ZMW, Zero-Mode Waveguide
- bp, base pair
- dNTPs, deoxynucleoside triphosphates
- ddNTP, 2,3-dideoxynucleoside triphosphate
Collapse
Affiliation(s)
- Alice Maria Giani
- Department of Surgery, Weill Cornell Medical College, New York, NY, USA
| | | | | | | |
Collapse
|
23
|
Varapula D, LaBouff E, Raseley K, Uppuluri L, Ehrlich GD, Noh M, Xiao M. A micropatterned substrate for on-surface enzymatic labelling of linearized long DNA molecules. Sci Rep 2019; 9:15059. [PMID: 31636335 PMCID: PMC6803683 DOI: 10.1038/s41598-019-51507-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 10/02/2019] [Indexed: 12/22/2022] Open
Abstract
Optical mapping of linearized DNA molecules is a promising new technology for sequence assembly and scaffolding, large structural variant detection, and diagnostics. This is currently achieved either using nanochannel confinement or by stretching single DNA molecules on a solid surface. While the first method necessitates DNA labelling before linearization, the latter allows for modification post-linearization, thereby affording increased process flexibility. Each method is constrained by various physical and chemical limitations. One of the most common techniques for linearization of DNA uses a hydrophobic surface and a receding meniscus, termed molecular combing. Here, we report the development of a microfabricated surface that can not only comb the DNA molecules efficiently but also provides for sequence-specific enzymatic fluorescent DNA labelling. By modifying a glass surface with two contrasting functionalities, such that DNA binds selectively to one of the two regions, we can control DNA extension, which is known to be critical for sequence-recognition by an enzyme. Moreover, the surface modification provides enzymatic access to the DNA backbone, as well as minimizing non-specific fluorescent dye adsorption. These enhancements make the designed surface suitable for large-scale and high-resolution single DNA molecule studies.
Collapse
Affiliation(s)
- Dharma Varapula
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, 19104, USA
| | - Eric LaBouff
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, 19104, USA
- Department of Microbiology and Immunology, Drexel University College of Medicine, Philadelphia, PA, 19102, USA
- Center for Genomic Sciences and Center for Advanced Microbial Processing, Institute of Molecular Medicine and Infectious Disease, Drexel University College of Medicine, Philadelphia, PA, 19102, USA
| | - Kaitlin Raseley
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, 19104, USA
| | - Lahari Uppuluri
- Department of Mechanical Engineering and Mechanics, Drexel University, Philadelphia, PA, 19104, USA
| | - Garth D Ehrlich
- Department of Microbiology and Immunology, Drexel University College of Medicine, Philadelphia, PA, 19102, USA
- Center for Genomic Sciences and Center for Advanced Microbial Processing, Institute of Molecular Medicine and Infectious Disease, Drexel University College of Medicine, Philadelphia, PA, 19102, USA
- Department of Otolaryngology Head and Neck Surgery, Drexel University College of Medicine, Philadelphia, PA, 19102, USA
| | - Moses Noh
- Department of Mechanical Engineering and Mechanics, Drexel University, Philadelphia, PA, 19104, USA
| | - Ming Xiao
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, 19104, USA.
- Center for Genomic Sciences and Center for Advanced Microbial Processing, Institute of Molecular Medicine and Infectious Disease, Drexel University College of Medicine, Philadelphia, PA, 19102, USA.
| |
Collapse
|
24
|
Douglas GM, Langille MGI. Current and Promising Approaches to Identify Horizontal Gene Transfer Events in Metagenomes. Genome Biol Evol 2019; 11:2750-2766. [PMID: 31504488 PMCID: PMC6777429 DOI: 10.1093/gbe/evz184] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/19/2019] [Indexed: 12/16/2022] Open
Abstract
High-throughput shotgun metagenomics sequencing has enabled the profiling of myriad natural communities. These data are commonly used to identify gene families and pathways that were potentially gained or lost in an environment and which may be involved in microbial adaptation. Despite the widespread interest in these events, there are no established best practices for identifying gene gain and loss in metagenomics data. Horizontal gene transfer (HGT) represents several mechanisms of gene gain that are especially of interest in clinical microbiology due to the rapid spread of antibiotic resistance genes in natural communities. Several additional mechanisms of gene gain and loss, including gene duplication, gene loss-of-function events, and de novo gene birth are also important to consider in the context of metagenomes but have been less studied. This review is largely focused on detecting HGT in prokaryotic metagenomes, but methods for detecting these other mechanisms are first discussed. For this article to be self-contained, we provide a general background on HGT and the different possible signatures of this process. Lastly, we discuss how improved assembly of genomes from metagenomes would be the most straight-forward approach for improving the inference of gene gain and loss events. Several recent technological advances could help improve metagenome assemblies: long-read sequencing, determining the physical proximity of contigs, optical mapping of short sequences along chromosomes, and single-cell metagenomics. The benefits and limitations of these advances are discussed and open questions in this area are highlighted.
Collapse
Affiliation(s)
- Gavin M Douglas
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Morgan G I Langille
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
25
|
Huo N, Zhu T, Zhang S, Mohr T, Luo MC, Lee JY, Distelfeld A, Altenbach S, Gu YQ. Rapid evolution of α-gliadin gene family revealed by analyzing Gli-2 locus regions of wild emmer wheat. Funct Integr Genomics 2019; 19:993-1005. [PMID: 31197605 PMCID: PMC6797660 DOI: 10.1007/s10142-019-00686-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Revised: 04/23/2019] [Accepted: 04/30/2019] [Indexed: 12/13/2022]
Abstract
α-Gliadins are a major group of gluten proteins in wheat flour that contribute to the end-use properties for food processing and contain major immunogenic epitopes that can cause serious health-related issues including celiac disease (CD). α-Gliadins are also the youngest group of gluten proteins and are encoded by a large gene family. The majority of the gene family members evolved independently in the A, B, and D genomes of different wheat species after their separation from a common ancestral species. To gain insights into the origin and evolution of these complex genes, the genomic regions of the Gli-2 loci encoding α-gliadins were characterized from the tetraploid wild emmer, a progenitor of hexaploid bread wheat that contributed the AABB genomes. Genomic sequences of Gli-2 locus regions for the wild emmer A and B genomes were first reconstructed using the genome sequence scaffolds along with optical genome maps. A total of 24 and 16 α-gliadin genes were identified for the A and B genome regions, respectively. α-Gliadin pseudogene frequencies of 86% for the A genome and 69% for the B genome were primarily caused by C to T substitutions in the highly abundant glutamine codons, resulting in the generation of premature stop codons. Comparison with the homologous regions from the hexaploid wheat cv. Chinese Spring indicated considerable sequence divergence of the two A genomes at the genomic level. In comparison, conserved regions between the two B genomes were identified that included α-gliadin pseudogenes containing shared nested TE insertions. Analyses of the genomic organization and phylogenetic tree reconstruction indicate that although orthologous gene pairs derived from speciation were present, large portions of α-gliadin genes were likely derived from differential gene duplications or deletions after the separation of the homologous wheat genomes ~ 0.5 MYA. The higher number of full-length intact α-gliadin genes in hexaploid wheat than that in wild emmer suggests that human selection through domestication might have an impact on α-gliadin evolution. Our study provides insights into the rapid and dynamic evolution of genomic regions harboring the α-gliadin genes in wheat.
Collapse
Affiliation(s)
- Naxin Huo
- United States Department of Agriculture-Agricultural Research Service USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA, 94710, USA.,Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Shengli Zhang
- Hena Institute of Science and Technology, Xinxiang, Hena Province, 453003, China
| | - Toni Mohr
- United States Department of Agriculture-Agricultural Research Service USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA, 94710, USA
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Jong-Yeol Lee
- National Institute of Agricultural Sciences, RDA, Jeonju, 54874, South Korea
| | - Assaf Distelfeld
- Institute for Crop Improvement, Tel Aviv University, Tel Aviv-Yafo, Israel
| | - Susan Altenbach
- United States Department of Agriculture-Agricultural Research Service USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA, 94710, USA
| | - Yong Q Gu
- United States Department of Agriculture-Agricultural Research Service USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA, 94710, USA.
| |
Collapse
|
26
|
Nagasaki M, Kuroki Y, Shibata TF, Katsuoka F, Mimori T, Kawai Y, Minegishi N, Hozawa A, Kuriyama S, Suzuki Y, Kawame H, Nagami F, Takai-Igarashi T, Ogishima S, Kojima K, Misawa K, Tanabe O, Fuse N, Tanaka H, Yaegashi N, Kinoshita K, Kure S, Yasuda J, Yamamoto M. Construction of JRG (Japanese reference genome) with single-molecule real-time sequencing. Hum Genome Var 2019; 6:27. [PMID: 31231536 PMCID: PMC6555796 DOI: 10.1038/s41439-019-0057-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Revised: 01/28/2019] [Accepted: 03/15/2019] [Indexed: 12/14/2022] Open
Abstract
In recent genome analyses, population-specific reference panels have indicated important. However, reference panels based on short-read sequencing data do not sufficiently cover long insertions. Therefore, the nature of long insertions has not been well documented. Here, we assembled a Japanese genome using single-molecule real-time sequencing data and characterized insertions found in the assembled genome. We identified 3691 insertions ranging from 100 bps to ~10,000 bps in the assembled genome relative to the international reference sequence (GRCh38). To validate and characterize these insertions, we mapped short-reads from 1070 Japanese individuals and 728 individuals from eight other populations to insertions integrated into GRCh38. With this result, we constructed JRGv1 (Japanese Reference Genome version 1) by integrating the 903 verified insertions, totaling 1,086,173 bases, shared by at least two Japanese individuals into GRCh38. We also constructed decoyJRGv1 by concatenating 3559 verified insertions, totaling 2,536,870 bases, shared by at least two Japanese individuals or by six other assemblies. This assembly improved the alignment ratio by 0.4% on average. These results demonstrate the importance of refining the reference assembly and creating a population-specific reference genome. JRGv1 and decoyJRGv1 are available at the JRG website. Researchers in Japan have assembled a Japanese reference genome, which includes sequences missing from the international reference genome, as well as others specific to East Asian populations. A team led by Masao Nagasaki and Masayuki Yamamoto sequenced a Japanese individual using a method, which produces longer sequences than previous technologies. Using this approach, they identified thousands of sequences spanning 2.5 million bases, which were absent in the international reference genome. Many of these were sequences able to move within the genome. They showed that the majority of these sequences are also present in early humans and chimpanzees, demonstrating that their absence from the current reference is due to deletions or limitations of earlier sequencing methodologies. In addition to providing a population-specific reference, these findings demonstrate the importance of continually improving the international reference genome.
Collapse
Affiliation(s)
- Masao Nagasaki
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,3Graduate School of Information Sciences, Tohoku University, Sendai, Japan
| | - Yoko Kuroki
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,4Department of Genome Medicine, National Center for Child Health and Development, Tokyo, Japan
| | - Tomoko F Shibata
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Fumiki Katsuoka
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Takahiro Mimori
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Yosuke Kawai
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,3Graduate School of Information Sciences, Tohoku University, Sendai, Japan
| | - Naoko Minegishi
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Atsushi Hozawa
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Shinichi Kuriyama
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,5International Research Institute of Disaster Science, Tohoku University, Sendai, Japan
| | - Yoichi Suzuki
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Hiroshi Kawame
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Fuji Nagami
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | | | - Soichi Ogishima
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Kaname Kojima
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,3Graduate School of Information Sciences, Tohoku University, Sendai, Japan
| | - Kazuharu Misawa
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Osamu Tanabe
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Nobuo Fuse
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,6Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Hiroshi Tanaka
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Nobuo Yaegashi
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,6Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Kengo Kinoshita
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,3Graduate School of Information Sciences, Tohoku University, Sendai, Japan
| | - Shiego Kure
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan.,6Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Jun Yasuda
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Masayuki Yamamoto
- 1Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,2Graduate School of Medicine, Tohoku University, Sendai, Japan
| |
Collapse
|
27
|
Korhonen PK, Hall RS, Young ND, Gasser RB. Common workflow language (CWL)-based software pipeline for de novo genome assembly from long- and short-read data. Gigascience 2019; 8:giz014. [PMID: 30821816 PMCID: PMC6451199 DOI: 10.1093/gigascience/giz014] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Revised: 11/03/2018] [Accepted: 01/25/2019] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Here, we created an automated pipeline for the de novoassembly of genomes from Pacific Biosciences long-read and Illumina short-read data using common workflow language (CWL). To evaluate the performance of this pipeline, we assembled the nuclear genomes of the eukaryotes Caenorhabditis elegans (∼100 Mb), Drosophila melanogaster (∼138 Mb), and Plasmodium falciparum (∼23 Mb) directly from publicly accessible nucleotide sequence datasets and assessed the quality of the assemblies against curated reference genomes. FINDINGS We showed a dependency of the accuracy of assembly on sequencing technology and GC content and repeatedly achieved assemblies that meet the high standards set by the National Human Genome Research Institute, being applicable to gene prediction and subsequent genomic analyses. CONCLUSIONS This CWL pipeline overcomes current challenges of achieving repeatability and reproducibility of assembly results and offers a platform for the re-use of the workflow and the integration of diverse datasets. This workflow is publicly available via GitHub (https://github.com/vetscience/Assemblosis) and is currently applicable to the assembly of haploid and diploid genomes of eukaryotes.
Collapse
Affiliation(s)
- Pasi K Korhonen
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Ross S Hall
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Neil D Young
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|
28
|
Paajanen P, Kettleborough G, López-Girona E, Giolai M, Heavens D, Baker D, Lister A, Cugliandolo F, Wilde G, Hein I, Macaulay I, Bryan GJ, Clark MD. A critical comparison of technologies for a plant genome sequencing project. Gigascience 2019; 8:giy163. [PMID: 30624602 PMCID: PMC6423373 DOI: 10.1093/gigascience/giy163] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2018] [Revised: 09/26/2018] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND A high-quality genome sequence of any model organism is an essential starting point for genetic and other studies. Older clone-based methods are slow and expensive, whereas faster, cheaper short-read-only assemblies can be incomplete and highly fragmented, which minimizes their usefulness. The last few years have seen the introduction of many new technologies for genome assembly. These new technologies and associated new algorithms are typically benchmarked on microbial genomes or, if they scale appropriately, on larger (e.g., human) genomes. However, plant genomes can be much more repetitive and larger than the human genome, and plant biochemistry often makes obtaining high-quality DNA that is free from contaminants difficult. Reflecting their challenging nature, we observe that plant genome assembly statistics are typically poorer than for vertebrates. RESULTS Here, we compare Illumina short read, Pacific Biosciences long read, 10x Genomics linked reads, Dovetail Hi-C, and BioNano Genomics optical maps, singly and combined, in producing high-quality long-range genome assemblies of the potato species Solanum verrucosum. We benchmark the assemblies for completeness and accuracy, as well as DNA compute requirements and sequencing costs. CONCLUSIONS The field of genome sequencing and assembly is reaching maturity, and the differences we observe between assemblies are surprisingly small. We expect that our results will be helpful to other genome projects, and that these datasets will be used in benchmarking by assembly algorithm developers.
Collapse
Affiliation(s)
- Pirita Paajanen
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - George Kettleborough
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Elena López-Girona
- Cell and Molcular Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
- The New Zealand Institute for Plant & Food Research Limited, Palmerston North 4442, New Zealand
| | - Michael Giolai
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Darren Heavens
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - David Baker
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Ashleigh Lister
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Fiorella Cugliandolo
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Gail Wilde
- Cell and Molcular Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Ingo Hein
- Cell and Molcular Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Iain Macaulay
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Glenn J Bryan
- Cell and Molcular Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Matthew D Clark
- Technology Development, Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
- Department of Life Sciences, Natural History Museum, Cromwell Road, London WC2 5BD, UK
| |
Collapse
|
29
|
Dangi S, Riehn R. Nanoplumbing with 2D Metamaterials. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2019; 15:e1803478. [PMID: 30537130 PMCID: PMC6785347 DOI: 10.1002/smll.201803478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Revised: 11/09/2018] [Indexed: 06/09/2023]
Abstract
Complex manipulations of DNA in a nanofluidic device require channels with branches and junctions. However, the dynamic response of DNA in such nanofluidic networks is relatively unexplored. Here, the transport of DNA in a 2D metamaterial made by arrays of nanochannel junctions is investigated. The mechanism of transport is explained as Brownian motion through an energy landscape formed by the combination of the confinement free energy of DNA and the effective potential of hydrodynamic flow, which both can be tuned independently within the device. For the quantitative understanding of DNA transport, a dynamic mean-field model of DNA at a nanochannel junction is proposed. It is shown that the dynamics of DNA in a nanofluidic device with branched channels and junctions is well described by the model.
Collapse
|
30
|
Dvorak J, Wang L, Zhu T, Jorgensen CM, Luo MC, Deal KR, Gu YQ, Gill BS, Distelfeld A, Devos KM, Qi P, McGuire PE. Reassessment of the evolution of wheat chromosomes 4A, 5A, and 7B. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2018; 131:2451-2462. [PMID: 30141064 PMCID: PMC6208953 DOI: 10.1007/s00122-018-3165-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2018] [Accepted: 08/13/2018] [Indexed: 05/02/2023]
Abstract
Comparison of genome sequences of wild emmer wheat and Aegilops tauschii suggests a novel scenario of the evolution of rearranged wheat chromosomes 4A, 5A, and 7B. Past research suggested that wheat chromosome 4A was subjected to a reciprocal translocation T(4AL;5AL)1 that occurred in the diploid progenitor of the wheat A subgenome and to three major rearrangements that occurred in polyploid wheat: pericentric inversion Inv(4AS;4AL)1, paracentric inversion Inv(4AL;4AL)1, and reciprocal translocation T(4AL;7BS)1. Gene collinearity along the pseudomolecules of tetraploid wild emmer wheat (Triticum turgidum ssp. dicoccoides, subgenomes AABB) and diploid Aegilops tauschii (genomes DD) was employed to confirm these rearrangements and to analyze the breakpoints. The exchange of distal regions of chromosome arms 4AS and 4AL due to pericentric inversion Inv(4AS;4AL)1 was detected, and breakpoints were validated with an optical Bionano genome map. Both breakpoints contained satellite DNA. The breakpoints of reciprocal translocation T(4AL;7BS)1 were also found. However, the breakpoints that generated paracentric inversion Inv(4AL;4AL)1 appeared to be collocated with the 4AL breakpoints that had produced Inv(4AS;4AL)1 and T(4AL;7BS)1. Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 either originated sequentially, and Inv(4AL;4AL)1 was produced by recurrent chromosome breaks at the same breakpoints that generated Inv(4AS;4AL)1 and T(4AL;7BS)1, or Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 originated simultaneously. We prefer the latter hypothesis since it makes fewer assumptions about the sequence of events that produced these chromosome rearrangements.
Collapse
Affiliation(s)
- Jan Dvorak
- Department of Plant Sciences, University of California, Davis, CA USA
| | - Le Wang
- Department of Plant Sciences, University of California, Davis, CA USA
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA USA
| | - Chad M. Jorgensen
- Department of Plant Sciences, University of California, Davis, CA USA
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA USA
| | - Karin R. Deal
- Department of Plant Sciences, University of California, Davis, CA USA
| | - Yong Q. Gu
- Crop Improvement and Genetics Research, USDA-ARS, Albany, CA USA
| | - Bikram S. Gill
- Department of Plant Pathology, Kansas State University, Manhattan, KS USA
| | - Assaf Distelfeld
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | - Katrien M. Devos
- Institute of Plant Breeding, Genetics and Genomics, Department of Crop and Soil Sciences, University of Georgia, Athens, GA USA
- Department of Plant Biology, University of Georgia, Athens, GA USA
| | - Peng Qi
- Institute of Plant Breeding, Genetics and Genomics, Department of Crop and Soil Sciences, University of Georgia, Athens, GA USA
- Department of Plant Biology, University of Georgia, Athens, GA USA
| | | |
Collapse
|
31
|
Single-molecule DNA-mapping and whole-genome sequencing of individual cells. Proc Natl Acad Sci U S A 2018; 115:11192-11197. [PMID: 30322920 DOI: 10.1073/pnas.1804194115] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
To elucidate cellular diversity and clonal evolution in tissues and tumors, one must resolve genomic heterogeneity in single cells. To this end, we have developed low-cost, mass-producible micro-/nanofluidic chips for DNA extraction from individual cells. These chips have modules that collect genomic DNA for sequencing or map genomic structure directly, on-chip, with denaturation-renaturation (D-R) optical mapping [Marie R, et al. (2013) Proc Natl Acad Sci USA 110:4893-4898]. Processing of single cells from the LS174T colorectal cancer cell line showed that D-R mapping of single molecules can reveal structural variation (SV) in the genome of single cells. In one experiment, we processed 17 fragments covering 19.8 Mb of the cell's genome. One megabase-large fragment aligned well to chromosome 19 with half its length, while the other half showed variable alignment. Paired-end single-cell sequencing supported this finding, revealing a region of complexity and a 50-kb deletion. Sequencing struggled, however, to detect a 20-kb gap that D-R mapping showed clearly in a megabase fragment that otherwise mapped well to the reference at the pericentromeric region of chromosome 4. Pericentromeric regions are complex and show substantial sequence homology between different chromosomes, making mapping of sequence reads ambiguous. Thus, D-R mapping directly, from a single molecule, revealed characteristics of the single-cell genome that were challenging for short-read sequencing.
Collapse
|
32
|
Bhandari AB, Reifenberger JG, Chuang HM, Cao H, Dorfman KD. Measuring the wall depletion length of nanoconfined DNA. J Chem Phys 2018; 149:104901. [PMID: 30219022 PMCID: PMC6135644 DOI: 10.1063/1.5040458] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 08/20/2018] [Indexed: 12/14/2022] Open
Abstract
Efforts to study the polymer physics of DNA confined in nanochannels have been stymied by a lack of consensus regarding its wall depletion length. We have measured this quantity in 38 nm wide, square silicon dioxide nanochannels for five different ionic strengths between 15 mM and 75 mM. Experiments used the Bionano Genomics Irys platform for massively parallel data acquisition, attenuating the effect of the sequence-dependent persistence length and finite-length effects by using nick-labeled E. coli genomic DNA with contour length separations of at least 30 µm (88 325 base pairs) between nick pairs. Over 5 × 106 measurements of the fractional extension were obtained from 39 291 labeled DNA molecules. Analyzing the stretching via Odijk's theory for a strongly confined wormlike chain yielded a linear relationship between the depletion length and the Debye length. This simple linear fit to the experimental data exhibits the same qualitative trend as previously defined analytical models for the depletion length but now quantitatively captures the experimental data.
Collapse
Affiliation(s)
- Aditya Bikram Bhandari
- Department of Chemical Engineering and Materials Science, University of Minnesota-Twin Cities, 421 Washington Ave. SE, Minneapolis, Minnesota 55455, USA
| | - Jeffrey G Reifenberger
- Bionano Genomics, Inc., 9640 Towne Centre Drive, Suite 100, San Diego, California 92121, USA
| | - Hui-Min Chuang
- Department of Chemical Engineering and Materials Science, University of Minnesota-Twin Cities, 421 Washington Ave. SE, Minneapolis, Minnesota 55455, USA
| | - Han Cao
- Bionano Genomics, Inc., 9640 Towne Centre Drive, Suite 100, San Diego, California 92121, USA
| | - Kevin D Dorfman
- Department of Chemical Engineering and Materials Science, University of Minnesota-Twin Cities, 421 Washington Ave. SE, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
33
|
Dvorak J, Wang L, Zhu T, Jorgensen CM, Deal KR, Dai X, Dawson MW, Müller HG, Luo MC, Ramasamy RK, Dehghani H, Gu YQ, Gill BS, Distelfeld A, Devos KM, Qi P, You FM, Gulick PJ, McGuire PE. Structural variation and rates of genome evolution in the grass family seen through comparison of sequences of genomes greatly differing in size. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 95:487-503. [PMID: 29770515 DOI: 10.1111/tpj.13964] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2018] [Revised: 05/04/2018] [Accepted: 05/08/2018] [Indexed: 05/05/2023]
Abstract
Homology was searched with genes annotated in the Aegilops tauschii pseudomolecules against genes annotated in the pseudomolecules of tetraploid wild emmer wheat, Brachypodium distachyon, sorghum and rice. Similar searches were performed with genes annotated in the rice pseudomolecules. Matrices of collinear genes and rearrangements in their order were constructed. Optical BioNano genome maps were constructed and used to validate rearrangements unique to the wild emmer and Ae. tauschii genomes. Most common rearrangements were short paracentric inversions and short intrachromosomal translocations. Intrachromosomal translocations outnumbered segmental intrachromosomal duplications. The densities of paracentric inversion lengths were approximated by exponential distributions in all six genomes. Densities of collinear genes along the Ae. tauschii chromosomes were highly correlated with meiotic recombination rates but those of rearrangements were not, suggesting different causes of the erosion of gene collinearity and evolution of major chromosome rearrangements. Frequent rearrangements sharing breakpoints suggested that chromosomes have been rearranged recurrently at some sites. The distal 4 Mb of the short arms of rice chromosomes Os11 and Os12 and corresponding regions in the sorghum, B. distachyon and Triticeae genomes contain clusters of interstitial translocations including from 1 to 7 collinear genes. The rates of acquisition of major rearrangements were greater in the large wild emmer wheat and Ae. tauschii genomes than in the lineage preceding their divergence or in the B. distachyon, rice and sorghum lineages. It is suggested that synergy between large quantities of dynamic transposable elements and annual growth habit have been the primary causes of the fast evolution of the Triticeae genomes.
Collapse
Affiliation(s)
- Jan Dvorak
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Le Wang
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Chad M Jorgensen
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Karin R Deal
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Xiongtao Dai
- Department of Statistics, University of California, Davis, CA, USA
| | - Matthew W Dawson
- Department of Statistics, University of California, Davis, CA, USA
| | | | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Ramesh K Ramasamy
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Hamid Dehghani
- Department of Plant Sciences, University of California, Davis, CA, USA
- Department of Plant Breeding, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
| | - Yong Q Gu
- Crop Improvement & Genetics Research, USDA-ARS, Albany, CA, USA
| | - Bikram S Gill
- Department of Plant Pathology, Kansas State University, Manhattan, KS, USA
| | - Assaf Distelfeld
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | - Katrien M Devos
- Institute of Plant Breeding, Genetics and Genomics (Department of Crop & Soil Sciences), University of Georgia, Athens, GA, USA
- Department of Plant Biology, University of Georgia, Athens, GA, USA
| | - Peng Qi
- Institute of Plant Breeding, Genetics and Genomics (Department of Crop & Soil Sciences), University of Georgia, Athens, GA, USA
- Department of Plant Biology, University of Georgia, Athens, GA, USA
| | - Frank M You
- Agriculture & Agri-Food Canada, Morden, MB, Canada
| | - Patrick J Gulick
- Department of Biology, Concordia University, Montreal, QC, Canada
| | - Patrick E McGuire
- Department of Plant Sciences, University of California, Davis, CA, USA
| |
Collapse
|
34
|
Gaiero P, Šimková H, Vrána J, Santiñaque FF, López-Carro B, Folle GA, van de Belt J, Peters SA, Doležel J, de Jong H. Intact DNA purified from flow-sorted nuclei unlocks the potential of next-generation genome mapping and assembly in Solanum species. MethodsX 2018; 5:328-336. [PMID: 30046519 PMCID: PMC6058011 DOI: 10.1016/j.mex.2018.03.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 03/31/2018] [Indexed: 12/21/2022] Open
Abstract
Next-generation genome mapping through nanochannels (Bionano optical mapping) of plant genomes brings genome assemblies to the ‘nearly-finished’ level for reliable and detailed gene annotations and assessment of structural variations. Despite the recent progress in its development, researchers face the technical challenges of obtaining sufficient high molecular weight (HMW) nuclear DNA due to cell walls which are difficult to disrupt and to the presence of cytoplasmic polyphenols and polysaccharides that co-precipitate or are covalently bound to DNA and might cause oxidation and/or affect the access of nicking enzymes to DNA, preventing downstream applications. Here we describe important improvements for obtaining HMW DNA that we tested on Solanum crops and wild relatives. The methods that we further elaborated and refined focus on Improving flexibility of using different tissues as source materials, like fast-growing root tips and young leaves from seedlings or in vitro plantlets. Obtaining nuclei suspensions through either lab homogenizers or by chopping. Increasing flow sorting efficiency using DAPI (4′,6-diamidino-2-phenylindole) and PI (propidium iodide) DNA stains, with different lasers (UV or 488 nm) and sorting platforms such as the FACSAria and FACSVantage flow sorters, thus making it appropriate for more laboratories working on plant genomics.
The obtained nuclei are embedded into agarose plugs for processing and isolating uncontaminated HMW DNA, which is a prerequisite for nanochannel-based next-generation optical mapping strategies.
Collapse
Affiliation(s)
- Paola Gaiero
- Faculty of Agronomy, University of the Republic, Montevideo, Uruguay
- Laboratory of Genetics, Wageningen University & Research, Wageningen, The Netherlands
| | - Hana Šimková
- Centre of Plant Structural and Functional Genomics, Institute of Experimental Botany, Olomouc, Czech Republic
| | - Jan Vrána
- Centre of Plant Structural and Functional Genomics, Institute of Experimental Botany, Olomouc, Czech Republic
| | - Federico F. Santiñaque
- Flow Cytometry and Cell Sorting Core, Instituto de Investigaciones Biológicas Clemente Estable (IIBCE), Montevideo, Uruguay
| | - Beatriz López-Carro
- Flow Cytometry and Cell Sorting Core, Instituto de Investigaciones Biológicas Clemente Estable (IIBCE), Montevideo, Uruguay
| | - Gustavo A. Folle
- Flow Cytometry and Cell Sorting Core, Instituto de Investigaciones Biológicas Clemente Estable (IIBCE), Montevideo, Uruguay
| | - José van de Belt
- Laboratory of Genetics, Wageningen University & Research, Wageningen, The Netherlands
| | - Sander A. Peters
- Applied Bioinformatics, Department of Bioscience, Wageningen University & Research, Wageningen, The Netherlands
| | - Jaroslav Doležel
- Centre of Plant Structural and Functional Genomics, Institute of Experimental Botany, Olomouc, Czech Republic
| | - Hans de Jong
- Laboratory of Genetics, Wageningen University & Research, Wageningen, The Netherlands
- Corresponding author: Laboratory of Genetics, Wageningen University Research, Droevendaalsesteeg 1, P.O. Box 16, 6708 PB, Wageningen, The Netherlands.
| |
Collapse
|
35
|
You FM, Xiao J, Li P, Yao Z, Jia G, He L, Zhu T, Luo MC, Wang X, Deyholos MK, Cloutier S. Chromosome-scale pseudomolecules refined by optical, physical and genetic maps in flax. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 95:371-384. [PMID: 29681136 DOI: 10.1111/tpj.13944] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Revised: 03/19/2018] [Accepted: 03/22/2018] [Indexed: 05/19/2023]
Abstract
Genomes of varying sizes have been sequenced with next-generation sequencing platforms. However, most reference sequences include draft unordered scaffolds containing chimeras caused by mis-scaffolding. A BioNano genome (BNG) optical map was constructed to improve the previously sequenced flax genome (Linum usitatissimum L., 2n = 30, about 373 Mb), which consisted of 3852 scaffolds larger than 1 kb and totalling 300.6 Mb. The high-resolution BNG map of cv. CDC Bethune totalled 317 Mb and consisted of 251 BNG contigs with an N50 of 2.15 Mb. A total of 622 scaffolds (286.6 Mb, 94.9%) aligned to 211 BNG contigs (298.6 Mb, 94.2%). Of those, 99 scaffolds, diagnosed to contain assembly errors, were refined into 225 new scaffolds. Using the newly refined scaffold sequences and the validated bacterial artificial chromosome-based physical map of CDC Bethune, the 211 BNG contigs were scaffolded into 94 super-BNG contigs (N50 of 6.64 Mb) that were further assigned to the 15 flax chromosomes using the genetic map. The pseudomolecules total about 316 Mb, with individual chromosomes of 15.6 to 29.4 Mb, and cover 97% of the annotated genes. Evidence from the chromosome-scale pseudomolecules suggests that flax has undergone palaeopolyploidization and mesopolyploidization events, followed by rearrangements and deletions or fusion of chromosome arms from an ancient progenitor with a haploid chromosome number of eight.
Collapse
Affiliation(s)
- Frank M You
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB, R6M 1Y5, Canada
| | - Jin Xiao
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB, R6M 1Y5, Canada
- State Key Lab of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Pingchuan Li
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB, R6M 1Y5, Canada
| | - Zhen Yao
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB, R6M 1Y5, Canada
| | - Gaofeng Jia
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB, R6M 1Y5, Canada
- Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK, S7N 5A8, Canada
| | - Liqiang He
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB, R6M 1Y5, Canada
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Xiue Wang
- State Key Lab of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | | | - Sylvie Cloutier
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON, K1A 0C6, Canada
| |
Collapse
|
36
|
Zhu T, Hu Z, Rodriguez JC, Deal KR, Dvorak J, Vogel JP, Liu Z, Luo MC. Analysis of Brachypodium genomes with genome-wide optical maps. Genome 2018; 61:559-565. [PMID: 29883550 DOI: 10.1139/gen-2018-0013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Brachypodium distachyon (n = 5) is a diploid and has been widely used as a genetic model. Brachypodium stacei (n = 10) and B. hybridum (n = 15) are species that are related to B. distachyon, leading to an hypothesis that they are part of a polyploid series based on x = 5. Several lines of evidence suggest that this hypothesis is incorrect and that the genomes of the three taxa may have evolved by a more complex process. We constructed an optical whole-genome BioNano genome (BNG) map for each species and did pairwise alignment of the BNG maps. The maps showed that B. distachyon and B. stacei are both diploid, in spite of B. stacei having twice as many chromosomes as B. distachyon, and that B. hybridum is an allopolyploid formed from hybridization between B. distachyon and B. stacei. This study also demonstrated the use of BNG maps in the detection and quantification of structural variants among the genomes.
Collapse
Affiliation(s)
- Tingting Zhu
- a Department of Plant Sciences, University of California, Davis, CA 95616, USA
| | - Zhaorong Hu
- a Department of Plant Sciences, University of California, Davis, CA 95616, USA.,b State Key Laboratory for Agrobiotechnology, Key Laboratory of Crop Heterosis Utilization (MOE), China Agricultural University, Beijing, 100193, China
| | - Juan C Rodriguez
- a Department of Plant Sciences, University of California, Davis, CA 95616, USA
| | - Karin R Deal
- a Department of Plant Sciences, University of California, Davis, CA 95616, USA
| | - Jan Dvorak
- a Department of Plant Sciences, University of California, Davis, CA 95616, USA
| | - John P Vogel
- c DOE Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, CA 94598, USA
| | - Zhiyong Liu
- d State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Ming-Cheng Luo
- a Department of Plant Sciences, University of California, Davis, CA 95616, USA
| |
Collapse
|
37
|
Chan EKF, Cameron DL, Petersen DC, Lyons RJ, Baldi BF, Papenfuss AT, Thomas DM, Hayes VM. Optical mapping reveals a higher level of genomic architecture of chained fusions in cancer. Genome Res 2018; 28:726-738. [PMID: 29618486 PMCID: PMC5932612 DOI: 10.1101/gr.227975.117] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 03/21/2018] [Indexed: 01/21/2023]
Abstract
Genomic rearrangements are common in cancer, with demonstrated links to disease progression and treatment response. These rearrangements can be complex, resulting in fusions of multiple chromosomal fragments and generation of derivative chromosomes. Although methods exist for detecting individual fusions, they are generally unable to reconstruct complex chained events. To overcome these limitations, we adopted a new optical mapping approach, allowing megabase-length genome maps to be reconstructed and rearranged genomes to be visualized without loss of integrity. Whole-genome mapping (Bionano Genomics) of a well-studied highly rearranged liposarcoma cell line resulted in 3338 assembled consensus genome maps, including 72 fusion maps. These fusion maps represent 112.3 Mb of highly rearranged genomic regions, illuminating the complex architecture of chained fusions, including content, order, orientation, and size. Spanning the junction of 147 chromosomal translocations, we found a total of 28 Mb of interspersed sequences that could not be aligned to the reference genome. Traversing these interspersed sequences using short-read sequencing breakpoint calls, we were able to identify and place 399 sequencing fragments within the optical mapping gaps, thus illustrating the complementary nature of optical mapping and short-read sequencing. We demonstrate that optical mapping provides a powerful new approach for capturing a higher level of complex genomic architecture, creating a scaffold for renewed interpretation of sequencing data of particular relevance to human cancer.
Collapse
Affiliation(s)
- Eva K F Chan
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia.,St Vincent's Clinical School, University of New South Wales, New South Wales 2052, Australia
| | - Daniel L Cameron
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Victoria 3052, Australia.,Department of Medical Biology, University of Melbourne, Victoria 3010, Australia
| | - Desiree C Petersen
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia.,St Vincent's Clinical School, University of New South Wales, New South Wales 2052, Australia
| | - Ruth J Lyons
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia
| | - Benedetta F Baldi
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia
| | - Anthony T Papenfuss
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Victoria 3052, Australia.,Department of Medical Biology, University of Melbourne, Victoria 3010, Australia.,Department of Mathematics and Statistics, University of Melbourne, Victoria 3010, Australia.,Sir Peter MacCallum Department of Oncology, University of Melbourne, Victoria 3010, Australia.,Bioinformatics and Cancer Genomics, Peter MacCallum Cancer Centre, Victoria 3002, Australia
| | - David M Thomas
- The Kinghorn Cancer Centre, Garvan Institute of Medical Research, New South Wales 2010, Australia.,Cancer Division, Garvan Institute of Medical Research, New South Wales 2010, Australia
| | - Vanessa M Hayes
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia.,St Vincent's Clinical School, University of New South Wales, New South Wales 2052, Australia.,School of Health Systems and Public Health, University of Pretoria, Hatfield 0002, South Africa.,Central Clinical School, University of Sydney, New South Wales 2006, Australia
| |
Collapse
|
38
|
Huo N, Zhu T, Altenbach S, Dong L, Wang Y, Mohr T, Liu Z, Dvorak J, Luo MC, Gu YQ. Dynamic Evolution of α-Gliadin Prolamin Gene Family in Homeologous Genomes of Hexaploid Wheat. Sci Rep 2018; 8:5181. [PMID: 29581476 PMCID: PMC5980091 DOI: 10.1038/s41598-018-23570-5] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Accepted: 03/13/2018] [Indexed: 12/21/2022] Open
Abstract
Wheat Gli-2 loci encode complex groups of α-gliadin prolamins that are important for breadmaking, but also major triggers of celiac disease (CD). Elucidation of α-gliadin evolution provides knowledge to produce wheat with better end-use properties and reduced immunogenic potential. The Gli-2 loci contain a large number of tandemly duplicated genes and highly repetitive DNA, making sequence assembly of their genomic regions challenging. Here, we constructed high-quality sequences spanning the three wheat homeologous α-gliadin loci by aligning PacBio-based sequence contigs with BioNano genome maps. A total of 47 α-gliadin genes were identified with only 26 encoding intact full-length protein products. Analyses of α-gliadin loci and phylogenetic tree reconstruction indicate significant duplications of α-gliadin genes in the last ~2.5 million years after the divergence of the A, B and D genomes, supporting its rapid lineage-independent expansion in different Triticeae genomes. We showed that dramatic divergence in expression of α-gliadin genes could not be attributed to sequence variations in the promoter regions. The study also provided insights into the evolution of CD epitopes and identified a single indel event in the hexaploid wheat D genome that likely resulted in the generation of the highly toxic 33-mer CD epitope.
Collapse
Affiliation(s)
- Naxin Huo
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, California, 94710, USA.,Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Susan Altenbach
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, California, 94710, USA
| | - Lingli Dong
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yi Wang
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, California, 94710, USA
| | - Toni Mohr
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, California, 94710, USA
| | - Zhiyong Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Jan Dvorak
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA.
| | - Yong Q Gu
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, California, 94710, USA.
| |
Collapse
|
39
|
Loose MW. The potential impact of nanopore sequencing on human genetics. Hum Mol Genet 2018; 26:R202-R207. [PMID: 28977449 DOI: 10.1093/hmg/ddx287] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Accepted: 07/17/2017] [Indexed: 12/21/2022] Open
Abstract
Nanopore sequencing has been available to researchers for a little over 3 years. Recently, the milestone of sequencing and assembling a human genome on this platform was achieved for the first time. Significant improvements to the platform in yield and accuracy, coupled with higher throughput nanopore sequencers, mean that human genome sequencing at scale is now possible. Here, a brief recent history of the nanopore platform is provided, key papers and innovations are highlighted and some of the challenges for the future are discussed.
Collapse
Affiliation(s)
- Matthew W Loose
- School of Life Sciences, University of Nottingham, Nottingham NG7 2UH, UK
| |
Collapse
|
40
|
Reifenberger JG, Cao H, Dorfman KD. Odijk excluded volume interactions during the unfolding of DNA confined in a nanochannel. Macromolecules 2018; 51:1172-1180. [PMID: 29479117 PMCID: PMC5823525 DOI: 10.1021/acs.macromol.7b02466] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We report experimental data on the unfolding of human and E. coli genomic DNA molecules shortly after injection into a 45 nm nanochannel. The unfolding dynamics are deterministic, consistent with previous experiments and modeling in larger channels, and do not depend on the biological origin of the DNA. The measured entropic unfolding force per friction per unit contour length agrees with that predicted by combining the Odijk excluded volume with numerical calculations of the Kirkwood diffusivity of confined DNA. The time scale emerging from our analysis has implications for genome mapping in nanochannels, especially as the technology moves towards longer DNA, by setting a lower bound for the delay time before making a measurement.
Collapse
Affiliation(s)
| | - Han Cao
- BioNano Genomics Inc., 9640 Towne Centre Drive, Suite 100, San Diego, CA 92121
| | - Kevin D. Dorfman
- Department of Chemical Engineering and Materials Science, University of Minnesota – Twin Cities, 421 Washington Ave SE, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
41
|
Marie R, Pedersen JN, Mir KU, Bilenberg B, Kristensen A. Concentrating and labeling genomic DNA in a nanofluidic array. NANOSCALE 2018; 10:1376-1382. [PMID: 29300409 DOI: 10.1039/c7nr06016e] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Nucleotide incorporation by DNA polymerase forms the basis of DNA sequencing-by-synthesis. In current platforms, either the single-stranded DNA or the enzyme is immobilized on a solid surface to locate the incorporation of individual nucleotides in space and/or time. Solid-phase reactions may, however, hinder the polymerase activity. We demonstrate a device and a protocol for the enzymatic labeling of genomic DNA arranged in a dense array of single molecules without attaching the enzyme or the DNA to a surface. DNA molecules accumulate in a dense array of pits embedded within a nanoslit due to entropic trapping. We then perform ϕ29 polymerase extension from single-strand nicks created on the trapped molecules to incorporate fluorescent nucleotides into the DNA. The array of entropic traps can be loaded with λ-DNA molecules to more than 90% of capacity at a flow rate of 10 pL min-1. The final concentration can reach up to 100 μg mL-1, and the DNA is eluted from the array by increasing the flow rate. The device may be an important preparative module for carrying out enzymatic processing on DNA extracted from single-cells in a microfluidic chip.
Collapse
Affiliation(s)
- Rodolphe Marie
- Department of Micro and Nanotechnology, Technical University of Denmark, Kongens Lyngby, Denmark.
| | | | | | | | | |
Collapse
|
42
|
Abstract
Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains.
Collapse
|
43
|
Huo N, Zhang S, Zhu T, Dong L, Wang Y, Mohr T, Hu T, Liu Z, Dvorak J, Luo MC, Wang D, Lee JY, Altenbach S, Gu YQ. Gene Duplication and Evolution Dynamics in the Homeologous Regions Harboring Multiple Prolamin and Resistance Gene Families in Hexaploid Wheat. FRONTIERS IN PLANT SCIENCE 2018; 9:673. [PMID: 29875781 PMCID: PMC5974169 DOI: 10.3389/fpls.2018.00673] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 05/03/2018] [Indexed: 05/19/2023]
Abstract
Improving end-use quality and disease resistance are important goals in wheat breeding. The genetic loci controlling these traits are highly complex, consisting of large families of prolamin and resistance genes with members present in all three homeologous A, B, and D genomes in hexaploid bread wheat. Here, orthologous regions harboring both prolamin and resistance gene loci were reconstructed and compared to understand gene duplication and evolution in different wheat genomes. Comparison of the two orthologous D regions from the hexaploid wheat Chinese Spring and the diploid progenitor Aegilops tauschii revealed their considerable difference due to the presence of five large structural variations with sizes ranging from 100 kb to 2 Mb. As a result, 44% of the Ae. tauschii and 71% of the Chinese Spring sequences in the analyzed regions, including 79 genes, are not shared. Gene rearrangement events, including differential gene duplication and deletion in the A, B, and D regions, have resulted in considerable erosion of gene collinearity in the analyzed regions, suggesting rapid evolution of prolamin and resistance gene families after the separation of the three wheat genomes. We hypothesize that this fast evolution is attributed to the co-evolution of the two gene families dispersed within a high recombination region. The identification of a full set of prolamin genes facilitated transcriptome profiling and revealed that the A genome contributes the least to prolamin expression because of its smaller number of expressed intact genes and their low expression levels, while the B and D genomes contribute similarly.
Collapse
Affiliation(s)
- Naxin Huo
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, United States
- Department of Plant Sciences, University of California, Davis, Davis, CA, United States
| | - Shengli Zhang
- Hena Institute of Science and Technology, Xinxiang, China
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, Davis, CA, United States
| | - Lingli Dong
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Yi Wang
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, United States
| | - Toni Mohr
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, United States
| | - Tiezhu Hu
- Hena Institute of Science and Technology, Xinxiang, China
| | - Zhiyong Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Jan Dvorak
- Department of Plant Sciences, University of California, Davis, Davis, CA, United States
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, Davis, CA, United States
| | - Daowen Wang
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Jong-Yeol Lee
- National Institute of Agricultural Science, Rural Development Administration, Jeonju, South Korea
| | - Susan Altenbach
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, United States
- *Correspondence: Susan Altenbach, Yong Q. Gu,
| | - Yong Q. Gu
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, United States
- *Correspondence: Susan Altenbach, Yong Q. Gu,
| |
Collapse
|
44
|
Yuan Y, Milec Z, Bayer PE, Vrána J, Doležel J, Edwards D, Erskine W, Kaur P. Large-Scale Structural Variation Detection in Subterranean Clover Subtypes Using Optical Mapping. FRONTIERS IN PLANT SCIENCE 2018; 9:971. [PMID: 30065731 PMCID: PMC6056659 DOI: 10.3389/fpls.2018.00971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Accepted: 06/15/2018] [Indexed: 05/05/2023]
Abstract
We selected two genetically diverse subspecies of the Trifolium model species, subterranean clover cvs. Daliak and Yarloop. The structural variations (SVs) discovered by Bionano optical mapping (BOM) were validated using Illumina short reads. In the analysis, BOM identified 12 large-scale regions containing deletions and 19 regions containing insertions in Yarloop. The 12 large-scale regions contained 71 small deletions when validated by Illumina short reads. The results suggest that BOM could detect the total size of deletions and insertions, but it could not precisely report the location and actual quantity of SVs in the genome. Nucleotide-level validation is crucial to confirm and characterize SVs reported by optical mapping. The accuracy of SV detection by BOM is highly dependent on the quality of reference genomes and the density of selected nickases.
Collapse
Affiliation(s)
- Yuxuan Yuan
- School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
- Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Zbyněk Milec
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - Philipp E. Bayer
- School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
- Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Jan Vrána
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - Jaroslav Doležel
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - David Edwards
- School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
- Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - William Erskine
- Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
- Centre for Plant Genetics and Breeding, School of Agriculture and Environment, The University of Western Australia, Perth, WA, Australia
| | - Parwinder Kaur
- Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
- Centre for Plant Genetics and Breeding, School of Agriculture and Environment, The University of Western Australia, Perth, WA, Australia
- Telethon Kids Institute, Perth, WA, Australia
- *Correspondence: Parwinder Kaur,
| |
Collapse
|
45
|
Abstract
Repeated sequences make up approximately two-thirds of the human genome, which become fully accountable when very large DNA molecules are analyzed. Long, single DNA molecules are problematic using common experimental techniques and fluidic devices because of mechanical considerations that include breakage, dealing with the massive size of these coils, or the huge length of stretched DNAs. Accordingly, we harness analyte “issues” as exploitable advantages by invention and characterization of the “molecular gate,” which controls and synchronizes formation of stretched molecules as DNA dumbbells within nanoslit geometries that may also offer new routes to separation. This was accomplished by theoretical studies and experiments leveraging a series of electrical forces acting on DNA molecules, device walls, and the fluid flows within our devices. Very large DNA molecules enable comprehensive analysis of complex genomes, such as human, cancer, and plants because they span across sequence repeats and complex somatic events. When physically manipulated, or analyzed as single molecules, long polyelectrolytes are problematic because of mechanical considerations that include shear-mediated breakage, dealing with the massive size of these coils, or the length of stretched DNAs using common experimental techniques and fluidic devices. Accordingly, we harness analyte “issues” as exploitable advantages by our invention and characterization of the “molecular gate,” which controls and synchronizes formation of stretched DNA molecules as DNA dumbbells within nanoslit geometries. Molecular gate geometries comprise micro- and nanoscale features designed to synergize very low ionic strength conditions in ways we show effectively create an “electrostatic bottle.” This effect greatly enhances molecular confinement within large slit geometries and supports facile, synchronized electrokinetic loading of nanoslits, even without dumbbell formation. Device geometries were considered at the molecular and continuum scales through computer simulations, which also guided our efforts to optimize design and functionalities. In addition, we show that the molecular gate may govern DNA separations because DNA molecules can be electrokinetically triggered, by varying applied voltage, to enter slits in a size-dependent manner. Lastly, mapping the Mesoplasmaflorum genome, via synchronized dumbbell formation, validates our nascent approach as a viable starting point for advanced development that will build an integrated system capable of large-scale genome analysis.
Collapse
|
46
|
Li L, Leung AKY, Kwok TP, Lai YYY, Pang IK, Chung GTY, Mak ACY, Poon A, Chu C, Li M, Wu JJK, Lam ET, Cao H, Lin C, Sibert J, Yiu SM, Xiao M, Lo KW, Kwok PY, Chan TF, Yip KY. OMSV enables accurate and comprehensive identification of large structural variations from nanochannel-based single-molecule optical maps. Genome Biol 2017; 18:230. [PMID: 29195502 PMCID: PMC5709945 DOI: 10.1186/s13059-017-1356-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 11/03/2017] [Indexed: 12/20/2022] Open
Abstract
We present a new method, OMSV, for accurately and comprehensively identifying structural variations (SVs) from optical maps. OMSV detects both homozygous and heterozygous SVs, SVs of various types and sizes, and SVs with or without creating or destroying restriction sites. We show that OMSV has high sensitivity and specificity, with clear performance gains over the latest method. Applying OMSV to a human cell line, we identified hundreds of SVs >2 kbp, with 68 % of them missed by sequencing-based callers. Independent experimental validation confirmed the high accuracy of these SVs. The OMSV software is available at http://yiplab.cse.cuhk.edu.hk/omsv/ .
Collapse
Affiliation(s)
- Le Li
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Alden King-Yung Leung
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Tsz-Piu Kwok
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Yvonne Y Y Lai
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, California, USA
| | - Iris K Pang
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Grace Tin-Yun Chung
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Angel C Y Mak
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, California, USA
| | - Annie Poon
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, California, USA
| | - Catherine Chu
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, California, USA
| | - Menglu Li
- Department of Computer Science, The University of Hong Kong, Pokfulam, Hong Kong
| | - Jacob J K Wu
- Department of Computer Science, The University of Hong Kong, Pokfulam, Hong Kong
| | | | - Han Cao
- BioNano Genomics, San Diego, California, USA
| | - Chin Lin
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, California, USA
| | - Justin Sibert
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, Pennsylvania, USA
| | - Siu-Ming Yiu
- Department of Computer Science, The University of Hong Kong, Pokfulam, Hong Kong
| | - Ming Xiao
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, Pennsylvania, USA
| | - Kwok-Wai Lo
- Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, California, USA.,Institute for Human Genetics, University of California San Francisco, San Francisco, California, USA
| | - Ting-Fung Chan
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. .,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. .,Hong Kong Institute of Diabetes and Obesity, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. .,CUHK-BGI Innovation Institute of Trans-omics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.
| | - Kevin Y Yip
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. .,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. .,Hong Kong Institute of Diabetes and Obesity, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. .,CUHK-BGI Innovation Institute of Trans-omics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.
| |
Collapse
|
47
|
Kachalova GS, Popov AN, Yunusova AK, Artyukh RI, Perevyazova TA, Zheleznaya LA, Atanasov BP. Global conformational changes induced by the removal of the carboxyl group of D456 in the cleavage scaffold of nickase BspD6I: Structural and electrostatic analysis. CRYSTALLOGR REP+ 2017. [DOI: 10.1134/s1063774517060141] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
48
|
Huo N, Dong L, Zhang S, Wang Y, Zhu T, Mohr T, Altenbach S, Liu Z, Dvorak J, Anderson OD, Luo MC, Wang D, Gu YQ. New insights into structural organization and gene duplication in a 1.75-Mb genomic region harboring the α-gliadin gene family in Aegilops tauschii, the source of wheat D genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 92:571-583. [PMID: 28857322 DOI: 10.1111/tpj.13675] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Revised: 08/18/2017] [Accepted: 08/22/2017] [Indexed: 06/07/2023]
Abstract
Among the wheat prolamins important for its end-use traits, α-gliadins are the most abundant, and are also a major cause of food-related allergies and intolerances. Previous studies of various wheat species estimated that between 25 and 150 α-gliadin genes reside in the Gli-2 locus regions. To better understand the evolution of this complex gene family, the DNA sequence of a 1.75-Mb genomic region spanning the Gli-2 locus was analyzed in the diploid grass, Aegilops tauschii, the ancestral source of D genome in hexaploid bread wheat. Comparison with orthologous regions from rice, sorghum, and Brachypodium revealed rapid and dynamic changes only occurring to the Ae. tauschii Gli-2 region, including insertions of high numbers of non-syntenic genes and a high rate of tandem gene duplications, the latter of which have given rise to 12 copies of α-gliadin genes clustered within a 550-kb region. Among them, five copies have undergone pseudogenization by various mutation events. Insights into the evolutionary relationship of the duplicated α-gliadin genes were obtained from their genomic organization, transcription patterns, transposable element insertions and phylogenetic analyses. An ancestral glutamate-like receptor (GLR) gene encoding putative amino acid sensor in all four grass species has duplicated only in Ae. tauschii and generated three more copies that are interspersed with the α-gliadin genes. Phylogenetic inference and different gene expression patterns support functional divergence of the Ae. tauschii GLR copies after duplication. Our results suggest that the duplicates of α-gliadin and GLR genes have likely taken different evolutionary paths; conservation for the former and neofunctionalization for the latter.
Collapse
Affiliation(s)
- Naxin Huo
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Lingli Dong
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Shengli Zhang
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
- Henan Institute of Science and Technology, Xinxiang, 453003, China
| | - Yi Wang
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Toni Mohr
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
| | - Susan Altenbach
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
| | - Zhiyong Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Jan Dvorak
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Olin D Anderson
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Daowen Wang
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yong Q Gu
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, CA, 94710, USA
| |
Collapse
|
49
|
Young E, Pastor S, Rajagopalan R, McCaffrey J, Sibert J, Mak ACY, Kwok PY, Riethman H, Xiao M. High-throughput single-molecule mapping links subtelomeric variants and long-range haplotypes with specific telomeres. Nucleic Acids Res 2017; 45:e73. [PMID: 28180280 PMCID: PMC5605236 DOI: 10.1093/nar/gkx017] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Accepted: 02/07/2017] [Indexed: 01/22/2023] Open
Abstract
Accurate maps and DNA sequences for human subtelomere regions, along with detailed knowledge of subtelomere variation and long-range telomere-terminal haplotypes in individuals, are critical for understanding telomere function and its roles in human biology. Here, we use a highly automated whole genome mapping technology in nano-channel arrays to analyze large terminal human chromosome segments extending from chromosome-specific subtelomere sequences through subtelomeric repeat regions to terminal (TTAGGG)n repeat tracts. We establish detailed maps for subtelomere gap regions in the human reference sequence, detect many new large subtelomeric variants and demonstrate the feasibility of long-range haplotyping through segmentally duplicated subtelomere regions. These features make the method a uniquely valuable new tool for improving the quality of genome assemblies in complex DNA regions. Based on single molecule mapping of telomere-terminal DNA fragments, we provide proof of principle for a novel method to estimate telomere lengths linked to distinguishable telomeric haplotypes; this single-telomere genotyping method may ultimately enable delineation of human cis elements involved in telomere length regulation.
Collapse
Affiliation(s)
- Eleanor Young
- Drexel University, School of Biomedical Engineering, Philadelphia, PA, 19104 USA
| | - Steven Pastor
- Drexel University, School of Biomedical Engineering, Philadelphia, PA, 19104 USA
| | | | - Jennifer McCaffrey
- Drexel University, School of Biomedical Engineering, Philadelphia, PA, 19104 USA
| | - Justin Sibert
- Drexel University, School of Biomedical Engineering, Philadelphia, PA, 19104 USA
| | - Angel C Y Mak
- Cardiovascular Research Institute, University of California, San Francisco, CA, 94158 USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, University of California, San Francisco, CA, 94158 USA
| | - Harold Riethman
- Old Dominion University, Medical Diagnostic and Translational Sciences, Norfolk, VA, 23529 USA
| | - Ming Xiao
- Drexel University, School of Biomedical Engineering, Philadelphia, PA, 19104 USA.,Institute of Molecular Medicine and Infectious Disease, School of Medicine, Drexel University, Philadelphia, PA, 19102 USA
| |
Collapse
|
50
|
Yuan Y, Bayer PE, Scheben A, Chan CKK, Edwards D. BioNanoAnalyst: a visualisation tool to assess genome assembly quality using BioNano data. BMC Bioinformatics 2017; 18:323. [PMID: 28666410 PMCID: PMC5493081 DOI: 10.1186/s12859-017-1735-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Accepted: 06/20/2017] [Indexed: 12/14/2022] Open
Abstract
Background Reference genome assemblies are valuable, as they provide insights into gene content, genetic evolution and domestication. The higher the quality of a reference genome assembly the more accurate the downstream analysis will be. During the last few years, major efforts have been made towards improving the quality of genome assemblies. However, erroneous and incomplete assemblies are still common. Complementary to DNA sequencing technologies, optical mapping has advanced genomic studies by facilitating the production of genome scaffolds and assessing structural variation. However, there are few tools available to comprehensively examine misassemblies in reference genome sequences using optical map data. Results We present BioNanoAnalyst, a software package to examine genome assemblies based on restriction endonuclease cut sites and optical map data. A graphical user interface (GUI) allows users to assess reference genome sequences on different computer platforms without the requirement of programming knowledge. The zoom function makes visualisation convenient, while a GFF3 format output file gives an option to directly visualise questionable assembly regions by location and nucleotides following import into a local genome browser. Conclusions BioNanoAnalyst is a tool to identify misassemblies in a reference genome sequence using optical map data. With the reported information, users can rapidly identify assembly errors and correct them using other software tools, which could facilitate an accurate downstream analysis. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1735-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yuxuan Yuan
- School of Biological Sciences, the University of Western Australia, Perth, WA, Australia
| | - Philipp E Bayer
- School of Biological Sciences, the University of Western Australia, Perth, WA, Australia
| | - Armin Scheben
- School of Biological Sciences, the University of Western Australia, Perth, WA, Australia
| | - Chon-Kit Kenneth Chan
- School of Biological Sciences, the University of Western Australia, Perth, WA, Australia
| | - David Edwards
- School of Biological Sciences, the University of Western Australia, Perth, WA, Australia.
| |
Collapse
|