1
|
Chen C, Li R, Sun J, Zhu Y, Jiang L, Li J, Fu F, Wan J, Guo F, An X, Wang Y, Fan L, Sun Y, Guo X, Zhao S, Wang W, Zeng F, Yang Y, Ni P, Ding Y, Xiang B, Peng Z, Liao C. Noninvasive prenatal testing of α-thalassemia and β-thalassemia through population-based parental haplotyping. Genome Med 2021; 13:18. [PMID: 33546747 PMCID: PMC7866698 DOI: 10.1186/s13073-021-00836-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 01/20/2021] [Indexed: 02/07/2023] Open
Abstract
Background Noninvasive prenatal testing (NIPT) of recessive monogenic diseases depends heavily on knowing the correct parental haplotypes. However, the currently used family-based haplotyping method requires pedigrees, and molecular haplotyping is highly challenging due to its high cost, long turnaround time, and complexity. Here, we proposed a new two-step approach, population-based haplotyping-NIPT (PBH-NIPT), using α-thalassemia and β-thalassemia as prototypes. Methods First, we deduced parental haplotypes with Beagle 4.0 with training on a large retrospective carrier screening dataset (4356 thalassemia carrier screening-positive cases). Second, we inferred fetal haplotypes using a parental haplotype-assisted hidden Markov model (HMM) and the Viterbi algorithm. Results With this approach, we enrolled 59 couples at risk of having a fetus with thalassemia and successfully inferred 94.1% (111/118) of fetal alleles. We confirmed these alleles by invasive prenatal diagnosis, with 99.1% (110/111) accuracy (95% CI, 95.1–100%). Conclusions These results demonstrate that PBH-NIPT is a sensitive, fast, and inexpensive strategy for NIPT of thalassemia. Supplementary Information The online version contains supplementary material available at 10.1186/s13073-021-00836-8.
Collapse
Affiliation(s)
- Chao Chen
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Ru Li
- Department of Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, China
| | - Jun Sun
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Yaping Zhu
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Lu Jiang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Jian Li
- Department of Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, China
| | - Fang Fu
- Department of Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, China
| | - Junhui Wan
- Department of Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, China
| | - Fengyu Guo
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Xiaoying An
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Yaoshen Wang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Linlin Fan
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Yan Sun
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 490079, China
| | - Xiaosen Guo
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Sumin Zhao
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Wanyang Wang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Fanwei Zeng
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Yun Yang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 490079, China.,Department of Obstetrics and Gynecology, The Second Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, China
| | - Peixiang Ni
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Yi Ding
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.,Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China
| | - Bixia Xiang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Zhiyu Peng
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China.
| | - Can Liao
- Department of Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, China.
| |
Collapse
|
2
|
Wang O, Chin R, Cheng X, Wu MKY, Mao Q, Tang J, Sun Y, Anderson E, Lam HK, Chen D, Zhou Y, Wang L, Fan F, Zou Y, Xie Y, Zhang RY, Drmanac S, Nguyen D, Xu C, Villarosa C, Gablenz S, Barua N, Nguyen S, Tian W, Liu JS, Wang J, Liu X, Qi X, Chen A, Wang H, Dong Y, Zhang W, Alexeev A, Yang H, Wang J, Kristiansen K, Xu X, Drmanac R, Peters BA. Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly. Genome Res 2019; 29:798-808. [PMID: 30940689 PMCID: PMC6499310 DOI: 10.1101/gr.245126.118] [Citation(s) in RCA: 130] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 03/21/2019] [Indexed: 01/25/2023]
Abstract
Here, we describe single-tube long fragment read (stLFR), a technology that enables sequencing of data from long DNA molecules using economical second-generation sequencing technology. It is based on adding the same barcode sequence to subfragments of the original long DNA molecule (DNA cobarcoding). To achieve this efficiently, stLFR uses the surface of microbeads to create millions of miniaturized barcoding reactions in a single tube. Using a combinatorial process, up to 3.6 billion unique barcode sequences were generated on beads, enabling practically nonredundant cobarcoding with 50 million barcodes per sample. Using stLFR, we demonstrate efficient unique cobarcoding of more than 8 million 20- to 300-kb genomic DNA fragments. Analysis of the human genome NA12878 with stLFR demonstrated high-quality variant calling and phase block lengths up to N50 34 Mb. We also demonstrate detection of complex structural variants and complete diploid de novo assembly of NA12878. These analyses were all performed using single stLFR libraries, and their construction did not significantly add to the time or cost of whole-genome sequencing (WGS) library preparation. stLFR represents an easily automatable solution that enables high-quality sequencing, phasing, SV detection, scaffolding, cost-effective diploid de novo genome assembly, and other long DNA sequencing applications.
Collapse
Affiliation(s)
- Ou Wang
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China.,Department of Biology, Laboratory of Genomics and Molecular Biomedicine, University of Copenhagen, 2100 Copenhagen, Denmark
| | - Robert Chin
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Xiaofang Cheng
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Michelle Ka Yan Wu
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Qing Mao
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | | | - Yuhui Sun
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Ellis Anderson
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Han K Lam
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Dan Chen
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Yujun Zhou
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Linying Wang
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Fei Fan
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Yan Zou
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | | | - Rebecca Yu Zhang
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Snezana Drmanac
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Darlene Nguyen
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Chongjun Xu
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China.,Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Christian Villarosa
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Scott Gablenz
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Nina Barua
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Staci Nguyen
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Wenlan Tian
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Jia Sophie Liu
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Jingwan Wang
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Xiao Liu
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Xiaojuan Qi
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Ao Chen
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - He Wang
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Yuliang Dong
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Wenwei Zhang
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Andrei Alexeev
- Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou 310058, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou 310058, China
| | - Karsten Kristiansen
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China.,Department of Biology, Laboratory of Genomics and Molecular Biomedicine, University of Copenhagen, 2100 Copenhagen, Denmark
| | - Xun Xu
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Radoje Drmanac
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China.,Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA.,MGI, BGI-Shenzhen, Shenzhen 518083, China
| | - Brock A Peters
- BGI-Shenzhen, Shenzhen 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China.,Advanced Genomics Technology Laboratory, Complete Genomics Incorporated, San Jose, California 95134, USA.,MGI, BGI-Shenzhen, Shenzhen 518083, China
| |
Collapse
|
3
|
A simple dialysis device for large DNA molecules. Biotechniques 2019; 66:93-95. [PMID: 30744406 DOI: 10.2144/btn-2018-0133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The potential of genomic DNA is realized when new modalities are invented that manipulate large DNAs with minimal breakage or loss of sample. Here, we describe a polydimethylsiloxane-polycarbonate membrane device to remove small molecules from a sample while retaining large DNAs. Dialysis rates dramatically change as DNA size in kb (M) increases and DNA dimensions become comparable to pore size, and chain characteristics go from rod-like to Gaussian. Consequently, we describe empirical rates of dialysis, R, as a function of M as falling into two regimes: DNAs ≤ 1 kb show R(M) ∼e - t/τ M (t = time, τM = time constant), while DNAs ≥1.65 kb slowly passage with R(M) ∼M -1.68; such partitioning potentiates single-molecule imaging.
Collapse
|
4
|
Abelleyro MM, Marchione VD, Palmitelli M, Radic CP, Neme D, Larripa IB, Medina-Acosta E, De Brasi CD, Rossetti LC. Inverse PCR to perform long-distance haplotyping: main applications to improve preimplantation genetic diagnosis in hemophilia. Eur J Hum Genet 2019; 27:603-611. [PMID: 30626931 DOI: 10.1038/s41431-018-0334-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 11/28/2018] [Accepted: 12/04/2018] [Indexed: 12/30/2022] Open
Abstract
Among other applications of long-distance haplotype phasing in clinical genetics, determination of linked DNA markers as surrogate for problematic structural variants (e.g., repeat-mediated rearrangements) is essential to perform diagnosis from low-quality DNA samples. We describe a next-of-kin-independent (physical) phasing approach based on inverse-PCR (iPCR) paired-end amplification (PI). This method enables typing the multialleles of the short tandem repeat (STR) F8Int21[CA]n at the F8-intron 21, as a surrogate DNA marker for the F8-intron 22 inversion (Inv22), the hemophilia A-causative hotspot, within the transmitted haplotype in informative carriers. We provide proof-of-concept by blindly validating the PI approach in 15 carrier mother/affected-son duos. Every F8Int21[CA]n STR allele determined in phase with the Inv22 allele in the female carriers from the informative duos was confirmed in the hemizygous proband (P = 0.00003). A second surrogate STR locus at the F8-IVS22 was obtained by the PI approach improving severe-HA preimplantation genetic diagnosis by augmenting heterozygosity in Inv22 carriers bypassing the requirement for family linkage analysis. The ability of the PI-assay to combine other marker pairs was demonstrated by haplotyping a SNV (F8:c.6118T > C) with a >28kb-distant F8-IVS22 STR. The PI approach has proven flexibility to target different marker pairs and has potential for multiplex characterization of iPCR products by massively parallel sequencing.
Collapse
Affiliation(s)
- Miguel Martín Abelleyro
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina
| | - Vanina Daniela Marchione
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina
| | - Micaela Palmitelli
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina
| | - Claudia Pamela Radic
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina
| | - Daniela Neme
- Fundación de la Hemofilia Alfredo Pavlovsky, Buenos Aires, Argentina
| | - Irene Beatriz Larripa
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina
| | - Enrique Medina-Acosta
- Universidade Estadual do Norte Fluminense Darcy Ribeiro, Centro de Biociências e Biotecnologia, Laboratório de Biotecnologia, Núcleo de Diagnóstico e Investigação Molecular, Campos dos Goytacazes, Rio de Janeiro, Brazil
| | - Carlos Daniel De Brasi
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina.,Instituto de Investigaciones Hematológicas Mariano R Castex, Academia Nacional de Medicina, Buenos Aires, Argentina
| | - Liliana Carmen Rossetti
- Instituto de Medicina Experimental (IMEX), CONICET-Academia Nacional de Medicina, Buenos Aires, Argentina.
| |
Collapse
|
5
|
Wang H, Wang H, Duan X, Liu C, Li Z. Digital quantitative analysis of microRNA in single cell based on ligation-depended polymerase colony (Polony). Biosens Bioelectron 2017; 95:146-151. [DOI: 10.1016/j.bios.2017.04.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Revised: 04/03/2017] [Accepted: 04/06/2017] [Indexed: 12/16/2022]
|
6
|
Fan TW, Yu HLL, Hsing IM. Conditional Displacement Hybridization Assay for Multiple SNP Phasing. Anal Chem 2017; 89:9961-9966. [DOI: 10.1021/acs.analchem.7b02300] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Tsz Wing Fan
- Department
of Chemical and Biomolecular Engineering and ‡Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Henson L. Lee Yu
- Department
of Chemical and Biomolecular Engineering and ‡Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - I-Ming Hsing
- Department
of Chemical and Biomolecular Engineering and ‡Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| |
Collapse
|
7
|
Zhang H, Chao J, Pan D, Liu H, Qiang Y, Liu K, Cui C, Chen J, Huang Q, Hu J, Wang L, Huang W, Shi Y, Fan C. DNA origami-based shape IDs for single-molecule nanomechanical genotyping. Nat Commun 2017; 8:14738. [PMID: 28382928 PMCID: PMC5384221 DOI: 10.1038/ncomms14738] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Accepted: 01/27/2017] [Indexed: 01/28/2023] Open
Abstract
Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ∼10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level.
Collapse
Affiliation(s)
- Honglu Zhang
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Jie Chao
- Key Laboratory for Organic Electronics and Information Displays (KLOEID), Institute of Advanced Materials (IAM), School of Materials Science and Engineering, Nanjing University of Posts and Telecommunications, 9 Wenyuan Road, Nanjing 210046, China
| | - Dun Pan
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Huajie Liu
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
| | - Yu Qiang
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Ke Liu
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Chengjun Cui
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
| | - Jianhua Chen
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Qing Huang
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
| | - Jun Hu
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
| | - Lianhui Wang
- Key Laboratory for Organic Electronics and Information Displays (KLOEID), Institute of Advanced Materials (IAM), School of Materials Science and Engineering, Nanjing University of Posts and Telecommunications, 9 Wenyuan Road, Nanjing 210046, China
| | - Wei Huang
- Key Laboratory for Organic Electronics and Information Displays (KLOEID), Institute of Advanced Materials (IAM), School of Materials Science and Engineering, Nanjing University of Posts and Telecommunications, 9 Wenyuan Road, Nanjing 210046, China
| | - Yongyong Shi
- Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Chunhai Fan
- Division of Physical Biology and Bioimaging Center, Shanghai Synchrotron Radiation Facility, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, PO Box 800-204, Shanghai 201800, China
| |
Collapse
|
8
|
Hongzhou C, Shuping G, Wenju W, Li L, Lulu W, Linjun D, Jingmin L, Xiaoli R, Li B. Lab-on-a-chip technologies for genodermatoses: Recent progress and future perspectives. J Dermatol Sci 2017; 85:71-76. [DOI: 10.1016/j.jdermsci.2016.09.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Revised: 08/19/2016] [Accepted: 09/05/2016] [Indexed: 10/21/2022]
|
9
|
Ma L, Li W, Song Q. Chromosome-Range Whole-Genome High-Throughput Experimental Haplotyping by Single-Chromosome Microdissection. Methods Mol Biol 2017; 1551:161-169. [PMID: 28138846 PMCID: PMC6372095 DOI: 10.1007/978-1-4939-6750-6_9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Haplotype is fundamental genetic information; it provides essential information for deciphering the functional and etiological roles of genetic variants. As haplotype information is closely related to the functional and etiological impact of genetic variants, it is widely anticipated that haplotype information will be extremely valuable in a wide spectra of applications, including academic research, clinical diagnosis of genetic disease and in the pharmaceutical industry. Haplotyping is essential for LD (linkage disequilibrium) mapping, functional studies on cis-interactions, big data imputation, association studies, population studies, and evolutionary studies. Unfortunately, current sequencing technologies and genotyping arrays do not routinely deliver this information for each individual, but yield only unphased genotypes. Here, we describe a high-throughput and cost-effective experimental protocol to obtain high-resolution chromosomal haplotypes of each individual diploid (including human) genome by the single-chromosome microdissection and sequencing approach.
Collapse
Affiliation(s)
- Li Ma
- 4DGenome Inc, Atlanta, GA, USA. ,Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, 720 Westview Dr SW, Atlanta, GA, 30310, USA.
| | - Wenzhi Li
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, 720 Westview Dr SW, Atlanta, GA, 30310, USA.,Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Qing Song
- 4DGenome Inc, Atlanta, GA, USA. ,Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, 720 Westview Dr SW, Atlanta, GA, 30310, USA. ,Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China. ,Research Wing D-203, 720 Westview Drive, Atlanta, GA, 30310, USA.
| |
Collapse
|
10
|
Patel A, Edge P, Selvaraj S, Bansal V, Bafna V. InPhaDel: integrative shotgun and proximity-ligation sequencing to phase deletions with single nucleotide polymorphisms. Nucleic Acids Res 2016; 44:e111. [PMID: 27105843 PMCID: PMC4937317 DOI: 10.1093/nar/gkw281] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 04/06/2016] [Indexed: 11/23/2022] Open
Abstract
Phasing of single nucleotide (SNV), and structural variations into chromosome-wide haplotypes in humans has been challenging, and required either trio sequencing or restricting phasing to population-based haplotypes. Selvaraj et al. demonstrated single individual SNV phasing is possible with proximity ligated (HiC) sequencing. Here, we demonstrate HiC can phase structural variants into phased scaffolds of SNVs. Since HiC data is noisy, and SV calling is challenging, we applied a range of supervised classification techniques, including Support Vector Machines and Random Forest, to phase deletions. Our approach was demonstrated on deletion calls and phasings on the NA12878 human genome. We used three NA12878 chromosomes and simulated chromosomes to train model parameters. The remaining NA12878 chromosomes withheld from training were used to evaluate phasing accuracy. Random Forest had the highest accuracy and correctly phased 86% of the deletions with allele-specific read evidence. Allele-specific read evidence was found for 76% of the deletions. HiC provides significant read evidence for accurately phasing 33% of the deletions. Also, eight of eight top ranked deletions phased by only HiC were validated using long range polymerase chain reaction and Sanger. Thus, deletions from a single individual can be accurately phased using a combination of shotgun and proximity ligation sequencing. InPhaDel software is available at: http://l337x911.github.io/inphadel/.
Collapse
Affiliation(s)
- Anand Patel
- Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Peter Edge
- Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Siddarth Selvaraj
- Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Vikas Bansal
- Department of Pediatrics, School of Medicine, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Vineet Bafna
- Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| |
Collapse
|
11
|
A review on continuous-flow microfluidic PCR in droplets: Advances, challenges and future. Anal Chim Acta 2016; 914:7-16. [DOI: 10.1016/j.aca.2016.02.006] [Citation(s) in RCA: 98] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Revised: 01/20/2016] [Accepted: 02/04/2016] [Indexed: 12/23/2022]
|
12
|
Huang L, Ma F, Chapman A, Lu S, Xie XS. Single-Cell Whole-Genome Amplification and Sequencing: Methodology and Applications. Annu Rev Genomics Hum Genet 2015; 16:79-102. [PMID: 26077818 DOI: 10.1146/annurev-genom-090413-025352] [Citation(s) in RCA: 265] [Impact Index Per Article: 29.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
We present a survey of single-cell whole-genome amplification (WGA) methods, including degenerate oligonucleotide-primed polymerase chain reaction (DOP-PCR), multiple displacement amplification (MDA), and multiple annealing and looping-based amplification cycles (MALBAC). The key parameters to characterize the performance of these methods are defined, including genome coverage, uniformity, reproducibility, unmappable rates, chimera rates, allele dropout rates, false positive rates for calling single-nucleotide variations, and ability to call copy-number variations. Using these parameters, we compare five commercial WGA kits by performing deep sequencing of multiple single cells. We also discuss several major applications of single-cell genomics, including studies of whole-genome de novo mutation rates, the early evolution of cancer genomes, circulating tumor cells (CTCs), meiotic recombination of germ cells, preimplantation genetic diagnosis (PGD), and preimplantation genomic screening (PGS) for in vitro-fertilized embryos.
Collapse
Affiliation(s)
- Lei Huang
- Biodynamic Optical Imaging Center (BIOPIC), School of Life Sciences, Peking University, Beijing 100871, China
| | | | | | | | | |
Collapse
|
13
|
Rivas MA, Pirinen M, Conrad DF, Lek M, Tsang EK, Karczewski KJ, Maller JB, Kukurba KR, DeLuca DS, Fromer M, Ferreira PG, Smith KS, Zhang R, Zhao F, Banks E, Poplin R, Ruderfer DM, Purcell SM, Tukiainen T, Minikel EV, Stenson PD, Cooper DN, Huang KH, Sullivan TJ, Nedzel J, Bustamante CD, Li JB, Daly MJ, Guigo R, Donnelly P, Ardlie K, Sammeth M, Dermitzakis ET, McCarthy MI, Montgomery SB, Lappalainen T, MacArthur DG. Human genomics. Effect of predicted protein-truncating genetic variants on the human transcriptome. Science 2015; 348:666-9. [PMID: 25954003 PMCID: PMC4537935 DOI: 10.1126/science.1261877] [Citation(s) in RCA: 196] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Accurate prediction of the functional effect of genetic variation is critical for clinical genome interpretation. We systematically characterized the transcriptome effects of protein-truncating variants, a class of variants expected to have profound effects on gene function, using data from the Genotype-Tissue Expression (GTEx) and Geuvadis projects. We quantitated tissue-specific and positional effects on nonsense-mediated transcript decay and present an improved predictive model for this decay. We directly measured the effect of variants both proximal and distal to splice junctions. Furthermore, we found that robustness to heterozygous gene inactivation is not due to dosage compensation. Our results illustrate the value of transcriptome data in the functional interpretation of genetic variants.
Collapse
Affiliation(s)
- Manuel A Rivas
- Wellcome Trust Centre for Human Genetics, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK.
| | - Matti Pirinen
- FInstitute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | | | - Monkol Lek
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Emily K Tsang
- Department of Genetics, Stanford University, Stanford, CA, USA. Department of Pathology, Stanford University, Stanford, CA, USA. Biomedical Informatics Program, Stanford University, Stanford, CA, USA
| | - Konrad J Karczewski
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Julian B Maller
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Kimberly R Kukurba
- Department of Genetics, Stanford University, Stanford, CA, USA. Department of Pathology, Stanford University, Stanford, CA, USA
| | | | - Menachem Fromer
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA. Department of Psychiatry, Mt. Sinai Hospital, NY, USA
| | - Pedro G Ferreira
- Department of Genetic Medicine and Development,University of Geneva, Geneva, Switzerland. Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, Switzerland. Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Kevin S Smith
- Department of Genetics, Stanford University, Stanford, CA, USA. Department of Pathology, Stanford University, Stanford, CA, USA
| | - Rui Zhang
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Fengmei Zhao
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Eric Banks
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ryan Poplin
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Douglas M Ruderfer
- Department of Psychiatry, Mt. Sinai Hospital, NY, USA. Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, NY, USA
| | - Shaun M Purcell
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA. Department of Psychiatry, Mt. Sinai Hospital, NY, USA. Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, NY, USA
| | - Taru Tukiainen
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Eric V Minikel
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Peter D Stenson
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, UK
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, UK
| | | | | | - Jared Nedzel
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Jin Billy Li
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Mark J Daly
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Roderic Guigo
- Center for Genomic Regulation (CRG), Universitat Pompeu Fabra (UPF), Barcelona, Catalonia, Spain
| | - Peter Donnelly
- Wellcome Trust Centre for Human Genetics, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK. Department of Statistics, University of Oxford, Oxford, UK
| | | | - Michael Sammeth
- Center for Genomic Regulation (CRG), Universitat Pompeu Fabra (UPF), Barcelona, Catalonia, Spain. National Institute for Scientific Computing (LNCC), Petropolis, Rio de Janeiro, Brazil
| | - Emmanouil T Dermitzakis
- Department of Genetic Medicine and Development,University of Geneva, Geneva, Switzerland. Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, Switzerland. Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Mark I McCarthy
- Wellcome Trust Centre for Human Genetics, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK. Oxford Center for Diabetes Endocrinology and Metabolism, University of Oxford, Oxford, UK
| | - Stephen B Montgomery
- Department of Genetics, Stanford University, Stanford, CA, USA. Department of Pathology, Stanford University, Stanford, CA, USA
| | - Tuuli Lappalainen
- Department of Genetics, Stanford University, Stanford, CA, USA. Department of Genetic Medicine and Development,University of Geneva, Geneva, Switzerland. Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, Switzerland. Swiss Institute of Bioinformatics, Geneva, Switzerland. New York Genome Center, New York, NY, USA. Department of Systems Biology, Columbia University, New York, NY, USA.
| | - Daniel G MacArthur
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA. Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
14
|
Abstract
Human genomes are diploid and, for their complete description and interpretation, it is necessary not only to discover the variation they contain but also to arrange it onto chromosomal haplotypes. Although whole-genome sequencing is becoming increasingly routine, nearly all such individual genomes are mostly unresolved with respect to haplotype, particularly for rare alleles, which remain poorly resolved by inferential methods. Here, we review emerging technologies for experimentally resolving (that is, 'phasing') haplotypes across individual whole-genome sequences. We also discuss computational methods relevant to their implementation, metrics for assessing their accuracy and completeness, and the relevance of haplotype information to applications of genome sequencing in research and clinical medicine.
Collapse
|
15
|
Zhang R, Li X, Ramaswami G, Smith KS, Turecki G, Montgomery SB, Li JB. Quantifying RNA allelic ratios by microfluidic multiplex PCR and sequencing. Nat Methods 2013; 11:51-4. [PMID: 24270603 PMCID: PMC3877737 DOI: 10.1038/nmeth.2736] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Accepted: 10/18/2013] [Indexed: 11/16/2022]
Abstract
We developed a targeted RNA sequencing method that couples microfluidics-based multiplex PCR and deep sequencing (mmPCR-seq) to uniformly and simultaneously amplify up to 960 loci in 48 samples independently of their gene expression levels, and accurately and cost-effectively measure allelic ratios even for low-quantity or low-quality RNA samples. We applied mmPCR-seq to RNA editing and allele-specific expression studies. mmPCR-seq complements RNA-seq and provides a highly desirable solution for future applications.
Collapse
Affiliation(s)
- Rui Zhang
- Department of Genetics, Stanford University, Stanford, California, USA
| | - Xin Li
- Department of Pathology, Stanford University, Stanford, California, USA
| | - Gokul Ramaswami
- Department of Genetics, Stanford University, Stanford, California, USA
| | - Kevin S Smith
- Department of Pathology, Stanford University, Stanford, California, USA
| | - Gustavo Turecki
- McGill Group for Suicide Studies, Douglas Mental Health University Institute, McGill University, Montreal, Quebec, Canada
| | - Stephen B Montgomery
- 1] Department of Genetics, Stanford University, Stanford, California, USA. [2] Department of Pathology, Stanford University, Stanford, California, USA
| | - Jin Billy Li
- Department of Genetics, Stanford University, Stanford, California, USA
| |
Collapse
|
16
|
Kirkness EF, Grindberg RV, Yee-Greenbaum J, Marshall CR, Scherer SW, Lasken RS, Venter JC. Sequencing of isolated sperm cells for direct haplotyping of a human genome. Genome Res 2013; 23:826-32. [PMID: 23282328 PMCID: PMC3638138 DOI: 10.1101/gr.144600.112] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
There is increasing evidence that the phenotypic effects of genomic sequence variants are best understood in terms of variant haplotypes rather than as isolated polymorphisms. Haplotype analysis is also critically important for uncovering population histories and for the study of evolutionary genetics. Although the sequencing of individual human genomes to reveal personal collections of sequence variants is now well established, there has been slower progress in the phasing of these variants into pairs of haplotypes along each pair of chromosomes. Here, we have developed a distinct approach to haplotyping that can yield chromosome-length haplotypes, including the vast majority of heterozygous single-nucleotide polymorphisms (SNPs) in an individual human genome. This approach exploits the haploid nature of sperm cells and employs a combination of genotyping and low-coverage sequencing on a short-read platform. In addition to generating chromosome-length haplotypes, the approach can directly identify recombination events (averaging 1.1 per chromosome) with a median resolution of <100 kb.
Collapse
|
17
|
Ruegger PM, Bent E, Li W, Jeske DR, Cui X, Braun J, Jiang T, Borneman J. Improving oligonucleotide fingerprinting of rRNA genes by implementation of polony microarray technology. J Microbiol Methods 2012; 90:235-40. [PMID: 22640891 PMCID: PMC3404216 DOI: 10.1016/j.mimet.2012.05.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Revised: 05/20/2012] [Accepted: 05/20/2012] [Indexed: 10/28/2022]
Abstract
Improvements to oligonucleotide fingerprinting of rRNA genes (OFRG) were obtained by implementing polony microarray technology. OFRG is an array-based method for analyzing microbial community composition. Polonies are discrete clusters of DNA, produced by solid-phase PCR in hydrogels, and derived from individual, spatially isolated DNA molecules. The advantages of a polony-based OFRG method include higher throughput and reductions in the PCR-induced errors and compositional skew inherent in all other PCR-based community composition methods, including high-throughput sequencing of rRNA genes. Given the similarities between polony microarrays and certain aspects of sequencing methods such as the Illumina platform, we suggest that if concepts presented in this study were implemented in high-throughput sequencing protocols, a reduction of PCR-induced errors and compositional skew may be realized.
Collapse
Affiliation(s)
- Paul M. Ruegger
- Department of Plant Pathology and Microbiology, University of California, Riverside, CA, USA
| | - Elizabeth Bent
- Department of Plant Pathology and Microbiology, University of California, Riverside, CA, USA
| | - Wei Li
- Department of Computer Science, University of California, Riverside, CA, USA
| | - Daniel R. Jeske
- Department of Statistics, University of California, Riverside, CA, USA
| | - Xinping Cui
- Department of Statistics, University of California, Riverside, CA, USA
| | - Jonathan Braun
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
| | - Tao Jiang
- Department of Computer Science, University of California, Riverside, CA, USA
| | - James Borneman
- Department of Plant Pathology and Microbiology, University of California, Riverside, CA, USA
| |
Collapse
|
18
|
Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells. Nature 2012; 487:190-5. [PMID: 22785314 PMCID: PMC3397394 DOI: 10.1038/nature11236] [Citation(s) in RCA: 207] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2012] [Accepted: 05/15/2012] [Indexed: 12/16/2022]
Abstract
Recent advances in whole genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, Long Fragment Read (LFR) technology, similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ~100 pg of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants (SNVs) were assembled into long haplotype contigs. Removal of false positive SNVs not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 Mb. Cost-effective and accurate genome sequencing and haplotyping from 10-20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.
Collapse
|
19
|
Taly V, Pekin D, Abed AE, Laurent-Puig P. Detecting biomarkers with microdroplet technology. Trends Mol Med 2012; 18:405-16. [DOI: 10.1016/j.molmed.2012.05.001] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Revised: 05/07/2012] [Accepted: 05/07/2012] [Indexed: 12/15/2022]
|
20
|
Perry RT, Dwivedi H, Aissani B. A Simple PCR-RFLP Method for Genetic Phase Determination in Compound Heterozygotes. Front Genet 2012; 2:108. [PMID: 22303402 PMCID: PMC3268647 DOI: 10.3389/fgene.2011.00108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 12/22/2011] [Indexed: 11/13/2022] Open
Abstract
When susceptibility to diseases is caused by cis-effects of multiple alleles at adjacent polymorphic sites, it may be difficult to assess with confidence the genetic phase and identify individuals carrying the risk haplotype. Experimental assessment of genetic phase is still challenging and most population studies use statistical approaches to infer haplotypes given the observed genotypes. While these statistical approaches are powerful and have been proven very useful in large scale genetic population studies, they may be prone to errors in studies with small sample size, especially in the presence of compound heterozygotes. Here, we describe a simple and novel approach using the popular PCR-RFLP based strategy to assess the genetic phase in compound heterozygotes. We apply this method to two extensively studied SNPs in two clustered immune-related genes: The -308 (G > A) and the +252 (A > G) SNPs of the tumor necrosis factor (TNF) alpha and the lymphotoxin alpha (LTA) genes, respectively. Using this method, we successfully determined the genetic phase of these two SNPs in known compound heterozygous individuals and in every sample tested. We show that the A allele of TNF -308 is carried on the same chromosome as the LTA +252(G) allele.
Collapse
Affiliation(s)
- Rodney T Perry
- Department of Epidemiology, University of Alabama at Birmingham Birmingham, AL, USA
| | | | | |
Collapse
|
21
|
Suk EK, McEwen GK, Duitama J, Nowick K, Schulz S, Palczewski S, Schreiber S, Holloway DT, McLaughlin S, Peckham H, Lee C, Huebsch T, Hoehe MR. A comprehensively molecular haplotype-resolved genome of a European individual. Genome Res 2011; 21:1672-85. [PMID: 21813624 DOI: 10.1101/gr.125047.111] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, "Max Planck One" (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ~1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying "haploid landscapes," which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for "phase-sensitive" personal genomics. MP1's annotated haploid genomes are available as a public resource.
Collapse
Affiliation(s)
- Eun-Kyung Suk
- Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Chetverina EV, Chetverin AB. Nanocolonies and diagnostics of oncological diseases associated with chromosomal translocations. BIOCHEMISTRY (MOSCOW) 2011; 75:1667-91. [DOI: 10.1134/s0006297910130109] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
23
|
Whole-genome molecular haplotyping of single cells. Nat Biotechnol 2010; 29:51-7. [PMID: 21170043 DOI: 10.1038/nbt.1739] [Citation(s) in RCA: 274] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2010] [Accepted: 11/24/2010] [Indexed: 01/22/2023]
Abstract
Conventional experimental methods of studying the human genome are limited by the inability to independently study the combination of alleles, or haplotype, on each of the homologous copies of the chromosomes. We developed a microfluidic device capable of separating and amplifying homologous copies of each chromosome from a single human metaphase cell. Single-nucleotide polymorphism (SNP) array analysis of amplified DNA enabled us to achieve completely deterministic, whole-genome, personal haplotypes of four individuals, including a HapMap trio with European ancestry (CEU) and an unrelated European individual. The phases of alleles were determined at ∼99.8% accuracy for up to ∼96% of all assayed SNPs. We demonstrate several practical applications, including direct observation of recombination events in a family trio, deterministic phasing of deletions in individuals and direct measurement of the human leukocyte antigen haplotypes of an individual. Our approach has potential applications in personal genomics, single-cell genomics and statistical genetics.
Collapse
|
24
|
Abstract
The two haploid genome sequences that a person inherits from the two parents represent the most fundamentally useful type of genetic information for the study of heritable diseases and the development of personalized medicine. Because of the difficulty in obtaining long-range phase information, current sequencing methods are unable to provide this information. Here, we introduce and show feasibility of a scalable approach capable of generating genomic sequences completely phased across the entire chromosome.
Collapse
|
25
|
Zhang C, Xing D. Single-Molecule DNA Amplification and Analysis Using Microfluidics. Chem Rev 2010; 110:4910-47. [DOI: 10.1021/cr900081z] [Citation(s) in RCA: 115] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Chunsun Zhang
- MOE Key Laboratory of Laser Life Science & Institute of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou 510631, China
| | - Da Xing
- MOE Key Laboratory of Laser Life Science & Institute of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou 510631, China
| |
Collapse
|
26
|
Ma L, Xiao Y, Huang H, Wang Q, Rao W, Feng Y, Zhang K, Song Q. Direct determination of molecular haplotypes by chromosome microdissection. Nat Methods 2010; 7:299-301. [PMID: 20305652 PMCID: PMC2871314 DOI: 10.1038/nmeth.1443] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2009] [Accepted: 02/15/2010] [Indexed: 12/02/2022]
Abstract
Direct observation of haplotypes is still technical challenging. Here we report a method for the determination of haplotypes through chromosome microdissection. We determine human haplotypes with more than 98.85% accuracy at 24,245 heterozygous single-nucleotide polymorphism (SNP) loci in genome-wide chromosome-range phasing distance.
Collapse
Affiliation(s)
- Li Ma
- Cardiovascular Research Institute, Morehouse School of Medicine, Atlanta, Georgia, USA
| | | | | | | | | | | | | | | |
Collapse
|
27
|
Snyder M, Du J, Gerstein M. Personal genome sequencing: current approaches and challenges. Genes Dev 2010; 24:423-31. [PMID: 20194435 DOI: 10.1101/gad.1864110] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The revolution in DNA sequencing technologies has now made it feasible to determine the genome sequences of many individuals; i.e., "personal genomes." Genome sequences of cells and tissues from both normal and disease states have been determined. Using current approaches, whole human genome sequences are not typically assembled and determined de novo, but, instead, variations relative to a reference sequence are identified. We discuss the current state of personal genome sequencing, the main steps involved in determining a genome sequence (i.e., identifying single-nucleotide polymorphisms [SNPs] and structural variations [SVs], assembling new sequences, and phasing haplotypes), and the challenges and performance metrics for evaluating the accuracy of the reconstruction. Finally, we consider the possible individual and societal benefits of personal genome sequences.
Collapse
Affiliation(s)
- Michael Snyder
- Department of Genetics, Stanford University School of Medicine, California 94305, USA.
| | | | | |
Collapse
|
28
|
Abstract
The term 'single-molecule genomics' (SMG) describes a group of molecular methods in which single molecules are detected or sequenced. The focus on the analysis of individual molecules distinguishes these techniques from more traditional methods, in which template DNA is cloned or PCR-amplified prior to analysis. Although technically challenging, the analysis of single molecules has the potential to play a major role in the delivery of truly personalized medicine. The two main subgroups of SMG methods are single-molecule digital PCR and single-molecule sequencing. Single-molecule PCR has a number of advantages over competing technologies, including improved detection of rare genetic variants and more precise analysis of copy-number variation, and is more easily adapted to the often small amount of material that is available in clinical samples. Single-molecule sequencing refers to a number of different methods that are mainly still in development but have the potential to make a huge impact on personalized medicine in the future.
Collapse
Affiliation(s)
- Frank McCaughan
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK.
| | | |
Collapse
|
29
|
Abstract
Whole genome sequencing provides the most comprehensive collection of an individual's genetic variation. With the falling costs of sequencing technology, we envision paradigm shift from microarray-based genotyping studies to whole genome sequencing. We review methodologies for whole genome sequencing. There are two approaches for assembling short shotgun sequence reads into longer contiguous genomic sequences. In the de novo assembly approach, sequence reads are compared to each other, and then overlapped to build longer contiguous sequences. The reference-based assembly approach involves mapping each read to a reference genome sequence. We discuss methods for identifying genetic variation (single nucleotide polymorphisms, small indels, and copy number variants) and building haplotypes from genome assemblies, and discuss potential pitfalls. We expect methodologies to evolve rapidly as sequencing technologies improve and more human genomes are sequenced.
Collapse
Affiliation(s)
- Pauline C Ng
- The J. Craig Venter Institute, Rockville, MD, USA
| | | |
Collapse
|
30
|
Fuller CW, Middendorf LR, Benner SA, Church GM, Harris T, Huang X, Jovanovich SB, Nelson JR, Schloss JA, Schwartz DC, Vezenov DV. The challenges of sequencing by synthesis. Nat Biotechnol 2009; 27:1013-23. [PMID: 19898456 DOI: 10.1038/nbt.1585] [Citation(s) in RCA: 159] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
DNA sequencing-by-synthesis (SBS) technology, using a polymerase or ligase enzyme as its core biochemistry, has already been incorporated in several second-generation DNA sequencing systems with significant performance. Notwithstanding the substantial success of these SBS platforms, challenges continue to limit the ability to reduce the cost of sequencing a human genome to $100,000 or less. Achieving dramatically reduced cost with enhanced throughput and quality will require the seamless integration of scientific and technological effort across disciplines within biochemistry, chemistry, physics and engineering. The challenges include sample preparation, surface chemistry, fluorescent labels, optimizing the enzyme-substrate system, optics, instrumentation, understanding tradeoffs of throughput versus accuracy, and read-length/phasing limitations. By framing these challenges in a manner accessible to a broad community of scientists and engineers, we hope to solicit input from the broader research community on means of accelerating the advancement of genome sequencing technology.
Collapse
Affiliation(s)
- Carl W Fuller
- GE Healthcare Life Sciences, Piscataway, New Jersey, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Abstract
The past few years have seen enormous advances in genotyping technology, including chips that accommodate in excess of 1 million SNP assays. In addition, the cost per genotype has been driven down to levels unimagined only a few years ago. These developments have resulted in an explosion of positive whole-genome association studies and the identification of many new genes for common diseases. Here I review high-throughput genotyping platforms as well as other approaches for lower numbers of assays but high sample throughput, which play an important role in genotype validation and study replication. Further, the utility of SNP arrays for detecting structural variation through the development of genotyping algorithms is reviewed and methods for long-range haplotyping are presented. It is anticipated that in the future, sample throughput and cost savings will be increased further through the combination of automation, microfluidics, and nanotechnologies.
Collapse
Affiliation(s)
- Jiannis Ragoussis
- Genomics Laboratory, Wellcome Trust Centre for Human Genetics, Oxford University, Oxford OX3 7BN, United Kingdom.
| |
Collapse
|
32
|
Sinoquet C. Iterative two-pass algorithm for missing data imputation in SNP arrays. J Bioinform Comput Biol 2009; 7:833-52. [PMID: 19785048 DOI: 10.1142/s0219720009004357] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Revised: 04/10/2009] [Accepted: 05/21/2009] [Indexed: 11/18/2022]
Abstract
Though nowadays high-throughput genotyping techniques' quality improves, missing data still remains fairly common. Studies have shown that even a low percentage of missing SNPs is detrimental to the reliability of down-stream analyses such as SNP-disease association tests. This paper investigates the potentiality for improving the accuracy of an SNP inference method based on the algorithm formerly designed by Roberts and co-workers (NPUTE, 2007). This initial algorithm performs a single scan of an SNP array, inferring missing SNPs in the context of sliding windows. We have first designed a variant, KNNWinOpti, which fully exploits backward and forward dependencies between the overlapping windows and thus restores the genuine dependency of inference upon direction scanning. Our major contribution, algorithm SNPShuttle, therefore iterates bi-directional scanning to predict SNP values with more confidence. We have run simulations on realistic benchmarks built after the high resolution map of mouse strains published by the Perlegen Project. For each of the 20 mouse chromosomes and for missing data percentage varying in range 5%-30%, SNPShuttle has always been shown to increase yet high KNNWinOpti's accuracies.
Collapse
Affiliation(s)
- Christine Sinoquet
- Computer Science Institute of Nantes-Atlantic (Lina), U.M.R. C.N.R.S. 6241, University of Nantes, 2 rue de la Houssinière, BP 92208, 44322 Nantes Cedex, France.
| |
Collapse
|
33
|
Estimating population haplotype frequencies from pooled DNA samples using PHASE algorithm. Genet Res (Camb) 2009; 90:509-24. [PMID: 19123969 DOI: 10.1017/s0016672308009877] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Recent studies show that the PHASE algorithm is a state-of-the-art method for population-based haplotyping from individually genotyped data. We present a modified version of PHASE for estimating population haplotype frequencies from pooled DNA data. The algorithm is compared with (i) a maximum likelihood estimation under the multinomial model and (ii) a deterministic greedy algorithm, on both simulated and real data sets (HapMap data). Our results suggest that the PHASE algorithm is a method of choice also on pooled DNA data. The main reason for improvement over the other approaches is assumed to be the same as with individually genotyped data: the biologically motivated model of PHASE takes into account correlated genealogical histories of the haplotypes by modelling mutations and recombinations. The important questions of efficiency of DNA pooling as well as influence of the pool size on the accuracy of the estimates are also considered. Our results are in line with the earlier findings in that the pool size should be relatively small, only 2-5 individuals in our examples, in order to provide reliable estimates of population haplotype frequencies.
Collapse
|
34
|
Tam JM, Song L, Walt DR. DNA detection on ultrahigh-density optical fiber-based nanoarrays. Biosens Bioelectron 2009; 24:2488-93. [DOI: 10.1016/j.bios.2008.12.034] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2008] [Revised: 12/09/2008] [Accepted: 12/22/2008] [Indexed: 10/21/2022]
|
35
|
Zhang Y, Ozdemir P. Microfluidic DNA amplification--a review. Anal Chim Acta 2009; 638:115-25. [PMID: 19327449 DOI: 10.1016/j.aca.2009.02.038] [Citation(s) in RCA: 259] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2009] [Revised: 02/03/2009] [Accepted: 02/20/2009] [Indexed: 11/17/2022]
Abstract
The application of microfluidic devices for DNA amplification has recently been extensively studied. Here, we review the important development of microfluidic polymerase chain reaction (PCR) devices and discuss the underlying physical principles for the optimal design and operation of the device. In particular, we focus on continuous-flow microfluidic PCR on-chip, which can be readily implemented as an integrated function of a micro-total-analysis system. To overcome sample carryover contamination and surface adsorption associated with microfluidic PCR, microdroplet technology has recently been utilized to perform PCR in droplets, which can eliminate the synthesis of short chimeric products, shorten thermal-cycling time, and offers great potential for single DNA molecule and single-cell amplification. The work on chip-based PCR in droplets is highlighted.
Collapse
Affiliation(s)
- Yonghao Zhang
- Department of Mechanical Engineering, University of Strathclyde, Glasgow, G1 1XJ, UK.
| | | |
Collapse
|
36
|
Tang J, Xiao P. Polymerizing immobilization of acrylamide-modified nucleic acids and its application. Biosens Bioelectron 2009; 24:1817-24. [DOI: 10.1016/j.bios.2008.09.018] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Revised: 09/17/2008] [Accepted: 09/17/2008] [Indexed: 11/29/2022]
|
37
|
Chetverina HV, Chetverin AB. Nanocolonies: Detection, cloning, and analysis of individual molecules. BIOCHEMISTRY (MOSCOW) 2009; 73:1361-87. [DOI: 10.1134/s0006297908130014] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
38
|
Luo HR, Wu GS, Dong C, Arcos-Burgos M, Ribeiro L, Licinio J, Wong ML. Association of PDE11A global haplotype with major depression and antidepressant drug response. Neuropsychiatr Dis Treat 2009; 5:163-70. [PMID: 19557111 PMCID: PMC2695232 DOI: 10.2147/ndt.s4771] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Cyclic nucleotide phosphodiesterases (PDEs) hydrolyze the intracellular second messengers cAMP and cGMP to their corresponding monophosphates. PDEs play an important role in signal transduction by regulating the intracellular concentration of cyclic nucleotides. We have previously shown that the individual haplotype GAACC in the PDE11A gene was associated with major depressive disorder (MDD) based on block-by-block analysis. There are two PDE genes, PDE11A and PDE1A, located in chromosome 2q31-q32. In this study, we have further explored whether the whole region 2q31-q32 contribute to MDD or antidepressant response 278 depressed Mexican-American participants and 321 matched healthy controls. Although there is no significant interaction between the two genes, the remission rate of individual carrying the combination genotype at rs1880916 (AG/AA) and rs1549870 (GG) is significantly increased. We analyzed the global haplotype by examining 16 single-nucleotide polymorphisms (SNPs) in PDE11A and six SNPs in PDE1A. None of the haplotypes consisting of six SNPs in the PDE1A have a significant difference between depressed and control groups. Among haplotypes consisting of 16 SNPs across 440 kb in the PDE11A gene, 18 common haplotypes (with frequency higher than 0.8%) have been found in the studied population. Six haplotypes showed significantly different frequencies between the MDD group and the control group. The phylogenetic network result for the 16 SNPs showed that several historic recombination events have happened in the PDE11A gene. The frequency of one haplotype is significantly lower in the remitter group than in the nonremitter group for the depressed participants treated with either desipramine or fluoxetine. Thus, our data suggest that the PDE11A global haplotype is associated with both MDD and antidepressant drug response.
Collapse
Affiliation(s)
- Huai-Rong Luo
- Center on Pharmacogenomics, Department of Psychiatry and Behavioral Science, University of Miami Miller School of Medicine, Miami, FL, USA
| | | | | | | | | | | | | |
Collapse
|
39
|
Michikawa Y, Sugahara K, Suga T, Ohtsuka Y, Ishikawa K, Ishikawa A, Shiomi N, Shiomi T, Iwakawa M, Imai T. In-gel multiple displacement amplification of long DNA fragments diluted to the single molecule level. Anal Biochem 2008; 383:151-8. [DOI: 10.1016/j.ab.2008.08.011] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2008] [Revised: 08/13/2008] [Accepted: 08/13/2008] [Indexed: 10/21/2022]
|
40
|
Dapprich J, Ferriola D, Magira EE, Kunkel M, Monos D. SNP-specific extraction of haplotype-resolved targeted genomic regions. Nucleic Acids Res 2008; 36:e94. [PMID: 18611953 PMCID: PMC2528194 DOI: 10.1093/nar/gkn345] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
The availability of genotyping platforms for comprehensive genetic analysis of complex traits has resulted in a plethora of studies reporting the association of specific single-nucleotide polymorphisms (SNPs) with common diseases or drug responses. However, detailed genetic analysis of these associated regions that would correlate particular polymorphisms to phenotypes has lagged. This is primarily due to the lack of technologies that provide additional sequence information about genomic regions surrounding specific SNPs, preferably in haploid form. Enrichment methods for resequencing should have the specificity to provide DNA linked to SNPs of interest with sufficient quality to be used in a cost-effective and high-throughput manner. We describe a simple, automated method of targeting specific sequences of genomic DNA that can directly be used in downstream applications. The method isolates haploid chromosomal regions flanking targeted SNPs by hybridizing and enzymatically elongating oligonucleotides with biotinylated nucleotides based on their selective binding to unique sequence elements that differentiate one allele from any other differing sequence. The targeted genomic region is captured by streptavidin-coated magnetic particles and analyzed by standard genotyping, sequencing or microarray analysis. We applied this technology to determine contiguous molecular haplotypes across a ∼150 kb genomic region of the major histocompatibility complex.
Collapse
|
41
|
Strausberg RL, Levy S, Rogers YH. Emerging DNA sequencing technologies for human genomic medicine. Drug Discov Today 2008; 13:569-77. [PMID: 18598911 DOI: 10.1016/j.drudis.2008.03.025] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2008] [Revised: 03/24/2008] [Accepted: 03/31/2008] [Indexed: 01/22/2023]
Abstract
The completion of draft sequences of the human genome represented a remarkable achievement for automated DNA sequencing based on Sanger technology. However, the future requires substantial leaps in sequencing technology such that whole genome sequencing will become a standard component of biomedical research and patient care. In this review we describe current advances that are in early stages of development, but that point toward technology that will enable the onset of genomic medicine encompasses strategies for preventative medicine and intervention based on complete knowledge of an individual's genome.
Collapse
|
42
|
Douville N, Huh D, Takayama S. DNA linearization through confinement in nanofluidic channels. Anal Bioanal Chem 2008; 391:2395-409. [PMID: 18340435 DOI: 10.1007/s00216-008-1995-y] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2007] [Revised: 02/12/2008] [Accepted: 02/18/2008] [Indexed: 12/28/2022]
Abstract
Stretching DNA has emerged as a vital process for studying the physical and biological properties of these molecules. Over the past decade, there has been increasing research interest in utilizing nanoscale fluidic channels to confine and stretch single DNA molecules. Nanofabricated systems for linearizing DNA have revealed new and important insights into the conformation changes of DNA molecules. They also have emerged as innovative techniques for efficiently separating DNA molecules based on size and for physically mapping genetic information along the genome. This review describes physical theories of DNA linearization, current DNA stretching techniques based on nanofabricated channels, and breakthroughs resulting from the use of nanofluidic channels for DNA linearization.
Collapse
Affiliation(s)
- Nicholas Douville
- Department of Biomedical Engineering, University of Michigan, 2200 Bonisteel Blvd, Ann Arbor, MI 48109, USA.
| | | | | |
Collapse
|
43
|
Abstract
Polony DNA sequencing provides an inexpensive, accurate, high-throughput way to resequence genomes of interest by comparison to a reference genome. Mate-paired in vitro shotgun genomic libraries are produced and clonally amplified on microbeads by emulsion PCR. These serve as templates for sequencing by fluorescent nonamer ligation reactions on a microscope slide. Each sequencing run results in millions of 26-bp reads that can be aligned to the reference genome, allowing the identification of differences between sequences.
Collapse
|
44
|
Abstract
Inspired by the principles of biological evolution, biologists--and others--have in recent decades harnessed the power of "natural" selection to sift through huge libraries of genes and find those with desirable properties. At the same time, the demand for high-throughput biochemical and genetic assays and screens has driven the development of increasingly miniaturised assay systems. An exciting synergy is now emerging between these two fields, whereby the tools of ultrahigh-throughput screening promise to open up new directions in molecular engineering.
Collapse
Affiliation(s)
- Valerie Taly
- Medical Research Council, Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK
| | | | | |
Collapse
|
45
|
Chetverin AB, Chetverina HV. Molecular Colony Technique: A New Tool for Biomedical Research and Clinical Practice. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 2008; 82:219-55. [DOI: 10.1016/s0079-6603(08)00007-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
46
|
Suga T, Ishikawa A, Kohda M, Otsuka Y, Yamada S, Yamamoto N, Shibamoto Y, Ogawa Y, Nomura K, Sho K, Omura M, Sekiguchi K, Kikuchi Y, Michikawa Y, Noda S, Sagara M, Ohashi J, Yoshinaga S, Mizoe J, Tsujii H, Iwakawa M, Imai T. Haplotype-based analysis of genes associated with risk of adverse skin reactions after radiotherapy in breast cancer patients. Int J Radiat Oncol Biol Phys 2007; 69:685-93. [PMID: 17889263 DOI: 10.1016/j.ijrobp.2007.06.021] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2007] [Revised: 06/05/2007] [Accepted: 06/05/2007] [Indexed: 11/22/2022]
Abstract
PURPOSE To identify haplotypes of single nucleotide polymorphism markers associated with the risk of early adverse skin reactions (EASRs) after radiotherapy in breast cancer patients. METHODS AND MATERIALS DNA was sampled from 399 Japanese breast cancer patients who qualified for breast-conserving radiotherapy. Using the National Cancer Institute-Common Toxicity Criteria scoring system, version 2, the patients were grouped according to EASRs, defined as those occurring within 3 months of starting radiotherapy (Grade 1 or less, n = 290; Grade 2 or greater, n = 109). A total of 999 single nucleotide polymorphisms from 137 candidate genes for radiation susceptibility were genotyped, and the haplotype associations between groups were assessed. RESULTS The global haplotype association analysis (p < 0.05 and false discovery rate < 0.05) indicated that estimated haplotypes in six loci were associated with EASR risk. A comparison of the risk haplotype with the most frequent haplotype in each locus showed haplotype GGTT in CD44 (odds ratio [OR] = 2.17; 95% confidence interval [CI], 1.07-4.43) resulted in a significantly greater EASR risk. Five haplotypes, CG in MAD2L2 (OR = 0.55; 95% CI, 0.35-0.87), GTTG in PTTG1 (OR = 0.48; 95% CI, 0.24-0.96), TCC (OR = 0.48; 95% CI, 0.26-0.89) and CCG (OR = 0.50; 95% CI, 0.27-0.92) in RAD9A, and GCT in LIG3 (OR = 0.46; 95% CI, 0.22-0.93) were associated with a reduced EASR risk. No significant risk haplotype was observed in REV3L. CONCLUSION Individual radiosensitivity can be partly determined by these haplotypes in multiple loci. Our findings may lead to a better understanding of the mechanisms underlying the genetic variation in radiation sensitivity and resistance among breast cancer patients.
Collapse
Affiliation(s)
- Tomo Suga
- RadGenomics Project, National Institute of Radiological Sciences, Chiba, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Nardi V, Raz T, Cao X, Wu CJ, Stone RM, Cortes J, Deininger MWN, Church G, Zhu J, Daley GQ. Quantitative monitoring by polymerase colony assay of known mutations resistant to ABL kinase inhibitors. Oncogene 2007; 27:775-82. [PMID: 17684485 DOI: 10.1038/sj.onc.1210698] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Resistance to molecularly targeted chemotherapy, and the development of novel agents that are active against resistant forms of target proteins create the need for a sensitive and quantitative assay to monitor drug-resistant mutations in patients to guide treatment and assess response. Here, we describe an application of the polymerase colony (polony) method to identify and quantify known point mutations in the BCR-ABL oncogene in patients with chronic myelogenous leukemia who evolve resistance to ABL kinase inhibitors. The assay can detect mutations with a sensitivity of 10(-4), quantify the burden of drug-resistant cells, and simultaneously monitor the dynamics of several coexisting mutations. As a proof of concept, we analysed blood samples from three patients undergoing therapy with ABL kinase inhibitors and found that the patients' response to therapy correlated with our molecular monitoring. We were also able to detect mutations emerging in patients long before clinical relapse. Therefore, the polony assay could be applied to a larger patient sample to assess the utility of early mutation detection in patient-specific treatment decisions. Finally, this methodology could be a valuable research tool to shed light on the natural behavior of mutations pre-existing kinase inhibitors therapy and either disappearing over time or slowly taking over.
Collapse
Affiliation(s)
- V Nardi
- Division of Hematology/Oncology, Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Children's Hospital Boston, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Ryan D, Rahimi M, Lund J, Mehta R, Parviz BA. Toward nanoscale genome sequencing. Trends Biotechnol 2007; 25:385-9. [PMID: 17658190 DOI: 10.1016/j.tibtech.2007.07.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2007] [Revised: 05/14/2007] [Accepted: 07/11/2007] [Indexed: 01/28/2023]
Abstract
This article reports on the state-of-the-art technologies that sequence DNA using miniaturized devices. The article considers the miniaturization of existing technologies for sequencing DNA and the opportunities for cost reduction that 'on-chip' devices can deliver. The ability to construct nano-scale structures and perform measurements using novel nano-scale effects has provided new opportunities to identify nucleotides directly using physical, and not chemical, methods. The challenges that these technologies need to overcome to provide a US$1000-genome sequencing technology are also presented.
Collapse
Affiliation(s)
- Declan Ryan
- Department of Electrical Engineering, University of Washington Paul Allen Center, AE100R Box 352500 Seattle, WA 98195, USA
| | | | | | | | | |
Collapse
|
49
|
Skotheim RI, Nees M. Alternative splicing in cancer: Noise, functional, or systematic? Int J Biochem Cell Biol 2007; 39:1432-49. [PMID: 17416541 DOI: 10.1016/j.biocel.2007.02.016] [Citation(s) in RCA: 157] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2006] [Revised: 02/13/2007] [Accepted: 02/22/2007] [Indexed: 12/22/2022]
Abstract
Pre-messenger RNA splicing is a fine-tuned process that generates multiple functional variants from individual genes. Various cell types and developmental stages regulate alternative splicing patterns differently in their generation of specific gene functions. In cancers, splicing is significantly altered, and understanding the underlying mechanisms and patterns in cancer will shed new light onto cancer biology. Cancer-specific transcript variants are promising biomarkers and targets for diagnostic, prognostic, and treatment purposes. In this review, we explore how alternative splicing cannot simply be considered as noise or an innocent bystander, but is actively regulated or deregulated in cancers. A special focus will be on aspects of cell biology and biochemistry of alternative splicing in cancer cells, addressing differences in splicing mechanisms between normal and malignant cells. The systems biology of splicing is only now applied to the field of cancer research. We explore functional annotations for some of the most intensely spliced gene classes, and provide a literature mining and clustering that reflects the most intensely investigated genes. A few well-established cancer-specific splice events, such as the CD44 antigen, are used to illustrate the potential behind the exploration of the mechanisms of their regulation. Accordingly, we describe the functional connection between the regulatory machinery (i.e., the spliceosome and its accessory proteins) and their global impact on qualitative transcript variation that are only now emerging from the use of genomic technologies such as microarrays. These studies are expected to open an entirely new level of genetic information that is currently still poorly understood.
Collapse
Affiliation(s)
- Rolf I Skotheim
- Department of Cancer Prevention, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
| | | |
Collapse
|
50
|
ISHAPE: new rapid and accurate software for haplotyping. BMC Bioinformatics 2007; 8:205. [PMID: 17573965 PMCID: PMC1919397 DOI: 10.1186/1471-2105-8-205] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2007] [Accepted: 06/15/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We have developed a new haplotyping program based on the combination of an iterative multiallelic EM algorithm (IEM), bootstrap resampling and a pseudo Gibbs sampler. The use of the IEM-bootstrap procedure considerably reduces the space of possible haplotype configurations to be explored, greatly reducing computation time, while the adaptation of the Gibbs sampler with a recombination model on this restricted space maintains high accuracy. On large SNP datasets (>30 SNPs), we used a segmented approach based on a specific partition-ligation strategy. We compared this software, Ishape (Iterative Segmented HAPlotyping by Em), with reference programs such as Phase, Fastphase, and PL-EM. Analogously with Phase, there are 2 versions of Ishape: Ishape1 which uses a simple coalescence model for the pseudo Gibbs sampler step, and Ishape2 which uses a recombination model instead. RESULTS We tested the program on 2 types of real SNP datasets derived from Hapmap: adjacent SNPs (high LD) and SNPs spaced by 5 Kb (lower level of LD). In both cases, we tested 100 replicates for each size: 10, 20, 30, 40, 50, 60, and 80 SNPs. For adjacent SNPs Ishape2 is superior to the other software both in terms of speed and accuracy. For SNPs spaced by 5 Kb, Ishape2 yields similar results to Phase2.1 in terms of accuracy, and both outperform the other software. In terms of speed, Ishape2 runs about 4 times faster than Phase2.1 with 10 SNPs, and about 10 times faster with 80 SNPs. For the case of 5kb-spaced SNPs, Fastphase may run faster with more than 100 SNPs. CONCLUSION These results show that the Ishape heuristic approach for haplotyping is very competitive in terms of accuracy and speed and deserves to be evaluated extensively for possible future widespread use.
Collapse
|