1
|
Song C, Erijman A, Langhorst BW, Liu P, Dimalanta ET, Davis TB. Abstract 5628: Immune repertoire sequencing facilitates gamma delta T cell clonal determination. Cancer Res 2022. [DOI: 10.1158/1538-7445.am2022-5628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Gamma-delta T cells are a small fraction of T lymphocytes with only 1-5% of the overall T cell population, but they are an important subset that have unique contributions to both innate and adaptive immunity. Gamma-delta T cells can recognize a broad range of antigens to provide rapid responses to pathogens, and can also interact with both immune cells and non-immune tissue cells triggering regulatory and cytotoxic responses. These unique features make them ideal candidates that could be targeted to induce durable immunity in the context of different pathologies. There has been growing interest in understanding the contributions of gamma-delta T cells to immunology and developing efficient gamma-delta T-cell-based therapies for cancer, infectious disease, and autoimmune disease. In this study, we have implemented T cell repertoire sequencing for high throughput characterization. Gamma-delta T cells were enriched from tissues and peripheral blood mononuclear cells (PBMCs). RNA was extracted and used to generate full length TCR libraries. Unique molecular identifiers (UMIs) were incorporated to discretely barcode each mRNA molecule, enabling absolute quantitative ranking of TCR clone abundance. Full length immune repertoire sequencing facilitates the detection of distinct and shared clones in tissue and blood samples, enabling the identification of disease specific clones to evaluate immunotherapy effects. In addition, RNA sequencing was performed for gene expression analysis of gamma-delta T cells to identify cell phenotypes.
Citation Format: Chen Song, Ariel Erijman, Bradley W. Langhorst, Pingfang Liu, Eileen T. Dimalanta, Theodore B. Davis. Immune repertoire sequencing facilitates gamma delta T cell clonal determination [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 5628.
Collapse
|
2
|
Vaisvila R, Ponnaluri VKC, Sun Z, Langhorst BW, Saleh L, Guan S, Dai N, Campbell MA, Sexton BS, Marks K, Samaranayake M, Samuelson JC, Church HE, Tamanaha E, Corrêa IR, Pradhan S, Dimalanta ET, Evans TC, Williams L, Davis TB. Enzymatic methyl sequencing detects DNA methylation at single-base resolution from picograms of DNA. Genome Res 2021; 31:1280-1289. [PMID: 34140313 PMCID: PMC8256858 DOI: 10.1101/gr.266551.120] [Citation(s) in RCA: 124] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Accepted: 05/06/2021] [Indexed: 01/15/2023]
Abstract
Bisulfite sequencing detects 5mC and 5hmC at single-base resolution. However, bisulfite treatment damages DNA, which results in fragmentation, DNA loss, and biased sequencing data. To overcome these problems, enzymatic methyl-seq (EM-seq) was developed. This method detects 5mC and 5hmC using two sets of enzymatic reactions. In the first reaction, TET2 and T4-BGT convert 5mC and 5hmC into products that cannot be deaminated by APOBEC3A. In the second reaction, APOBEC3A deaminates unmodified cytosines by converting them to uracils. Therefore, these three enzymes enable the identification of 5mC and 5hmC. EM-seq libraries were compared with bisulfite-converted DNA, and each library type was ligated to Illumina adaptors before conversion. Libraries were made using NA12878 genomic DNA, cell-free DNA, and FFPE DNA over a range of DNA inputs. The 5mC and 5hmC detected in EM-seq libraries were similar to those of bisulfite libraries. However, libraries made using EM-seq outperformed bisulfite-converted libraries in all specific measures examined (coverage, duplication, sensitivity, etc.). EM-seq libraries displayed even GC distribution, better correlations across DNA inputs, increased numbers of CpGs within genomic features, and accuracy of cytosine methylation calls. EM-seq was effective using as little as 100 pg of DNA, and these libraries maintained the described advantages over bisulfite sequencing. EM-seq library construction, using challenging samples and lower DNA inputs, opens new avenues for research and clinical applications.
Collapse
Affiliation(s)
| | | | - Zhiyi Sun
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | | | - Lana Saleh
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Shengxi Guan
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Nan Dai
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | | | - Brittany S Sexton
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Katherine Marks
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Mala Samaranayake
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - James C Samuelson
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Heidi E Church
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Esta Tamanaha
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Ivan R Corrêa
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Sriharsa Pradhan
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | | | - Thomas C Evans
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Louise Williams
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| | - Theodore B Davis
- New England Biolabs, Incorporated, Ipswich, Massachusetts 01938, USA
| |
Collapse
|
3
|
Malkani S, Chin CR, Cekanaviciute E, Mortreux M, Okinula H, Tarbier M, Schreurs AS, Shirazi-Fard Y, Tahimic CGT, Rodriguez DN, Sexton BS, Butler D, Verma A, Bezdan D, Durmaz C, MacKay M, Melnick A, Meydan C, Li S, Garrett-Bakelman F, Fromm B, Afshinnekoo E, Langhorst BW, Dimalanta ET, Cheng-Campbell M, Blaber E, Schisler JC, Vanderburg C, Friedländer MR, McDonald JT, Costes SV, Rutkove S, Grabham P, Mason CE, Beheshti A. Circulating miRNA Spaceflight Signature Reveals Targets for Countermeasure Development. Cell Rep 2020; 33:108448. [PMID: 33242410 PMCID: PMC8441986 DOI: 10.1016/j.celrep.2020.108448] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 11/02/2020] [Accepted: 11/06/2020] [Indexed: 12/14/2022] Open
Abstract
We have identified and validated a spaceflight-associated microRNA (miRNA) signature that is shared by rodents and humans in response to simulated, short-duration and long-duration spaceflight. Previous studies have identified miRNAs that regulate rodent responses to spaceflight in low-Earth orbit, and we have confirmed the expression of these proposed spaceflight-associated miRNAs in rodents reacting to simulated spaceflight conditions. Moreover, astronaut samples from the NASA Twins Study confirmed these expression signatures in miRNA sequencing, single-cell RNA sequencing (scRNA-seq), and single-cell assay for transposase accessible chromatin (scATAC-seq) data. Additionally, a subset of these miRNAs (miR-125, miR-16, and let-7a) was found to regulate vascular damage caused by simulated deep space radiation. To demonstrate the physiological relevance of key spaceflight-associated miRNAs, we utilized antagomirs to inhibit their expression and successfully rescue simulated deep-space-radiation-mediated damage in human 3D vascular constructs.
Collapse
Affiliation(s)
- Sherina Malkani
- Blue Marble Space Institute of Science, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Christopher R Chin
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Egle Cekanaviciute
- Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Marie Mortreux
- Department of Neurology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
| | - Hazeem Okinula
- Center for Radiological Research, Columbia University, New York, NY 10032, USA
| | - Marcel Tarbier
- Science for Life Laboratory, Department of Molecular Biosciences, the Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
| | - Ann-Sofie Schreurs
- KBR, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Yasaman Shirazi-Fard
- Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Candice G T Tahimic
- KBR, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | | | | | - Daniel Butler
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Akanksha Verma
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Daniela Bezdan
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA; Institute of Medical Virology and Epidemiology of Viral Diseases, University Hospital, Tubingen, Germany
| | - Ceyda Durmaz
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Matthew MacKay
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Ari Melnick
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Cem Meydan
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Sheng Li
- The Jackson Laboratories, Farmington, CT, USA
| | - Francine Garrett-Bakelman
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA; Department of Medicine, University of Virginia School of Medicine, Charlottesville, VA, USA; Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Bastian Fromm
- Science for Life Laboratory, Department of Molecular Biosciences, the Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
| | - Ebrahim Afshinnekoo
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | | | | | - Margareth Cheng-Campbell
- Center for Biotechnology & Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA
| | - Elizabeth Blaber
- Center for Biotechnology & Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA; Universities Space Research Association, Space Biosciences Division, NASA Ames Research Center, Mountain View, CA 94035, USA
| | - Jonathan C Schisler
- McAllister Heart Institute, Department of Pharmacology, and Department of Pathology and Lab Medicine, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Charles Vanderburg
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Marc R Friedländer
- Science for Life Laboratory, Department of Molecular Biosciences, the Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
| | - J Tyson McDonald
- Department of Radiation Medicine, Georgetown University School of Medicine, Washington DC 20007, USA
| | - Sylvain V Costes
- Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Seward Rutkove
- Department of Neurology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
| | - Peter Grabham
- Center for Radiological Research, Columbia University, New York, NY 10032, USA
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA; The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA; The Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA.
| | - Afshin Beheshti
- KBR, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| |
Collapse
|
4
|
Song C, Liu P, Barry A, Langhorst BW, Stewart FJ, Russello S, Davis TB, Dimalanta ET. Abstract 4046: Immune repertoire sequencing reveals tumor microenvironment and tracks clonally expanded B cell and T cell in blood. Cancer Res 2019. [DOI: 10.1158/1538-7445.am2019-4046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The study of complex immunological diseases and tumor microenvironments has progressed through recent developments on sequencing of the immune repertoire. Using this approach, the interrogation of disease progression is facilitated through analysis of millions of V(D)J combinations from B cell antibodies (Igs) and T cell receptors (TCRs). One major challenge of immune repertoire sequencing is to accurately capture the structural and sequence complexities of antibodies and TCR genes. We have developed a method for accurate sequencing of full length immune gene repertoires of B cells and T cells. RNA extracted from tumor tissues containing tumor infiltrating lymphocytes, as well as matched peripheral blood mononuclear cells (PBMCs) were used to generate full length Ig and TCR libraries. Unique molecule index (UMI) was used to discretely barcode each mRNA molecule, enabling absolute quantitative ranking of antibody/TCR clone abundance. Full length immune repertoire sequencing facilitates detection of distinct and shared clones in tissue and blood samples, enabling identification of disease specific clones to evaluate immunotherapy effects. Highly expanded clones in tumor samples are also found in blood samples. RNA-seq libraries were also constructed from the same RNA for Ig and TCR libraries. The expression level of IGH/IGL/IGK and TRA/TRB in the immune sequencing libraries highly correlates with the RNA-seq data. In addition, both Ig and TCR libraries can be constructed in one tube to obtain a whole immune repertoire profile.Our immune repertoire sequencing approach allows accurate clonal determination for both Ig and TCR. This technique is applicable for investigation of lymphocytes infiltration of tumor microenvironments, tracing expanded B cell and T cells in blood samples, and monitoring of minimal residual disease.
Citation Format: Chen Song, Pingfang Liu, Andrew Barry, Bradley W. Langhorst, Fiona J. Stewart, Salvatore Russello, Theodore B. Davis, Eileen T. Dimalanta. Immune repertoire sequencing reveals tumor microenvironment and tracks clonally expanded B cell and T cell in blood [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA. Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr 4046.
Collapse
|
5
|
Song C, Sun L, Liu P, Langhorst B, Barry A, Davis TB, Dimalanta ET. Abstract B046: Immune repertoire sequencing enables complete B-cell and T-cell clonality determination and minimal residual disease assessment. Cancer Immunol Res 2019. [DOI: 10.1158/2326-6074.cricimteatiaacr18-b046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The study of complex immunologic diseases and tumor microenvironment has progressed through recent developments that enable the sequencing of the immune repertoire. Using this approach, the interrogation of disease progression is facilitated through analysis of millions of V(D)J combinations from both B cell antibodies (Igs) and T-cell receptors (TCRs). One major challenge of immune repertoire sequencing is to capture the structural and sequence complexities of antibody and TCR genes. We have developed and optimized a method for accurate sequencing of full-length immune gene repertoires of B-cells and T-cells. RNA extracted from tissue and PBMCs were used to generate immune sequencing libraries. Using a unique molecule index (UMI) to discretely barcode each mRNA molecule, PCR copies of each mRNA fragment can be collapsed into a single consensus sequence. The B cell genes were enriched during library preparation by IGH, IGK and IGL primes, including isotype-specific primers (IgA, IgD, IgE, IgG and IgM). The T-cell genes were enriched by TCRα and TCRβ specific primers. To investigate the applicability of minimal residual disease assessment, we obtained Jurkat RNA, a homogenous population of leukemic Jurkat T-cells and spiked it into a PBMC RNA sample at varying proportions of Jurkat RNA (10%, 1%, 0.1%, 0.01% and 0.001%). The RNA mixtures were made into TCR libraries, sequenced on both Illumina MiSeq and Oxford Nanopore MinION, and analyzed to assess the TCR repertoire. Utilization of UMIs enabled absolute quantification of the B cell antibody/TCR clones and accurate ranking of their abundance. For B cell repertoire sequencing, the use of isotype-specific primers enabled measurement of the heavy chain isotype proportions within the samples. Full-length heavy chain antibody analysis enabled measurement of the mutation level of each antibody sequence, providing information on the overall maturity and mutational profile of the sample repertoire. For TCR repertoire sequencing, distinct and shared clonal sequences were quantitatively detected in PBMC samples. The method also accurately and sensitively detected the control TCR clone spiked into both Illumina and Oxford Nanopore libraries at levels appropriate for minimal residual disease assessment.This immune repertoire sequencing approach allows accurate clonal determination for both Igs and TCRs. This technique is applicable for a variety of applications including design of antibody chains for in vitro synthesis, investigation of T-cell infiltration of tumor microenvironments, and monitoring of minimal residual disease.
Citation Format: Chen Song, Luo Sun, Pingfang Liu, Bradley Langhorst, Andrew Barry, Theodore B. Davis, Eileen T. Dimalanta. Immune repertoire sequencing enables complete B-cell and T-cell clonality determination and minimal residual disease assessment [abstract]. In: Proceedings of the Fourth CRI-CIMT-EATI-AACR International Cancer Immunotherapy Conference: Translating Science into Survival; Sept 30-Oct 3, 2018; New York, NY. Philadelphia (PA): AACR; Cancer Immunol Res 2019;7(2 Suppl):Abstract nr B046.
Collapse
Affiliation(s)
- Chen Song
- New England Biolabs, Inc., Ipswich, MA
| | - Luo Sun
- New England Biolabs, Inc., Ipswich, MA
| | | | | | | | | | | |
Collapse
|
6
|
Kovar L, Nageswara-Rao M, Ortega-Rodriguez S, Dugas DV, Straub S, Cronn R, Strickler SR, Hughes CE, Hanley KA, Rodriguez DN, Langhorst BW, Dimalanta ET, Bailey CD. PacBio-Based Mitochondrial Genome Assembly of Leucaena trichandra (Leguminosae) and an Intrageneric Assessment of Mitochondrial RNA Editing. Genome Biol Evol 2018; 10:2501-2517. [PMID: 30137422 PMCID: PMC6161758 DOI: 10.1093/gbe/evy179] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/17/2018] [Indexed: 12/31/2022] Open
Abstract
Reconstructions of vascular plant mitochondrial genomes (mt-genomes) are notoriously complicated by rampant recombination that has resulted in comparatively few plant mt-genomes being available. The dearth of plant mitochondrial resources has limited our understanding of mt-genome structural diversity, complex patterns of RNA editing, and the origins of novel mt-genome elements. Here, we use an efficient long read (PacBio) iterative assembly pipeline to generate mt-genome assemblies for Leucaena trichandra (Leguminosae: Caesalpinioideae: mimosoid clade), providing the first assessment of non-papilionoid legume mt-genome content and structure to date. The efficiency of the assembly approach facilitated the exploration of alternative structures that are common place among plant mitochondrial genomes. A compact version (729 kbp) of the recovered assemblies was used to investigate sources of mt-genome size variation among legumes and mt-genome sequence similarity to the legume associated root holoparasite Lophophytum. The genome and an associated suite of transcriptome data from select species of Leucaena permitted an in-depth exploration of RNA editing in a diverse clade of closely related species that includes hybrid lineages. RNA editing in the allotetraploid, Leucaena leucocephala, is consistent with co-option of nearly equal maternal and paternal C-to-U edit components, generating novel combinations of RNA edited sites. A preliminary investigation of L. leucocephala C-to-U edit frequencies identified the potential for a hybrid to generate unique pools of alleles from parental variation through edit frequencies shared with one parental lineage, those intermediate between parents, and transgressive patterns.
Collapse
Affiliation(s)
- Lynsey Kovar
- Department of Biology, New Mexico State University
| | | | | | | | - Shannon Straub
- Department of Biology, Hobart and William Smith Colleges, Geneva, New York
| | - Richard Cronn
- Pacific Northwest Research Station, Corvallis, Oregon
| | | | - Colin E Hughes
- Department of Systematic & Evolutionary Botany, University of Zurich, Switzerland
| | | | | | | | | | | |
Collapse
|
7
|
Song C, Liu P, Barry A, Dimalanta ET, Stewart FJ, Russello S, Davis TB. Abstract 4675: B-cell and T-cell repertoire sequencing enables somatic hypermutation and minimal residual disease assessment. Cancer Res 2018. [DOI: 10.1158/1538-7445.am2018-4675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The study of complex immunological diseases has progressed through recent developments that enable the sequencing of the immune repertoire. Using this approach, the interrogation of disease progression is facilitated through analysis of millions of V(D)J combinations from both B-cell antibodies and T-cell Receptors (TCR). One major challenge of immune repertoire sequencing is to capture the structural and sequence complexities of antibody and TCR genes. We have developed and optimized a method for accurate sequencing of full-length immune gene repertoires of B-cells and T-cells. RNA extracted from PBMCs were used to generate immune sequencing libraries in duplicate from a single patient. Using a unique barcoding scheme specifically designed to discretely barcode each mRNA molecule with a unique molecule index (UMI), PCR copies of each mRNA fragment can be collapsed into a single consensus sequence, improving accuracy through the resolution of PCR bias and sequencing errors. Utilization of the UMIs enabled absolute quantification of input RNA molecules and accurate ranking of the antibody/TCR clone abundance. This approach was also applied to the analysis of tumor samples for abundance of expanded clones through clustering based on V gene, J gene and CDR3 similarity and ranking by mRNA abundance. The use of isotype-specific primers (IgM, IgD, IgG, IgA and IgE) enabled measurement of the heavy chain isotype proportions within the samples. Full-length heavy chain antibody sequences generated using this method were aligned to germline genes from reference databases, enabling quantitation of the mutation level of each antibody sequence, providing information on the overall maturity and mutational profile of the sample repertoire. Additionally, this approach was capable of detecting minimal residual diseases by sensitive quantification of TCR rearrangement. This novel method allows for exhaustive somatic mutation profiling across complete V, D and J segments and full isotype information analysis. The information obtained enables the possibility for synthesis and expression of complete antibody chains and T-cell receptor chains for downstream immunological assays.
Citation Format: Chen Song, Pingfang Liu, Andrew Barry, Eileen T. Dimalanta, Fiona J. Stewart, Salvatore Russello, Theodore B. Davis. B-cell and T-cell repertoire sequencing enables somatic hypermutation and minimal residual disease assessment [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 4675.
Collapse
|
8
|
Druliner BR, Vera D, Johnson R, Ruan X, Apone LM, Dimalanta ET, Stewart FJ, Boardman L, Dennis JH. Comprehensive nucleosome mapping of the human genome in cancer progression. Oncotarget 2017; 7:13429-45. [PMID: 26735342 PMCID: PMC4924652 DOI: 10.18632/oncotarget.6811] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 12/21/2015] [Indexed: 11/25/2022] Open
Abstract
Altered chromatin structure is a hallmark of cancer, and inappropriate regulation of chromatin structure may represent the origin of transformation. Important studies have mapped human nucleosome distributions genome wide, but the role of chromatin structure in cancer progression has not been addressed. We developed a MNase-Transcription Start Site Sequence Capture method (mTSS-seq) to map the nucleosome distribution at human transcription start sites genome-wide in primary human lung and colon adenocarcinoma tissue. Here, we confirm that nucleosome redistribution is an early, widespread event in lung (LAC) and colon (CRC) adenocarcinoma. These altered nucleosome architectures are consistent between LAC and CRC patient samples indicating that they may serve as important early adenocarcinoma markers. We demonstrate that the nucleosome alterations are driven by the underlying DNA sequence and potentiate transcription factor binding. We conclude that DNA-directed nucleosome redistributions are widespread early in cancer progression. We have proposed an entirely new hierarchical model for chromatin-mediated genome regulation.
Collapse
Affiliation(s)
- Brooke R Druliner
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America.,Division of Gastroenterology and Hepatology, Department of Internal Medicine, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Daniel Vera
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America.,The Center for Genomics and Personalized Medicine, The Florida State University, Tallahassee, Florida, United States of America
| | - Ruth Johnson
- Department of Laboratory Medicine and Experimental Pathology, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Xiaoyang Ruan
- Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic College of Medicine, Rochester, Minnesota, United States of America
| | - Lynn M Apone
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Eileen T Dimalanta
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Fiona J Stewart
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Lisa Boardman
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Jonathan H Dennis
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America.,The Center for Genomics and Personalized Medicine, The Florida State University, Tallahassee, Florida, United States of America.,Institute of Molecular Biophysics, The Florida State University, Tallahassee, Florida, United States of America
| |
Collapse
|
9
|
Krishnan K, Yigit E, Karaca M, Rodriguez D, Langhorst B, Shtatland T, Munafo D, Liu P, Apone L, Panchapakesa V, Duggan K, Sumner C, Rozzi C, Stewart F, Mazzola L, Bybee J, Rivizzigno D, Dimalanta ET, Davis TB. Abstract 5406: Low-input transcript profiling with enhanced sensitivity using a highly efficient, low-bias and strand-specific RNA-Seq library preparation method. Cancer Res 2017. [DOI: 10.1158/1538-7445.am2017-5406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
RNA-seq has become the most popular method for transcriptome analysis and is widely used to study gene expression, detect mutations, fusion transcripts, alternative splicing, and post-transcriptional modifications. It is becoming the method of choice to detect alterations in diseases, especially cancer, to provide insights on the various molecular pathways perturbed by changes in the transcriptome and study their implications. As RNA-seq is being adopted in molecular diagnostics and biomarker identifications, the need for good quality, reproducible library preparation methods using very low amounts of RNA input, or precious clinical samples, is increasing. To address these challenges, we have developed a strand-specific RNA-seq library preparation method that retains information about which strand of DNA is transcribed, from as low as 5 ng total RNA input. Strand specificity is important for correct annotation of genes, identification of antisense transcripts with potential regulatory roles, and accurate determination of gene expression levels in the presence of antisense transcripts. Enhanced sensitivity to detect transcripts with even coverage across their length offers a non-biased approach for accurate quantification of transcript levels.
Methods: Enriched poly-A mRNA or ribo-depleted RNA (Universal Human Reference RNA) was used to make libraries with our strand specific library preparation method. Library quality and quantity were analyzed on an Agilent Bioanalyzer, pooled at equimolar ratio and sequenced on Illumina’s Nextseq 500. Paired end reads were mapped to a human reference genome (hg19) using Hisat2 and sequencing metrics were calculated using Picard's RNA-seq Metrics and RSeqC tools. Transcript abundance was measured using Salmon and the Ensembl GRCh38 CDS sequences.
Results: Libraries prepared with our streamlined method using inputs that range from 5ng to 1ug show greater than 98% directionality at all input levels. GC content analysis, gene body coverage and gene expression correlation are similar for all inputs tested (5ng to 1ug), even though input amounts vary by over three orders of magnitude. These consistent results are recapitulated with the spiked-in ERCC controls at all inputs.
Conclusions: Our library preparation method is streamlined and can be used for a wide range of input RNA without any major modifications to the protocol, making it an easy to follow, convenient method for RNA-seq library preparation. In addition, our method has increased sensitivity and specificity, especially for low-abundance transcripts, reduced PCR duplicates and sequence bias, delivering high quality strand-specific data even for low input RNA. Finally, our method is compatible with both poly A-tail enriched and ribosomal RNA depleted samples, and is amenable to large-scale library construction and automation.
Citation Format: Keerthana Krishnan, Erbay Yigit, Mehmet Karaca, Deyra Rodriguez, Bradley Langhorst, Timur Shtatland, Daniela Munafo, Pingfang Liu, Lynne Apone, Vaishnavi Panchapakesa, Karen Duggan, Christine Sumner, Christine Rozzi, Fiona Stewart, Laurie Mazzola, Joanna Bybee, Danielle Rivizzigno, Eileen T. Dimalanta, Theodore B. Davis. Low-input transcript profiling with enhanced sensitivity using a highly efficient, low-bias and strand-specific RNA-Seq library preparation method [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 5406. doi:10.1158/1538-7445.AM2017-5406
Collapse
|
10
|
liu P, Chen L, Ettwiller L, Sumner C, Stewart FJ, Dimalanta ET, Davis TB, Thomas EC. Abstract 3628: Improving sequencing quality of libraries prepared from FFPE DNA. Cancer Res 2016. [DOI: 10.1158/1538-7445.am2016-3628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Targeted cancer therapy based on genomic alterations can be remarkably effective, and has made significant strides with the recent advances in next-generation sequencing (NGS) technology. Although samples of blood and other bodily fluids are being actively explored for early disease diagnosis and treatment monitoring, DNA isolated from FFPE samples is currently the main source for NGS-based cancer profiling in clinical settings. Unfortunately, sequencing DNA from FFPE samples is challenging due to limited quantities and poor quality, a result of DNA damage incurred during fixation and storage. Artifacts associated with FFPE DNA have limited the mutation detection sensitivity to ≥ 5% mutant allele frequency (Frampton et al, Nature Biotechnology 2013), which would unfortunately leave many low-abundance genetic variants of clinical significance undetected. For example, clinical resistance-causing KIT and EGFR mutations can be present in tumors at levels << 1% (Milbury et al, Clin. Chem. 2012).
In this study, we investigated the effects of DNA repair and different sample handling workflows on sequencing quality of libraries prepared from FFPE samples. Careful analysis of sequencing data showed that base calling qualities for all 4 bases are improved, and aberrant G:C to A:T mutations were significantly reduced upon DNA repair. Because the large majority of mutations encountered in human tumors are G:C to A:T mutations (Greenman, C. Nature 2007), we expect that lowering the damage induced background noise of FFPE DNA would allow more reliable detection of clinically important, actionable mutations at lower abundance. In addition, we observed specific sequencing artifacts associated with the method of handling FFPE samples and have since identified effective measurements to avoid such artifacts. We expect that these improvements in sequencing quality of FFPE samples would ultimately enable more sensitive and robust detection of many low level genetic variations in clinically and biologically relevant cancer genes.
Citation Format: pingfang liu, Lixin Chen, Laurence Ettwiller, Christine Sumner, Fiona J. Stewart, Eileen T. Dimalanta, Theodore B. Davis, Evans C. Thomas. Improving sequencing quality of libraries prepared from FFPE DNA. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 3628.
Collapse
|
11
|
Stewart FJ, Karaca M, Adams K, Clouser C, Patel B, Timberlake S, Donahue W, Apone L, Russello S, Dimalanta ET, Davis TB, Vigneault F. Abstract 3998: Sequencing the B-cell and T-cell repertoire. Cancer Res 2016. [DOI: 10.1158/1538-7445.am2016-3998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Immune sequencing allows for the study of complex immunological diseases by sequencing millions of V(D)J combinations from B-cell antibody and T-cell receptors. The popularity of this technique has increased due to recent throughput and read length improvements in next-generation sequencing technologies. However, structural and sequence complexities of antibody genes have made reliable targeting approaches challenging.
We have developed and optimized a method for accurate sequencing of full-length immune gene repertoires of B-cells and T-cells. The method uses a unique barcoding scheme specifically designed to tag every mRNA molecule with a unique identifier (UID) so that all PCR copies of each mRNA fragment can be collapsed into a single consensus sequence. This makes the assay extremely accurate, by resolving PCR bias and sequencing errors as well as allowing quantitative digital molecule counting.
Immune sequencing libraries were generated from total RNA extracted from Peripheral Blood Mononuclear Cells in duplicate from a single patient. The use of UIDs enabled absolute quantification of starting RNA molecules present in the original sample and therefore accurate ranking of the antibody clone abundance, by avoiding the bias incorporated by PCR or sequencing when total reads only were measured. Using the same sequencing method, tumor samples were analyzed for abundance of expanded clones via grouping clones by V gene, J gene and CDR3 similarity and ranking by mRNA abundance. Additionally, the use of isotype-specific primers (IgM, IgD, IgG, IgA and IgE) enabled measurement of the heavy chain isotype proportions within the samples. Further, alignment of full-length heavy chain antibody sequences generated using this method to germline genes from reference databases enabled quantitation of the mutation level of each antibody sequence, thereby providing information on the overall maturity and mutational profile of the sample repertoire.
Citation Format: Fiona J. Stewart, Mehmet Karaca, Kris Adams, Chris Clouser, Bonny Patel, Sonia Timberlake, William Donahue, Lynne Apone, Salvatore Russello, Eileen T. Dimalanta, Theodore B. Davis, Francois Vigneault. Sequencing the B-cell and T-cell repertoire. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 3998.
Collapse
|
12
|
Yigit E, Feehery GR, Langhorst BW, Stewart FJ, Dimalanta ET, Pradhan S, Slatko B, Gardner AF, McFarland J, Sumner C, Davis TB. A Microbiome DNA Enrichment Method for Next-Generation Sequencing Sample Preparation. ACTA ACUST UNITED AC 2016; 115:7.26.1-7.26.14. [PMID: 27366894 DOI: 10.1002/cpmb.12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
"Microbiome" is used to describe the communities of microorganisms and their genes in a particular environment, including communities in association with a eukaryotic host or part of a host. One challenge in microbiome analysis concerns the presence of host DNA in samples. Removal of host DNA before sequencing results in greater sequence depth of the intended microbiome target population. This unit describes a novel method of microbial DNA enrichment in which methylated host DNA such as human genomic DNA is selectively bound and separated from microbial DNA before next-generation sequencing (NGS) library construction. This microbiome enrichment technique yields a higher fraction of microbial sequencing reads and improved read quality resulting in a reduced cost of downstream data generation and analysis. © 2016 by John Wiley & Sons, Inc.
Collapse
|
13
|
Munafó DB, Langhorst BW, Chater CL, Sumner CJ, Rodríguez DN, Russello S, Gardner AF, Slatko BE, Stewart FJ, Sinicropi D, Morlan J, Qu K, Dimalanta ET, Davis TB. Selective Depletion of Abundant RNAs to Enable Transcriptome Analysis of Low-Input and Highly Degraded Human RNA. ACTA ACUST UNITED AC 2016; 113:7.22.1-7.22.9. [PMID: 31773915 DOI: 10.1002/0471142727.mb0722s113] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Ribosomal RNAs (rRNAs) are extremely abundant, often constituting 80% to 90% of total RNA. Since rRNA sequences are often not of interest in genomic RNA sequencing experiments, rRNAs can be removed from the sample before the library preparation step, in order to prevent the majority of the library and the majority of sequencing reads from being rRNA. Removal of rRNA can be especially challenging for low quality and formalin-fixed paraffin-embedded (FFPE) RNA samples due to the fragmented nature of these RNA molecules. The NEBNext rRNA Depletion Kit (Human/Mouse/Rat) depletes both cytoplasmic (5 S rRNA, 5.8 S rRNA, 18 S rRNA, and 28 S rRNA) and mitochondrial rRNA (12 S rRNA and 16 S rRNA) from total RNA preparations from human, mouse, and rat samples. Due to the high similarity among mammalian rRNA sequences, it is likely that rRNA depletion can also be achieved for other mammals but has not been empirically tested. This product is compatible with both intact and degraded RNA (e.g., FFPE RNA). The resulting rRNA-depleted RNA is suitable for RNA-seq, random-primed cDNA synthesis, or other downstream RNA analysis applications. Regardless of the quality or amount of input RNA, this method efficiently removes rRNA, while retaining non-coding and other non-poly(A) RNAs. The NEBNext rRNA Depletion Kit thus provides a more complete picture of the transcript repertoire than oligo d(T) poly(A) mRNA enrichment methods. © 2016 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - John Morlan
- Genomic Health, Inc, Redwood City, California
| | - Kunbin Qu
- Genomic Health, Inc, Redwood City, California
| | | | | |
Collapse
|
14
|
Feehery GR, Yigit E, Oyola SO, Langhorst BW, Schmidt VT, Stewart FJ, Dimalanta ET, Amaral-Zettler LA, Davis T, Quail MA, Pradhan S. A method for selectively enriching microbial DNA from contaminating vertebrate host DNA. PLoS One 2013; 8:e76096. [PMID: 24204593 PMCID: PMC3810253 DOI: 10.1371/journal.pone.0076096] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Accepted: 08/20/2013] [Indexed: 12/05/2022] Open
Abstract
DNA samples derived from vertebrate skin, bodily cavities and body fluids contain both host and microbial DNA; the latter often present as a minor component. Consequently, DNA sequencing of a microbiome sample frequently yields reads originating from the microbe(s) of interest, but with a vast excess of host genome-derived reads. In this study, we used a methyl-CpG binding domain (MBD) to separate methylated host DNA from microbial DNA based on differences in CpG methylation density. MBD fused to the Fc region of a human antibody (MBD-Fc) binds strongly to protein A paramagnetic beads, forming an effective one-step enrichment complex that was used to remove human or fish host DNA from bacterial and protistan DNA for subsequent sequencing and analysis. We report enrichment of DNA samples from human saliva, human blood, a mock malaria-infected blood sample and a black molly fish. When reads were mapped to reference genomes, sequence reads aligning to host genomes decreased 50-fold, while bacterial and Plasmodium DNA sequences reads increased 8-11.5-fold. The Shannon-Wiener diversity index was calculated for 149 bacterial species in saliva before and after enrichment. Unenriched saliva had an index of 4.72, while the enriched sample had an index of 4.80. The similarity of these indices demonstrates that bacterial species diversity and relative phylotype abundance remain conserved in enriched samples. Enrichment using the MBD-Fc method holds promise for targeted microbiome sequence analysis across a broad range of sample types.
Collapse
Affiliation(s)
- George R. Feehery
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Erbay Yigit
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | | | | | - Victor T. Schmidt
- The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, United States of America
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
| | - Fiona J. Stewart
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | | | - Linda A. Amaral-Zettler
- The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, United States of America
- Department of Geological Sciences, Brown University, Providence, Rhode Island, United States of America
| | - Theodore Davis
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | | | - Sriharsa Pradhan
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| |
Collapse
|
15
|
Liu P, Lohman GJS, Cantor E, Langhorst BW, Yigit E, Apone LM, Munafo DB, Sumner C, Stewart FJ, Evans TC, Nichols NM, Dimalanta ET, Davis TB. A fast solution to NGS library preparation with low nanogram DNA input. BMC Proc 2012. [PMCID: PMC3467472 DOI: 10.1186/1753-6561-6-s6-p26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
16
|
Munafo D, Liu P, Sumner C, Yigit E, Merrill L, Apone L, Langhorst B, Stewart F, Dimalanta ET, Davis T. Comparative analysis of strand-specific RNA sequencing approaches. BMC Proc 2012. [PMCID: PMC3467684 DOI: 10.1186/1753-6561-6-s6-p30] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
|
17
|
Liu P, Lohman G, Cantor E, Langhorst B, Apone L, Munafo D, Sumner C, Stewart F, Evans TC, Dimalanta ET, Davis T. Abstract 3180: A fast solution to NGS library preparation with low nanogram DNA input. Cancer Res 2012. [DOI: 10.1158/1538-7445.am2012-3180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Next generation sequencing (NGS) has significantly impacted cancer genetics, enabling a comprehensive characterization of genomic abnormalities in the cancer genome at very high resolution. By delivering massive DNA sequences at unprecedented speed and cost, NGS promises to make personalized genome-based diagnosis and targeted cancer therapy a reality in the foreseeable future. To date, library construction with clinical samples has been a challenge, primarily due to the limited quantities of sample DNA available. To overcome this challenge, we have developed a fast library preparation method using novel NEBNext reagents and adaptors. This method enables library construction from a minimal quantity of DNA (< 5 ng), and can be used for both intact and fragmented DNA. Moreover, the workflow is compatible with multiple NGS platforms.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 103rd Annual Meeting of the American Association for Cancer Research; 2012 Mar 31-Apr 4; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2012;72(8 Suppl):Abstract nr 3180. doi:1538-7445.AM2012-3180
Collapse
|
18
|
Antipova AA, Sokolsky TD, Clouser CR, Dimalanta ET, Hendrickson CL, Kosnopo C, Lee CC, Ranade SS, Zhang L, Blanchard AP, McKernan KJ. Polymorphism discovery in high-throughput resequenced microarray-enriched human genomic loci. J Biomol Tech 2009; 20:253-257. [PMID: 19949697 PMCID: PMC2777346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Identifying genetic variants and mutations that underlie human diseases requires development of robust, cost-effective tools for routine resequencing of regions of interest in the human genome. Here, we demonstrate that coupling Applied Biosystems SOLiD system-sequencing platform with microarray capture of targeted regions provides an efficient and robust method for high-coverage resequencing and polymorphism discovery in human protein-coding exons.
Collapse
|
19
|
McKernan KJ, Peckham HE, Costa GL, McLaughlin SF, Fu Y, Tsung EF, Clouser CR, Duncan C, Ichikawa JK, Lee CC, Zhang Z, Ranade SS, Dimalanta ET, Hyland FC, Sokolsky TD, Zhang L, Sheridan A, Fu H, Hendrickson CL, Li B, Kotler L, Stuart JR, Malek JA, Manning JM, Antipova AA, Perez DS, Moore MP, Hayashibara KC, Lyons MR, Beaudoin RE, Coleman BE, Laptewicz MW, Sannicandro AE, Rhodes MD, Gottimukkala RK, Yang S, Bafna V, Bashir A, MacBride A, Alkan C, Kidd JM, Eichler EE, Reese MG, De La Vega FM, Blanchard AP. Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. Genome Res 2009; 19:1527-41. [PMID: 19546169 DOI: 10.1101/gr.091868.109] [Citation(s) in RCA: 421] [Impact Index Per Article: 28.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
We describe the genome sequencing of an anonymous individual of African origin using a novel ligation-based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9%, allowing us to accurately call SNPs with as few as two reads per allele. We collected several billion mate-paired reads yielding approximately 18x haploid coverage of aligned sequence and close to 300x clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read, and 99.65% is spanned by at least one uniquely placed mate-paired clone. We identify over 3.8 million SNPs, 19% of which are novel. Mate-paired data are used to physically resolve haplotype phases of nearly two-thirds of the genotypes obtained and produce phased segments of up to 215 kb. We detect 226,529 intra-read indels, 5590 indels between mate-paired reads, 91 inversions, and four gene fusions. We use a novel approach for detecting indels between mate-paired reads that are smaller than the standard deviation of the insert size of the library and discover deletions in common with those detected with our intra-read approach. Dozens of mutations previously described in OMIM and hundreds of nonsynonymous single-nucleotide and structural variants in genes previously implicated in disease are identified in this individual. There is more genetic variation in the human genome still to be uncovered, and we provide guidance for future surveys in populations and cancer biopsies.
Collapse
|
20
|
Dimalanta ET, Lim A, Runnheim R, Lamers C, Churas C, Forrest DK, de Pablo JJ, Graham MD, Coppersmith SN, Goldstein S, Schwartz DC. A microfluidic system for large DNA molecule arrays. Anal Chem 2006; 76:5293-301. [PMID: 15362885 DOI: 10.1021/ac0496401] [Citation(s) in RCA: 138] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Single molecule approaches offer the promise of large, exquisitely miniature ensembles for the generation of equally large data sets. Although microfluidic devices have previously been designed to manipulate single DNA molecules, many of the functionalities they embody are not applicable to very large DNA molecules, normally extracted from cells. Importantly, such microfluidic devices must work within an integrated system to enable high-throughput biological or biochemical analysis-a key measure of any device aimed at the chemical/biological interface and required if large data sets are to be created for subsequent analysis. The challenge here was to design an integrated microfluidic device to control the deposition or elongation of large DNA molecules (up to millimeters in length), which would serve as a general platform for biological/biochemical analysis to function within an integrated system that included massively parallel data collection and analysis. The approach we took was to use replica molding to construct silastic devices to consistently deposit oriented, elongated DNA molecules onto charged surfaces, creating massive single molecule arrays, which we analyzed for both physical and biochemical insights within an integrated environment that created large data sets. The overall efficacy of this approach was demonstrated by the restriction enzyme mapping and identification of single human genomic DNA molecules.
Collapse
Affiliation(s)
- Eileen T Dimalanta
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, and Laboratory of Genetics, University of Wisconsin-Madison, 425 Henry Mall, Madison, Wisconsin 53706, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Zhou S, Kile A, Bechner M, Place M, Kvikstad E, Deng W, Wei J, Severin J, Runnheim R, Churas C, Forrest D, Dimalanta ET, Lamers C, Burland V, Blattner FR, Schwartz DC. Single-molecule approach to bacterial genomic comparisons via optical mapping. J Bacteriol 2004; 186:7773-82. [PMID: 15516592 PMCID: PMC524920 DOI: 10.1128/jb.186.22.7773-7782.2004] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Modern comparative genomics has been established, in part, by the sequencing and annotation of a broad range of microbial species. To gain further insights, new sequencing efforts are now dealing with the variety of strains or isolates that gives a species definition and range; however, this number vastly outstrips our ability to sequence them. Given the availability of a large number of microbial species, new whole genome approaches must be developed to fully leverage this information at the level of strain diversity that maximize discovery. Here, we describe how optical mapping, a single-molecule system, was used to identify and annotate chromosomal alterations between bacterial strains represented by several species. Since whole-genome optical maps are ordered restriction maps, sequenced strains of Shigella flexneri serotype 2a (2457T and 301), Yersinia pestis (CO 92 and KIM), and Escherichia coli were aligned as maps to identify regions of homology and to further characterize them as possible insertions, deletions, inversions, or translocations. Importantly, an unsequenced Shigella flexneri strain (serotype Y strain AMC[328Y]) was optically mapped and aligned with two sequenced ones to reveal one novel locus implicated in serotype conversion and several other loci containing insertion sequence elements or phage-related gene insertions. Our results suggest that genomic rearrangements and chromosomal breakpoints are readily identified and annotated against a prototypic sequenced strain by using the tools of optical mapping.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computation Genomics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Abstract
An extended Brownian dynamics simulation method is used to characterize the dynamics of long DNA molecules flowing in microchannels. The relaxation time increases due to confinement in agreement with scaling predictions. During flow the molecules migrate toward the channel center line, and thereby segregate according to molecular weight. Capturing these effects requires the detailed incorporation of solvent flow in the simulation method, demonstrating the importance of hydrodynamic effects in the dynamics of confined macromolecules.
Collapse
Affiliation(s)
- Richard M Jendrejack
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | | | | | | |
Collapse
|
23
|
Zhou S, Deng W, Anantharaman TS, Lim A, Dimalanta ET, Wang J, Wu T, Chunhong T, Creighton R, Kile A, Kvikstad E, Bechner M, Yen G, Garic-Stankovic A, Severin J, Forrest D, Runnheim R, Churas C, Lamers C, Perna NT, Burland V, Blattner FR, Mishra B, Schwartz DC. A whole-genome shotgun optical map of Yersinia pestis strain KIM. Appl Environ Microbiol 2002; 68:6321-31. [PMID: 12450857 PMCID: PMC134435 DOI: 10.1128/aem.68.12.6321-6331.2002] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2002] [Accepted: 09/12/2002] [Indexed: 11/20/2022] Open
Abstract
Yersinia pestis is the causative agent of the bubonic, septicemic, and pneumonic plagues (also known as black death) and has been responsible for recurrent devastating pandemics throughout history. To further understand this virulent bacterium and to accelerate an ongoing sequencing project, two whole-genome restriction maps (XhoI and PvuII) of Y. pestis strain KIM were constructed using shotgun optical mapping. This approach constructs ordered restriction maps from randomly sheared individual DNA molecules directly extracted from cells. The two maps served different purposes; the XhoI map facilitated sequence assembly by providing a scaffold for high-resolution alignment, while the PvuII map verified genome sequence assembly. Our results show that such maps facilitated the closure of sequence gaps and, most importantly, provided a purely independent means for sequence validation. Given the recent advancements to the optical mapping system, increased resolution and throughput are enabling such maps to guide sequence assembly at a very early stage of a microbial sequencing project.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Lim A, Dimalanta ET, Potamousis KD, Yen G, Apodoca J, Tao C, Lin J, Qi R, Skiadas J, Ramanathan A, Perna NT, Plunkett G, Burland V, Mau B, Hackett J, Blattner FR, Anantharaman TS, Mishra B, Schwartz DC. Shotgun optical maps of the whole Escherichia coli O157:H7 genome. Genome Res 2001; 11:1584-93. [PMID: 11544203 PMCID: PMC311123 DOI: 10.1101/gr.172101] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 06/04/2001] [Indexed: 11/24/2022]
Abstract
We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies.
Collapse
Affiliation(s)
- A Lim
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Perna NT, Plunkett G, Burland V, Mau B, Glasner JD, Rose DJ, Mayhew GF, Evans PS, Gregor J, Kirkpatrick HA, Pósfai G, Hackett J, Klink S, Boutin A, Shao Y, Miller L, Grotbeck EJ, Davis NW, Lim A, Dimalanta ET, Potamousis KD, Apodaca J, Anantharaman TS, Lin J, Yen G, Schwartz DC, Welch RA, Blattner FR. erratum Genome sequence of enterohaemorrhagic Escherichia coli 0157:H7. Nature 2001. [DOI: 10.1038/35065664] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
26
|
Perna NT, Plunkett G, Burland V, Mau B, Glasner JD, Rose DJ, Mayhew GF, Evans PS, Gregor J, Kirkpatrick HA, Pósfai G, Hackett J, Klink S, Boutin A, Shao Y, Miller L, Grotbeck EJ, Davis NW, Lim A, Dimalanta ET, Potamousis KD, Apodaca J, Anantharaman TS, Lin J, Yen G, Schwartz DC, Welch RA, Blattner FR. Genome sequence of enterohaemorrhagic Escherichia coli O157:H7. Nature 2001; 409:529-33. [PMID: 11206551 DOI: 10.1038/35054089] [Citation(s) in RCA: 1470] [Impact Index Per Article: 63.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The bacterium Escherichia coli O157:H7 is a worldwide threat to public health and has been implicated in many outbreaks of haemorrhagic colitis, some of which included fatalities caused by haemolytic uraemic syndrome. Close to 75,000 cases of O157:H7 infection are now estimated to occur annually in the United States. The severity of disease, the lack of effective treatment and the potential for large-scale outbreaks from contaminated food supplies have propelled intensive research on the pathogenesis and detection of E. coli O157:H7 (ref. 4). Here we have sequenced the genome of E. coli O157:H7 to identify candidate genes responsible for pathogenesis, to develop better methods of strain detection and to advance our understanding of the evolution of E. coli, through comparison with the genome of the non-pathogenic laboratory strain E. coli K-12 (ref. 5). We find that lateral gene transfer is far more extensive than previously anticipated. In fact, 1,387 new genes encoded in strain-specific clusters of diverse sizes were found in O157:H7. These include candidate virulence factors, alternative metabolic capacities, several prophages and other new functions--all of which could be targets for surveillance.
Collapse
Affiliation(s)
- N T Perna
- Genome Center of Wisconsin, and Department of Animal Health and Biomedical Sciences, University of Wisconsin, Madison 53706, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Lai Z, Jing J, Aston C, Clarke V, Apodaca J, Dimalanta ET, Carucci DJ, Gardner MJ, Mishra B, Anantharaman TS, Paxia S, Hoffman SL, Craig Venter J, Huff EJ, Schwartz DC. A shotgun optical map of the entire Plasmodium falciparum genome. Nat Genet 1999; 23:309-13. [PMID: 10610179 DOI: 10.1038/15484] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The unicellular parasite Plasmodium falciparum is the cause of human malaria, resulting in 1.7-2.5 million deaths each year. To develop new means to treat or prevent malaria, the Malaria Genome Consortium was formed to sequence and annotate the entire 24.6-Mb genome. The plan, already underway, is to sequence libraries created from chromosomal DNA separated by pulsed-field gel electrophoresis (PFGE). The AT-rich genome of P. falciparum presents problems in terms of reliable library construction and the relative paucity of dense physical markers or extensive genetic resources. To deal with these problems, we reasoned that a high-resolution, ordered restriction map covering the entire genome could serve as a scaffold for the alignment and verification of sequence contigs developed by members of the consortium. Thus optical mapping was advanced to use simply extracted, unfractionated genomic DNA as its principal substrate. Ordered restriction maps (BamHI and NheI) derived from single molecules were assembled into 14 deep contigs corresponding to the molecular karyotype determined by PFGE (ref. 3).
Collapse
Affiliation(s)
- Z Lai
- W.M. Keck Laboratory for Biomolecular Imaging, Department of Chemistry, New York University, New York, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|