Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Rosenfeld JA, Mason CE, Smith TM. Limitations of the human reference genome for personalized genomics. PLoS One 2012;7:e40294. [PMID: 22811759 PMCID: PMC3394790 DOI: 10.1371/journal.pone.0040294] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2012] [Accepted: 06/07/2012] [Indexed: 11/19/2022] Open

For:	Rosenfeld JA, Mason CE, Smith TM. Limitations of the human reference genome for personalized genomics. PLoS One 2012;7:e40294. [PMID: 22811759 PMCID: PMC3394790 DOI: 10.1371/journal.pone.0040294] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2012] [Accepted: 06/07/2012] [Indexed: 11/19/2022] Open

Number

Cited by Other Article(s)

Rutter LA, MacKay MJ, Cope H, Szewczyk NJ, Kim J, Overbey E, Tierney BT, Muratani M, Lamm B, Bezdan D, Paul AM, Schmidt MA, Church GM, Giacomello S, Mason CE. Protective alleles and precision healthcare in crewed spaceflight. Nat Commun 2024;15:6158. [PMID: 39039045 PMCID: PMC11263583 DOI: 10.1038/s41467-024-49423-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 06/05/2024] [Indexed: 07/24/2024] Open

Affiliation(s)

Lindsay A Rutter Transborder Medical Research Center, University of Tsukuba, Ibaraki, 305-8575, Japan Department of Genome Biology, Institute of Medicine, University of Tsukuba, Ibaraki, 305-8575, Japan School of Chemistry, University of Glasgow, Glasgow, G12 8QQ, UK
Matthew J MacKay Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, 10021, USA The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, 10065, USA
Henry Cope School of Medicine, University of Nottingham, Nottingham, DE22 3DT, UK
Nathaniel J Szewczyk School of Medicine, University of Nottingham, Nottingham, DE22 3DT, UK Ohio Musculoskeletal and Neurological Institute (OMNI), Heritage College of Osteopathic Medicine, Ohio University, Athens, OH, 45701, USA
JangKeun Kim Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, 10021, USA
Eliah Overbey Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, 10021, USA
Braden T Tierney Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, 10021, USA
Masafumi Muratani Transborder Medical Research Center, University of Tsukuba, Ibaraki, 305-8575, Japan Department of Genome Biology, Institute of Medicine, University of Tsukuba, Ibaraki, 305-8575, Japan
Ben Lamm Colossal Biosciences, 1401 Lavaca St, Unit #155 Austin, Austin, TX, 78701, USA
Daniela Bezdan Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany NGS Competence Center Tübingen (NCCT), University of Tübingen, Tübingen, Germany Yuri GmbH, Meckenbeuren, Germany
Amber M Paul Embry-Riddle Aeronautical University, Department of Human Factors and Behavioral Neurobiology, Daytona Beach, FL, 32114, USA
Michael A Schmidt Sovaris Aerospace, Boulder, CO, 80302, USA. Advanced Pattern Analysis & Human Performance Group, Boulder, CO, 80302, USA.
George M Church GC Therapeutics Inc, Cambridge, MA, 02139, USA. Department of Genetics, Harvard Medical School, Boston, MA, 02115, USA. Wyss Institute for Biologically Inspired Engineering, Harvard University, Cambridge, MA, 02115, USA.
Stefania Giacomello SciLifeLab, KTH Royal Institute of Technology, Stockholm, 17165, Sweden.
Christopher E Mason Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA. The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, 10021, USA. The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, 10065, USA. Wyss Institute for Biologically Inspired Engineering, Harvard University, Cambridge, MA, 02115, USA. The Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, 10065, USA.

Collapse

Eynard SE, Klopp C, Canale-Tabet K, Marande W, Vandecasteele C, Roques C, Donnadieu C, Boone Q, Servin B, Vignal A. The black honey bee genome: insights on specific structural elements and a first step towards pangenomes. Genet Sel Evol 2024;56:51. [PMID: 38943059 PMCID: PMC11212449 DOI: 10.1186/s12711-024-00917-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 06/04/2024] [Indexed: 07/01/2024] Open

Abstract

BACKGROUND

The honey bee reference genome, HAv3.1, was produced from a commercial line sample that was thought to have a largely dominant Apis mellifera ligustica genetic background. Apis mellifera mellifera, often referred to as the black bee, has a separate evolutionary history and is the original type in western and northern Europe. Growing interest in this subspecies for conservation and non-professional apicultural practices, together with the necessity of deciphering genome backgrounds in hybrids, triggered the necessity for a specific genome assembly. Moreover, having several high-quality genomes is becoming key for taking structural variations into account in pangenome analyses.

RESULTS

Pacific Bioscience technology long reads were produced from a single haploid black bee drone. Scaffolding contigs into chromosomes was done using a high-density genetic map. This allowed for re-estimation of the recombination rate, which was over-estimated in some previous studies due to mis-assemblies, which resulted in spurious inversions in the older reference genomes. The sequence continuity obtained was very high and the only limit towards continuous chromosome-wide sequences seemed to be due to tandem repeat arrays that were usually longer than 10 kb and that belonged to two main families, the 371 and 91 bp repeats, causing problems in the assembly process due to high internal sequence similarity. Our assembly was used together with the reference genome to genotype two structural variants by a pangenome graph approach with Graphtyper2. Genotypes obtained were either correct or missing, when compared to an approach based on sequencing depth analysis, and genotyping rates were 89 and 76% for the two variants.

CONCLUSIONS

Our new assembly for the Apis mellifera mellifera honey bee subspecies demonstrates the utility of multiple high-quality genomes for the genotyping of structural variants, with a test case on two insertions and deletions. It will therefore be an invaluable resource for future studies, for instance by including structural variants in GWAS. Having used a single haploid drone for sequencing allowed a refined analysis of very large tandem repeat arrays, raising the question of their function in the genome. High quality genome assemblies for multiple subspecies such as presented here, are crucial for emerging projects using pangenomes.

Collapse

Cope H, Elsborg J, Demharter S, McDonald JT, Wernecke C, Parthasarathy H, Unadkat H, Chatrathi M, Claudio J, Reinsch S, Avci P, Zwart SR, Smith SM, Heer M, Muratani M, Meydan C, Overbey E, Kim J, Chin CR, Park J, Schisler JC, Mason CE, Szewczyk NJ, Willis CRG, Salam A, Beheshti A. Transcriptomics analysis reveals molecular alterations underpinning spaceflight dermatology. COMMUNICATIONS MEDICINE 2024;4:106. [PMID: 38862781 PMCID: PMC11166967 DOI: 10.1038/s43856-024-00532-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 05/23/2024] [Indexed: 06/13/2024] Open

Affiliation(s)

Henry Cope School of Medicine, University of Nottingham, Derby, DE22 3DT, UK
Jonas Elsborg Department of Energy Conversion and Storage, Technical University of Denmark, 2800, Kongens Lyngby, Denmark Abzu, Copenhagen, 2150, Denmark
Samuel Demharter Abzu, Copenhagen, 2150, Denmark
J Tyson McDonald Department of Radiation Medicine, School of Medicine, Georgetown University, Washington D.C., WA, 20057, USA
Chiara Wernecke NASA GeneLab For High Schools Program (GL4HS), Space Biology Program, NASA Ames Research Center, Moffett Field, CA, USA Department of Aerospace and Geodesy, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
Hari Parthasarathy NASA GeneLab For High Schools Program (GL4HS), Space Biology Program, NASA Ames Research Center, Moffett Field, CA, USA College of Engineering and Haas School of Business, University of California, Berkeley, Berkeley, CA, 94720, USA
Hriday Unadkat NASA GeneLab For High Schools Program (GL4HS), Space Biology Program, NASA Ames Research Center, Moffett Field, CA, USA School of Engineering and Applied Science, Princeton University, Princeton, NJ, 08540, USA
Mira Chatrathi NASA GeneLab For High Schools Program (GL4HS), Space Biology Program, NASA Ames Research Center, Moffett Field, CA, USA College of Letters and Science, University of California, Berkeley, Berkeley, CA, 94720, USA
Jennifer Claudio NASA GeneLab For High Schools Program (GL4HS), Space Biology Program, NASA Ames Research Center, Moffett Field, CA, USA Blue Marble Space Institute of Science, Space Biosciences Division, NASA Ames Research Center, Moffett field, CA, USA
Sigrid Reinsch NASA GeneLab For High Schools Program (GL4HS), Space Biology Program, NASA Ames Research Center, Moffett Field, CA, USA Space Biosciences Division, NASA Ames Research Center, Moffett field, CA, USA
Pinar Avci Department of Dermatology and Allergy, University Hospital, LMU Munich, 80337, Munich, Germany
Sara R Zwart University of Texas Medical Branch, Galveston, TX, USA
Scott M Smith Biomedical Research and Environmental Sciences Division, Human Health and Performance Directorate, NASA Johnson Space Center, Houston, TX, 77058, USA
Martina Heer IU International University of Applied Sciences, Erfurt and University of Bonn, Bonn, Germany
Masafumi Muratani Transborder Medical Research Center, University of Tsukuba, Ibaraki, 305-8575, Japan Department of Genome Biology, Institute of Medicine, University of Tsukuba, Ibaraki, 305-8575, Japan
Cem Meydan Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, New York, NY, USA
Eliah Overbey Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, New York, NY, USA
Jangkeun Kim Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, New York, NY, USA
Christopher R Chin Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, New York, NY, USA
Jiwoon Park Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, New York, NY, USA Laboratory of Virology and Infectious Disease, The Rockefeller University, New York, NY, 10065, USA
Jonathan C Schisler McAllister Heart Institute and Department of Pharmacology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Christopher E Mason Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, New York, NY, USA Laboratory of Virology and Infectious Disease, The Rockefeller University, New York, NY, 10065, USA
Nathaniel J Szewczyk School of Medicine, University of Nottingham, Derby, DE22 3DT, UK Ohio Musculoskeletal and Neurological Institute, Heritage College of Osteopathic Medicine, Ohio University, Athens, OH, 45701, USA
Craig R G Willis School of Chemistry and Biosciences, Faculty of Life Sciences, University of Bradford, Bradford, BD7 1DP, UK
Amr Salam St John's Institute of Dermatology, King's College London, Guy's and St Thomas' NHS Foundation Trust, Guy's Hospital, Great Maze Pond, London, SE1 9RT, UK
Afshin Beheshti Blue Marble Space Institute of Science, Space Biosciences Division, NASA Ames Research Center, Moffett field, CA, USA. Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Collapse

Seylani A, Galsinh AS, Tasoula A, I AR, Camera A, Calleja-Agius J, Borg J, Goel C, Kim J, Clark KB, Das S, Arif S, Boerrigter M, Coffey C, Szewczyk N, Mason CE, Manoli M, Karouia F, Schwertz H, Beheshti A, Tulodziecki D. Ethical considerations for the age of non-governmental space exploration. Nat Commun 2024;15:4774. [PMID: 38862473 PMCID: PMC11166968 DOI: 10.1038/s41467-023-44357-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 12/05/2023] [Indexed: 06/13/2024] Open

Affiliation(s)

Allen Seylani School of Medicine, University of California, Riverside. 92521 Botanical Garden Dr, Riverside, CA, 92507, USA
Aman Singh Galsinh School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Aberdeen, AB24 3FX, UK
Alexia Tasoula Department of Life Science Engineering, FH Technikum, Vienna, Austria Heritage College of Osteopathic Medicine, Ohio University, Athens, OH, USA
Anu R I Department of Cancer Biology and Therapeutics, MVR Cancer Centre and Research Institute, Calicut, India Department of Clinical Biochemistry, MVR Cancer Centre and Research Institute, Calicut, India
Andrea Camera Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
Jean Calleja-Agius Department of Anatomy, Faculty of Medicine and Surgery, University of Malta, MSD2080, Msida, Malta
Joseph Borg Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, MSD2080, Msida, Malta
Chirag Goel Northwestern University Feinberg School of Medicine, Chicago, IL, USA
JangKeun Kim Department of Physiology & Biophysics, Weill Cornell Medicine, New York, NY, USA
Kevin B Clark Cures Within Reach, Chicago, IL, 60602, USA Peace Innovation Institute, The Hague 2511, Netherlands & Stanford University, Palo Alto, CA, 94305, USA Biometrics and Nanotechnology Councils, Institute for Electrical and Electronics Engineers, New York, NY, 10016-5997, USA
Saswati Das Department of Biochemistry, Atal Bihari Vajpayee Institute of Medical Sciences, New Delhi, India
Shehbeel Arif Center for Data-Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Michael Boerrigter Deep Space Biology, San Francisco, CA, USA
Caroline Coffey Heritage College of Osteopathic Medicine, Ohio University, Athens, OH, USA
Nathaniel Szewczyk Heritage College of Osteopathic Medicine, Ohio University, Athens, OH, USA
Christopher E Mason Department of Physiology & Biophysics, Weill Cornell Medicine, New York, NY, USA
Maria Manoli School of Law, University of Aberdeen, Aberdeen, AB24 3UB, UK
Fathi Karouia Blue Marble Space Institute for Science, Exobiology Branch, NASA Ames Research Center, Moffett Field, CA, USA Space Research Within Reach, San Francisco, CA, USA Center for Space Medicine, Baylor College of Medicine, Houston, TX, USA
Hansjörg Schwertz Molecular Medicine Program at the University of Utah, Salt Lake City, UT, 84112, USA. Division of Occupational Medicine at the University of Utah, Salt Lake City, UT, 84112, USA. Occupational Medicine at Billings Clinic Bozeman, Bozeman, MT, 59715, USA.
Afshin Beheshti Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA. Blue Marble Space Institute of Science, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA, US.
Dana Tulodziecki Department of Philosophy, Purdue University, West Lafayette, IN, USA.

Collapse

Spillmann RC, Tan QKG, Reuter C, Schoch K, Kohler J, Bonner D, Zastrow D, Alkelai A, Baugh E, Cope H, Marwaha S, Wheeler MT, Bernstein JA, Shashi V. A concurrent dual analysis of genomic data augments diagnoses: Experiences of 2 clinical sites in the Undiagnosed Diseases Network. Genet Med 2023;25:100353. [PMID: 36481303 PMCID: PMC10506157 DOI: 10.1016/j.gim.2022.12.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 11/28/2022] [Accepted: 12/01/2022] [Indexed: 12/12/2022] Open

Affiliation(s)

Rebecca C Spillmann Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC
Queenie K-G Tan Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC
Chloe Reuter Stanford Center for Inherited Cardiovascular Disease, Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA; Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Kelly Schoch Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC
Jennefer Kohler Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Devon Bonner Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Diane Zastrow Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Anna Alkelai Institute for Genome Medicine, Columbia University Medical Center, New York, NY
Evan Baugh Institute for Genome Medicine, Columbia University Medical Center, New York, NY
Heidi Cope Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC
Shruti Marwaha Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Matthew T Wheeler Stanford Center for Inherited Cardiovascular Disease, Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA; Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Jonathan A Bernstein Stanford Center for Undiagnosed Diseases, Stanford University, and Department of Pediatrics, Stanford University School of Medicine, Stanford, CA
Vandana Shashi Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC.

Collapse

Park J, Kim J, Lewy T, Rice CM, Elemento O, Rendeiro AF, Mason CE. Spatial omics technologies at multimodal and single cell/subcellular level. Genome Biol 2022;23:256. [PMID: 36514162 PMCID: PMC9746133 DOI: 10.1186/s13059-022-02824-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 11/29/2022] [Indexed: 12/15/2022] Open

Muñoz-Barrera A, Rubio-Rodríguez LA, Díaz-de Usera A, Jáspez D, Lorenzo-Salazar JM, González-Montelongo R, García-Olivares V, Flores C. From Samples to Germline and Somatic Sequence Variation: A Focus on Next-Generation Sequencing in Melanoma Research. Life (Basel) 2022;12:1939. [PMID: 36431075 PMCID: PMC9695713 DOI: 10.3390/life12111939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 11/12/2022] [Accepted: 11/16/2022] [Indexed: 11/24/2022] Open

Xiao C, Chen Z, Chen W, Padilla C, Colgan M, Wu W, Fang LT, Liu T, Yang Y, Schneider V, Wang C, Xiao W. Personalized genome assembly for accurate cancer somatic mutation discovery using tumor-normal paired reference samples. Genome Biol 2022;23:237. [PMID: 36352452 PMCID: PMC9648002 DOI: 10.1186/s13059-022-02803-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 10/25/2022] [Indexed: 11/11/2022] Open

Hertzog A, Selvanathan A, Devanapalli B, Ho G, Bhattacharya K, Tolun AA. A narrative review of metabolomics in the era of "-omics": integration into clinical practice for inborn errors of metabolism. Transl Pediatr 2022;11:1704-1716. [PMID: 36345452 PMCID: PMC9636448 DOI: 10.21037/tp-22-105] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Accepted: 08/23/2022] [Indexed: 11/29/2022] Open

Tetikol HS, Turgut D, Narci K, Budak G, Kalay O, Arslan E, Demirkaya-Budak S, Dolgoborodov A, Kabakci-Zorlu D, Semenyuk V, Jain A, Davis-Dusenbery BN. Pan-African genome demonstrates how population-specific genome graphs improve high-throughput sequencing data analysis. Nat Commun 2022;13:4384. [PMID: 35927245 PMCID: PMC9352875 DOI: 10.1038/s41467-022-31724-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 06/30/2022] [Indexed: 11/29/2022] Open

Kaminow B, Ballouz S, Gillis J, Dobin A. Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses. Genome Res 2022;32:738-749. [PMID: 35256454 PMCID: PMC8997357 DOI: 10.1101/gr.275613.121] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 03/02/2022] [Indexed: 11/25/2022]

Moore KJM, Cahill J, Aidelberg G, Aronoff R, Bektaş A, Bezdan D, Butler DJ, Chittur SV, Codyre M, Federici F, Tanner NA, Tighe SW, True R, Ware SB, Wyllie AL, Afshin EE, Bendesky A, Chang CB, Dela Rosa R, Elhaik E, Erickson D, Goldsborough AS, Grills G, Hadasch K, Hayden A, Her SY, Karl JA, Kim CH, Kriegel AJ, Kunstman T, Landau Z, Land K, Langhorst BW, Lindner AB, Mayer BE, McLaughlin LA, McLaughlin MT, Molloy J, Mozsary C, Nadler JL, D'Silva M, Ng D, O'Connor DH, Ongerth JE, Osuolale O, Pinharanda A, Plenker D, Ranjan R, Rosbash M, Rotem A, Segarra J, Schürer S, Sherrill-Mix S, Solo-Gabriele H, To S, Vogt MC, Yu AD, Mason CE. Loop-Mediated Isothermal Amplification Detection of SARS-CoV-2 and Myriad Other Applications. J Biomol Tech 2021;32:228-275. [PMID: 35136384 PMCID: PMC8802757 DOI: 10.7171/jbt.21-3203-017] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Abstract

As the second year of the COVID-19 pandemic begins, it remains clear that a massive increase in the ability to test for SARS-CoV-2 infections in a myriad of settings is critical to controlling the pandemic and to preparing for future outbreaks. The current gold standard for molecular diagnostics is the polymerase chain reaction (PCR), but the extraordinary and unmet demand for testing in a variety of environments means that both complementary and supplementary testing solutions are still needed. This review highlights the role that loop-mediated isothermal amplification (LAMP) has had in filling this global testing need, providing a faster and easier means of testing, and what it can do for future applications, pathogens, and the preparation for future outbreaks. This review describes the current state of the art for research of LAMP-based SARS-CoV-2 testing, as well as its implications for other pathogens and testing. The authors represent the global LAMP (gLAMP) Consortium, an international research collective, which has regularly met to share their experiences on LAMP deployment and best practices; sections are devoted to all aspects of LAMP testing, including preanalytic sample processing, target amplification, and amplicon detection, then the hardware and software required for deployment are discussed, and finally, a summary of the current regulatory landscape is provided. Included as well are a series of first-person accounts of LAMP method development and deployment. The final discussion section provides the reader with a distillation of the most validated testing methods and their paths to implementation. This review also aims to provide practical information and insight for a range of audiences: for a research audience, to help accelerate research through sharing of best practices; for an implementation audience, to help get testing up and running quickly; and for a public health, clinical, and policy audience, to help convey the breadth of the effect that LAMP methods have to offer.

Collapse

Affiliation(s)

Keith J M Moore School of Science and Engineering, Ateneo de Manila University, Quezon City 1108, Philippines
Jeremy Cahill Metamer Labs, Boston, MA 02111-1929, USA
Guy Aidelberg Université de Paris, INSERM U1284, Center for Research and Interdisciplinarity (CRI), 75006 Paris, France Just One Giant Lab, Centre de Recherches Interdisciplinaires (CRI), 75004 Paris, France
Rachel Aronoff Just One Giant Lab, Centre de Recherches Interdisciplinaires (CRI), 75004 Paris, France Action for Genomic Integrity Through Research! (AGiR!), Lausanne, Switzerland Association Hackuarium, Lausanne, Switzerland
Ali Bektaş Oakland Genomics Center, Oakland, CA 94609, USA
Daniela Bezdan Institute of Medical Genetics and Applied Genomics, University of Tübingen, 72076 Tübingen, Germany NGS Competence Center Tübingen (NCCT), University of Tübingen, 72076 Tübingen, Germany Poppy Health, Inc, San Francisco, CA 94158, USA Institute of Medical Virology and Epidemiology of Viral Diseases, University Hospital, 72076 Tübingen, Germany
Daniel J Butler Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY 10065, USA
Sridar V Chittur Center for Functional Genomics, Department of Biomedical Sciences, School of Public Health, University at Albany, State University of New York, Rensselaer, 12222, USA
Martin Codyre GiantLeap Biotechnology Ltd, Wicklow A63 Kv91, Ireland
Fernan Federici ANID, Millennium Science Initiative Program, Millennium Institute for Integrative Biology (iBio), Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 8331150, Chile
Nathan A Tanner New England Biolabs, Ipswich, MA 01938, USA
Scott W Tighe University of Vermont, Burlington, 05405, USA
Randy True FloodLAMP Biotechnologies, San Carlos, CA 94070, USA
Sarah B Ware Just One Giant Lab, Centre de Recherches Interdisciplinaires (CRI), 75004 Paris, France BioBlaze Community Bio Lab, 1800 W Hawthorne Ln, Ste J-1, West Chicago, IL 60185, USA Blossom Bio Lab, 1800 W Hawthorne Ln, Ste K-2, West Chicago, IL 60185, USA
Anne L Wyllie Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT 06510, USA
Evan E Afshin Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY 10065, USA The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY 10065, USA
Andres Bendesky Department of Ecology, Evolution and Environmental Biology, Columbia University, New York, NY 10027, USA Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA
Connie B Chang Department of Chemical and Biological Engineering, Montana State University, Bozeman, 59717, USA Center for Biofilm Engineering, Montana State University, Bozeman, 59717, USA
Richard Dela Rosa School of Science and Engineering, Ateneo de Manila University, Quezon City 1108, Philippines
Eran Elhaik Department of Biology, Lund University, Sölvegatan 35, Lund, Sweden
David Erickson Sibley School of Mechanical and Aerospace Engineering, Cornell University, Ithaca, NY 14850, USA
Andrew S Goldsborough RNAssist Ltd, Cambridge CB1 1EE, England
George Grills Department of Microbiology, University of Pennsylvania, Philadelphia, 19104, USA
Kathrin Hadasch Université de Paris, INSERM U1284, Center for Research and Interdisciplinarity (CRI), 75006 Paris, France Department of Biology, Membrane Biophysics, Technische Universität Darmstadt, 64289 Darmstadt, Germany Lab3 eV, Labspace Darmstadt, 64295 Darmstadt, Germany IANUS Verein für Friedensorientierte Technikgestaltung eV, 64289 Darmstadt, Germany
Andrew Hayden Center for Functional Genomics, Department of Biomedical Sciences, School of Public Health, University at Albany, State University of New York, Rensselaer, 12222, USA
Seong-Young Her Metamer Labs, Boston, MA 02111-1929, USA
Julie A Karl Department of Pathology and Laboratory Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, Madison 53705, USA
Chang Hee Kim GoDx, Inc, Madison, WI 53719, USA
Alison J Kriegel Medical College of Wisconsin, Milwaukee, 53226, USA
Thomas Kunstman University of Colorado, Boulder, CO 80309, USA
Zeph Landau Department of Computer Science, University of California, Berkeley, Berkeley, 94720, USA
Kevin Land Mologic, Centre for Advanced Rapid Diagnostics, (CARD), Bedford Technology Park, Thurleigh MK44 2YA, England Department of Electrical, Electronic and Computer Engineering, University of Pretoria, 0028 Pretoria, South Africa
Bradley W Langhorst New England Biolabs, Ipswich, MA 01938, USA
Ariel B Lindner Université de Paris, INSERM U1284, Center for Research and Interdisciplinarity (CRI), 75006 Paris, France
Benjamin E Mayer Department of Biology, Membrane Biophysics, Technische Universität Darmstadt, 64289 Darmstadt, Germany Lab3 eV, Labspace Darmstadt, 64295 Darmstadt, Germany
Lee A McLaughlin IC Mobile Lab Ltd, SpacePortx, M1 1DZ Manchester, England
Matthew T McLaughlin Department of Pathology and Laboratory Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, Madison 53705, USA
Jenny Molloy Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, England
Christopher Mozsary Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY 10065, USA
Jerry L Nadler Department of Pharmacology, New York Medical College, Valhalla, 10595, USA
Melinee D'Silva Department of Pharmacology, New York Medical College, Valhalla, 10595, USA
David Ng Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA
David H O'Connor Department of Pathology and Laboratory Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, Madison 53705, USA
Jerry E Ongerth University of Wollongong, Environmental Engineering, Wollongong NSW 2522, Australia
Olayinka Osuolale Applied Environmental Metagenomics and Infectious Diseases Research (AEMIDR), Department of Biological Sciences, Elizade University, Ilara Mokin, Nigeria
Ana Pinharanda Department of Biological Sciences, Columbia University, New York, NY 10027, USA
Dennis Plenker Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Ravi Ranjan Genomics Resource Laboratory, Institute for Applied Life Sciences, University of Massachusetts, Amherst, 01003, USA
Michael Rosbash Howard Hughes Medical Institute and Department of Biology, Brandeis University, Waltham, MA 02453, USA
Assaf Rotem Metamer Labs, Boston, MA 02111-1929, USA
Jacob Segarra Metamer Labs, Boston, MA 02111-1929, USA
Stephan Schürer University of Florida, Miami, 33146, USA
Scott Sherrill-Mix Department of Microbiology, University of Pennsylvania, Philadelphia, 19104, USA
Helena Solo-Gabriele University of Florida, Miami, 33146, USA
Shaina To School of Science and Engineering, Ateneo de Manila University, Quezon City 1108, Philippines
Merly C Vogt Department of Biological Sciences, Columbia University, New York, NY 10027, USA
Albert D Yu Howard Hughes Medical Institute and Department of Biology, Brandeis University, Waltham, MA 02453, USA
Christopher E Mason Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10065, USA The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY 10065, USA The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY 10065, USA The Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY 10065, USA

Collapse

Daw Elbait G, Henschel A, Tay GK, Al Safar HS. A Population-Specific Major Allele Reference Genome From The United Arab Emirates Population. Front Genet 2021;12:660428. [PMID: 33968136 PMCID: PMC8102833 DOI: 10.3389/fgene.2021.660428] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/19/2021] [Indexed: 12/30/2022] Open

Du L, Liu K, Yao X, Risacher SL, Han J, Saykin AJ, Guo L, Shen L. Multi-Task Sparse Canonical Correlation Analysis with Application to Multi-Modal Brain Imaging Genetics. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:227-239. [PMID: 31634139 PMCID: PMC7156329 DOI: 10.1109/tcbb.2019.2947428] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Lee H, Shuaibi A, Bell JM, Pavlichin DS, Ji HP. Unique k-mer sequences for validating cancer-related substitution, insertion and deletion mutations. NAR Cancer 2020;2:zcaa034. [PMID: 33345188 PMCID: PMC7727745 DOI: 10.1093/narcan/zcaa034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 10/23/2020] [Accepted: 11/12/2020] [Indexed: 12/26/2022] Open

Swart Y, van Eeden G, Sparks A, Uren C, Möller M. Prospective avenues for human population genomics and disease mapping in southern Africa. Mol Genet Genomics 2020;295:1079-1089. [PMID: 32440765 PMCID: PMC7240165 DOI: 10.1007/s00438-020-01684-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 05/06/2020] [Indexed: 12/22/2022]

Balachandran P, Beck CR. Structural variant identification and characterization. Chromosome Res 2020;28:31-47. [PMID: 31907725 PMCID: PMC7131885 DOI: 10.1007/s10577-019-09623-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 10/15/2019] [Accepted: 11/24/2019] [Indexed: 01/06/2023]

Roddy AC, Jurek-Loughrey A, Souza J, Gilmore A, O’Reilly PG, Stupnikov A, Gonzalez de Castro D, Prise KM, Salto-Tellez M, McArt DG. NUQA: Estimating Cancer Spatial and Temporal Heterogeneity and Evolution through Alignment-Free Methods. Mol Biol Evol 2019;36:2883-2889. [PMID: 31424551 PMCID: PMC6878956 DOI: 10.1093/molbev/msz182] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Ibrahim O, Sutherland HG, Haupt LM, Griffiths LR. Saliva as a comparable-quality source of DNA for Whole Exome Sequencing on Ion platforms. Genomics 2019;112:1437-1443. [PMID: 31445087 DOI: 10.1016/j.ygeno.2019.08.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2019] [Revised: 08/05/2019] [Accepted: 08/19/2019] [Indexed: 11/17/2022]

Ballouz S, Dobin A, Gillis JA. Is it time to change the reference genome? Genome Biol 2019;20:159. [PMID: 31399121 PMCID: PMC6688217 DOI: 10.1186/s13059-019-1774-4] [Citation(s) in RCA: 97] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Du L, Liu K, Yao X, Risacher SL, Han J, Guo L, Saykin AJ, Shen L. Fast Multi-Task SCCA Learning with Feature Selection for Multi-Modal Brain Imaging Genetics. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2019;2018:356-361. [PMID: 30881731 DOI: 10.1109/bibm.2018.8621298] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Kaiser VB, Semple CA. Chromatin loop anchors are associated with genome instability in cancer and recombination hotspots in the germline. Genome Biol 2018;19:101. [PMID: 30060743 PMCID: PMC6066925 DOI: 10.1186/s13059-018-1483-4] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Accepted: 07/13/2018] [Indexed: 01/07/2023] Open

Abstract

Background

Chromatin loops form a basic unit of interphase nuclear organization, with chromatin loop anchor points providing contacts between regulatory regions and promoters. However, the mutational landscape at these anchor points remains under-studied. Here, we describe the unusual patterns of somatic mutations and germline variation associated with loop anchor points and explore the underlying features influencing these patterns.

Results

Analyses of whole genome sequencing datasets reveal that anchor points are strongly depleted for single nucleotide variants (SNVs) in tumours. Despite low SNV rates in their genomic neighbourhood, anchor points emerge as sites of evolutionary innovation, showing enrichment for structural variant (SV) breakpoints and a peak of SNVs at focal CTCF sites within the anchor points. Both CTCF-bound and non-CTCF anchor points harbour an excess of SV breakpoints in multiple tumour types and are prone to double-strand breaks in cell lines. Common fragile sites, which are hotspots for genome instability, also show elevated numbers of intersecting loop anchor points. Recurrently disrupted anchor points are enriched for genes with functions in cell cycle transitions and regions associated with predisposition to cancer. We also discover a novel class of CTCF-bound anchor points which overlap meiotic recombination hotspots and are enriched for the core PRDM9 binding motif, suggesting that the anchor points have been foci for diversity generated during recent human evolution.

Conclusions

We suggest that the unusual chromatin environment at loop anchor points underlies the elevated rates of variation observed, marking them as sites of regulatory importance but also genomic fragility.

Electronic supplementary material

The online version of this article (10.1186/s13059-018-1483-4) contains supplementary material, which is available to authorized users.

Collapse

Toubia J, Conn VM, Conn SJ. Don't go in circles: confounding factors in gene expression profiling. EMBO J 2018;37:embj.201797945. [PMID: 29735571 DOI: 10.15252/embj.201797945] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open

Borghesi A, Mencarelli MA, Memo L, Ferrero GB, Bartuli A, Genuardi M, Stronati M, Villani A, Renieri A, Corsello G. Intersociety policy statement on the use of whole-exome sequencing in the critically ill newborn infant. Ital J Pediatr 2017;43:100. [PMID: 29100554 PMCID: PMC5670717 DOI: 10.1186/s13052-017-0418-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 10/17/2017] [Indexed: 01/05/2023] Open

Abstract

The rapid advancement of next-generation sequencing (NGS) technology and the decrease in costs for whole-exome sequencing (WES) and whole-genome sequening (WGS), has prompted its clinical application in several fields of medicine. Currently, there are no specific guidelines for the use of NGS in the field of neonatal medicine and in the diagnosis of genetic diseases in critically ill newborn infants. As a consequence, NGS may be underused with reduced diagnostic success rate, or overused, with increased costs for the healthcare system. Most genetic diseases may be already expressed during the neonatal age, but their identification may be complicated by nonspecific presentation, especially in the setting of critical clinical conditions. The differential diagnosis process in the neonatal intensive care unit (NICU) may be time-consuming, uncomfortable for the patient due to repeated sampling, and ineffective in reaching a molecular diagnosis during NICU stay. Serial gene sequencing (Sanger sequencing) may be successful only for conditions for which the clinical phenotype strongly suggests a diagnostic hypothesis and for genetically homogeneous diseases. Newborn screenings with Guthrie cards, which vary from country to country, are designed to only test for a few dozen genetic diseases out of the more than 6000 diseases for which a genetic characterization is available. The use of WES in selected cases in the NICU may overcome these issues. We present an intersociety document that aims to define the best indications for the use of WES in different clinical scenarios in the NICU. We propose that WES is used in the NICU for critically ill newborn infants when an early diagnosis is desirable to guide the clinical management during NICU stay, when a strong hypothesis cannot be formulated based on the clinical phenotype or the disease is genetically heterogeneous, and when specific non-genetic laboratory tests are not available. The use of WES may reduce the time for diagnosis in infants during NICU stay and may eventually result in cost-effectiveness.

Collapse

Worthey EA. Analysis and Annotation of Whole-Genome or Whole-Exome Sequencing Derived Variants for Clinical Diagnosis. ACTA ACUST UNITED AC 2017;95:9.24.1-9.24.28. [PMID: 29044471 DOI: 10.1002/cphg.49] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Highly Variable Genomic Landscape of Endogenous Retroviruses in the C57BL/6J Inbred Strain, Depending on Individual Mouse, Gender, Organ Type, and Organ Location. Int J Genomics 2017;2017:3152410. [PMID: 28951865 PMCID: PMC5603323 DOI: 10.1155/2017/3152410] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 06/16/2017] [Accepted: 07/03/2017] [Indexed: 11/17/2022] Open

Schneider VA, Graves-Lindsay T, Howe K, Bouk N, Chen HC, Kitts PA, Murphy TD, Pruitt KD, Thibaud-Nissen F, Albracht D, Fulton RS, Kremitzki M, Magrini V, Markovic C, McGrath S, Steinberg KM, Auger K, Chow W, Collins J, Harden G, Hubbard T, Pelan S, Simpson JT, Threadgold G, Torrance J, Wood JM, Clarke L, Koren S, Boitano M, Peluso P, Li H, Chin CS, Phillippy AM, Durbin R, Wilson RK, Flicek P, Eichler EE, Church DM. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res 2017;27:849-864. [PMID: 28396521 PMCID: PMC5411779 DOI: 10.1101/gr.213611.116] [Citation(s) in RCA: 569] [Impact Index Per Article: 81.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Accepted: 03/14/2017] [Indexed: 11/24/2022]

Affiliation(s)

Valerie A Schneider National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Tina Graves-Lindsay McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Kerstin Howe Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Nathan Bouk National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Hsiu-Chuan Chen National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Paul A Kitts National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Terence D Murphy National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Kim D Pruitt National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Françoise Thibaud-Nissen National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
Derek Albracht McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Robert S Fulton McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Milinn Kremitzki McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Vincent Magrini McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Chris Markovic McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Sean McGrath McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Karyn Meltz Steinberg McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Kate Auger Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
William Chow Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Joanna Collins Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Glenn Harden Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Timothy Hubbard Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Sarah Pelan Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Jared T Simpson Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Glen Threadgold Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
James Torrance Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Jonathan M Wood Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Laura Clarke European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
Sergey Koren National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Matthew Boitano Pacific Biosciences, Menlo Park, California 94025, USA
Paul Peluso Pacific Biosciences, Menlo Park, California 94025, USA
Heng Li Broad Institute, Cambridge, Massachusetts 02142, USA
Chen-Shan Chin Pacific Biosciences, Menlo Park, California 94025, USA
Adam M Phillippy National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Richard Durbin Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
Richard K Wilson McDonnell Genome Institute at Washington University, St. Louis, Missouri 63018, USA
Paul Flicek European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
Evan E Eichler Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
Deanna M Church National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA

Collapse

Mason CE, Afshinnekoo E, Tighe S, Wu S, Levy S. International Standards for Genomes, Transcriptomes, and Metagenomes. J Biomol Tech 2017;28:8-18. [PMID: 28337071 PMCID: PMC5359768 DOI: 10.7171/jbt.17-2801-006] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Ahsanuddin S, Afshinnekoo E, Gandara J, Hakyemezoğlu M, Bezdan D, Minot S, Greenfield N, Mason CE. Assessment of REPLI-g Multiple Displacement Whole Genome Amplification (WGA) Techniques for Metagenomic Applications. J Biomol Tech 2017;28:46-55. [PMID: 28344519 DOI: 10.7171/jbt.17-2801-008] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Abstract

Amplification of minute quantities of DNA is a fundamental challenge in low-biomass metagenomic and microbiome studies because of potential biases in coverage, guanine-cytosine (GC) content, and altered species abundances. Whole genome amplification (WGA), although widely used, is notorious for introducing artifact sequences, either by amplifying laboratory contaminants or by nonrandom amplification of a sample's DNA. In this study, we investigate the effect of REPLI-g multiple displacement amplification (MDA; Qiagen, Valencia, CA, USA) on sequencing data quality and species abundance detection in 8 paired metagenomic samples and 1 titrated, mixed control sample. We extracted and sequenced genomic DNA (gDNA) from 8 environmental samples and compared the quality of the sequencing data for the MDA and their corresponding non-MDA samples. The degree of REPLI-g MDA bias was evaluated by sequence metrics, species composition, and cross-validating observed species abundance and species diversity estimates using the One Codex and MetaPhlAn taxonomic classification tools. Here, we provide evidence of the overall efficacy of REPLI-g MDA on retaining sequencing data quality and species abundance measurements while providing increased yields of high-fidelity DNA. We find that species abundance estimates are largely consistent across samples, even with REPLI-g amplification, as demonstrated by the Spearman's rank order coefficient (R² > 0.8). However, REPLI-g MDA often produced fewer classified reads at the species, genera, and family level, resulting in decreased species diversity. We also observed some areas with the PCR "jackpot effect," with varying input DNA values for the Metagenomics Research Group (MGRG) controls at specific genomic loci. We visualize this effect in whole genome coverage plots and with sequence composition analyses and note these caveats of the MDA method. Despite overall concordance of species abundance between the amplified and unamplified samples, these results demonstrate that amplification of DNA using the REPLI-g method has some limitations. These concerns could be addressed by future improvements in the enzymes or methods for REPLI-g to be considered a >99% robust method for increasing the amount of high-fidelity DNA from low-biomass samples or at the very least, accounted for during computational analysis of MDA samples.

Collapse

Afshinnekoo E, Chou C, Alexander N, Ahsanuddin S, Schuetz AN, Mason CE. Precision Metagenomics: Rapid Metagenomic Analyses for Infectious Disease Diagnostics and Public Health Surveillance. J Biomol Tech 2017;28:40-45. [PMID: 28337072 DOI: 10.7171/jbt.17-2801-007] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

GenePANDA-a novel network-based gene prioritizing tool for complex diseases. Sci Rep 2017;7:43258. [PMID: 28252032 PMCID: PMC5333103 DOI: 10.1038/srep43258] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2016] [Accepted: 01/23/2017] [Indexed: 02/08/2023] Open

A Fast SCCA Algorithm for Big Data Analysis in Brain Imaging Genetics. GRAPHS IN BIOMEDICAL IMAGE ANALYSIS, COMPUTATIONAL ANATOMY AND IMAGING GENETICS 2017. [DOI: 10.1007/978-3-319-67675-3_19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Brumme CJ, Poon AFY. Promises and pitfalls of Illumina sequencing for HIV resistance genotyping. Virus Res 2016;239:97-105. [PMID: 27993623 DOI: 10.1016/j.virusres.2016.12.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 12/15/2016] [Accepted: 12/15/2016] [Indexed: 12/13/2022]

An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes. Nat Commun 2016;7:13637. [PMID: 27882922 PMCID: PMC5123046 DOI: 10.1038/ncomms13637] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Accepted: 10/18/2016] [Indexed: 12/20/2022] Open

Comparing genetic variants detected in the 1000 genomes project with SNPs determined by the International HapMap Consortium. J Genet 2016;94:731-40. [PMID: 26690529 DOI: 10.1007/s12041-015-0588-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Vicini P, Fields O, Lai E, Litwack ED, Martin AM, Morgan TM, Pacanowski MA, Papaluca M, Perez OD, Ringel MS, Robson M, Sakul H, Vockley J, Zaks T, Dolsten M, Søgaard M. Precision medicine in the age of big data: The present and future role of large-scale unbiased sequencing in drug discovery and development. Clin Pharmacol Ther 2015;99:198-207. [PMID: 26536838 DOI: 10.1002/cpt.293] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 10/30/2015] [Indexed: 12/15/2022]

Ni G, Strom TM, Pausch H, Reimer C, Preisinger R, Simianer H, Erbe M. Comparison among three variant callers and assessment of the accuracy of imputation from SNP array data to whole-genome sequence level in chicken. BMC Genomics 2015;16:824. [PMID: 26486989 PMCID: PMC4618161 DOI: 10.1186/s12864-015-2059-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Accepted: 10/09/2015] [Indexed: 12/21/2022] Open

Abstract

BACKGROUND

The technical progress in the last decade has made it possible to sequence millions of DNA reads in a relatively short time frame. Several variant callers based on different algorithms have emerged and have made it possible to extract single nucleotide polymorphisms (SNPs) out of the whole-genome sequence. Often, only a few individuals of a population are sequenced completely and imputation is used to obtain genotypes for all sequence-based SNP loci for other individuals, which have been genotyped for a subset of SNPs using a genotyping array.

METHODS

First, we compared the sets of variants detected with different variant callers, namely GATK, freebayes and SAMtools, and checked the quality of genotypes of the called variants in a set of 50 fully sequenced white and brown layers. Second, we assessed the imputation accuracy (measured as the correlation between imputed and true genotype per SNP and per individual, and genotype conflict between father-progeny pairs) when imputing from high density SNP array data to whole-genome sequence using data from around 1000 individuals from six different generations. Three different imputation programs (Minimac, FImpute and IMPUTE2) were checked in different validation scenarios.

RESULTS

There were 1,741,573 SNPs detected by all three callers on the studied chromosomes 3, 6, and 28, which was 71.6 % (81.6 %, 88.0 %) of SNPs detected by GATK (SAMtools, freebayes) in total. Genotype concordance (GC) defined as the proportion of individuals whose array-derived genotypes are the same as the sequence-derived genotypes over all non-missing SNPs on the array were 0.98 (GATK), 0.97 (freebayes) and 0.98 (SAMtools). Furthermore, the percentage of variants that had high values (>0.9) for another three measures (non-reference sensitivity, non-reference genotype concordance and precision) were 90 (88, 75) for GATK (SAMtools, freebayes). With all imputation programs, correlation between original and imputed genotypes was >0.95 on average with randomly masked 1000 SNPs from the SNP array and >0.85 for a leave-one-out cross-validation within sequenced individuals.

CONCLUSIONS

Performance of all variant callers studied was very good in general, particularly for GATK and SAMtools. FImpute performed slightly worse than Minimac and IMPUTE2 in terms of genotype correlation, especially for SNPs with low minor allele frequency, while it had lowest numbers in Mendelian conflicts in available father-progeny pairs. Correlations of real and imputed genotypes remained constantly high even if individuals to be imputed were several generations away from the sequenced individuals.

Collapse

Clark PM, Kunkel M, Monos DS. The dichotomy between disease phenotype databases and the implications for understanding complex diseases involving the major histocompatibility complex. Int J Immunogenet 2015;42:413-22. [PMID: 26456690 DOI: 10.1111/iji.12236] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2015] [Revised: 07/14/2015] [Accepted: 08/16/2015] [Indexed: 01/08/2023]

Reinert K, Langmead B, Weese D, Evers DJ. Alignment of Next-Generation Sequencing Reads. Annu Rev Genomics Hum Genet 2015;16:133-51. [DOI: 10.1146/annurev-genom-090413-025358] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Tetreault M, Bareke E, Nadaf J, Alirezaie N, Majewski J. Whole-exome sequencing as a diagnostic tool: current challenges and future opportunities. Expert Rev Mol Diagn 2015;15:749-60. [PMID: 25959410 DOI: 10.1586/14737159.2015.1039516] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Sequence and analysis of a whole genome from Kuwaiti population subgroup of Persian ancestry. BMC Genomics 2015;16:92. [PMID: 25765185 PMCID: PMC4336699 DOI: 10.1186/s12864-015-1233-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2014] [Accepted: 01/12/2015] [Indexed: 12/30/2022] Open

Abstract

Background

The 1000 Genome project paved the way for sequencing diverse human populations. New genome projects are being established to sequence underrepresented populations helping in understanding human genetic diversity. The Kuwait Genome Project an initiative to sequence individual genomes from the three subgroups of Kuwaiti population namely, Saudi Arabian tribe; “tent-dwelling” Bedouin; and Persian, attributing their ancestry to different regions in Arabian Peninsula and to modern-day Iran (West Asia). These subgroups were in line with settlement history and are confirmed by genetic studies. In this work, we report whole genome sequence of a Kuwaiti native from Persian subgroup at >37X coverage.

Results

We document 3,573,824 SNPs, 404,090 insertions/deletions, and 11,138 structural variations. Out of the reported SNPs and indels, 85,939 are novel. We identify 295 ‘loss-of-function’ and 2,314 ’deleterious’ coding variants, some of which carry homozygous genotypes in the sequenced genome; the associated phenotypes include pharmacogenomic traits such as greater triglyceride lowering ability with fenofibrate treatment, and requirement of high warfarin dosage to elicit anticoagulation response. 6,328 non-coding SNPs associate with 811 phenotype traits: in congruence with medical history of the participant for Type 2 diabetes and β-Thalassemia, and of participant’s family for migraine, 72 (of 159 known) Type 2 diabetes, 3 (of 4) β-Thalassemia, and 76 (of 169) migraine variants are seen in the genome. Intergenome comparisons based on shared disease-causing variants, positions the sequenced genome between Asian and European genomes in congruence with geographical location of the region. On comparison, bead arrays perform better than sequencing platforms in correctly calling genotypes in low-coverage sequenced genome regions however in the event of novel SNP or indel near genotype calling position can lead to false calls using bead arrays.

Conclusions

We report, for the first time, reference genome resource for the population of Persian ancestry. The resource provides a starting point for designing large-scale genetic studies in Peninsula including Kuwait, and Persian population. Such efforts on populations under-represented in global genome variation surveys help augment current knowledge on human genome diversity.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1233-x) contains supplementary material, which is available to authorized users.

Collapse

Wijaya E, Shimizu K, Asai K, Hamada M. Reference-free prediction of rearrangement breakpoint reads. ACTA ACUST UNITED AC 2014;30:2559-67. [PMID: 24876376 DOI: 10.1093/bioinformatics/btu360] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Affiliation(s)

Edward Wijaya Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565-0871, Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 and Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
Kana Shimizu Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565-0871, Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 and Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
Kiyoshi Asai Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565-0871, Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 and Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565-0871, Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 and Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
Michiaki Hamada Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565-0871, Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 and Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565-0871, Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 and Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan

Collapse

Jacob HJ, Abrams K, Bick DP, Brodie K, Dimmock DP, Farrell M, Geurts J, Harris J, Helbling D, Joers BJ, Kliegman R, Kowalski G, Lazar J, Margolis DA, North P, Northup J, Roquemore-Goins A, Scharer G, Shimoyama M, Strong K, Taylor B, Tsaih SW, Tschannen MR, Veith RL, Wendt-Andrae J, Wilk B, Worthey EA. Genomics in clinical practice: lessons from the front lines. Sci Transl Med 2014;5:194cm5. [PMID: 23863829 DOI: 10.1126/scitranslmed.3006468] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Watson CT, Marques-Bonet T, Sharp AJ, Mefford HC. The genetics of microdeletion and microduplication syndromes: an update. Annu Rev Genomics Hum Genet 2014;15:215-244. [PMID: 24773319 DOI: 10.1146/annurev-genom-091212-153408] [Citation(s) in RCA: 115] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Bodian DL, McCutcheon JN, Kothiyal P, Huddleston KC, Iyer RK, Vockley JG, Niederhuber JE. Germline variation in cancer-susceptibility genes in a healthy, ancestrally diverse cohort: implications for individual genome sequencing. PLoS One 2014;9:e94554. [PMID: 24728327 PMCID: PMC3984285 DOI: 10.1371/journal.pone.0094554] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2013] [Accepted: 02/17/2014] [Indexed: 01/05/2023] Open

Ling Y, Jin Z, Su M, Zhong J, Zhao Y, Yu J, Wu J, Xiao J. VCGDB: a dynamic genome database of the Chinese population. BMC Genomics 2014;15:265. [PMID: 24708222 PMCID: PMC4028056 DOI: 10.1186/1471-2164-15-265] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2013] [Accepted: 03/28/2014] [Indexed: 12/18/2022] Open

Patel ZH, Kottyan LC, Lazaro S, Williams MS, Ledbetter DH, Tromp H, Rupert A, Kohram M, Wagner M, Husami A, Qian Y, Valencia CA, Zhang K, Hostetter MK, Harley JB, Kaufman KM. The struggle to find reliable results in exome sequencing data: filtering out Mendelian errors. Front Genet 2014;5:16. [PMID: 24575121 PMCID: PMC3921572 DOI: 10.3389/fgene.2014.00016] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2013] [Accepted: 01/16/2014] [Indexed: 12/30/2022] Open

Abstract

Next Generation Sequencing studies generate a large quantity of genetic data in a relatively cost and time efficient manner and provide an unprecedented opportunity to identify candidate causative variants that lead to disease phenotypes. A challenge to these studies is the generation of sequencing artifacts by current technologies. To identify and characterize the properties that distinguish false positive variants from true variants, we sequenced a child and both parents (one trio) using DNA isolated from three sources (blood, buccal cells, and saliva). The trio strategy allowed us to identify variants in the proband that could not have been inherited from the parents (Mendelian errors) and would most likely indicate sequencing artifacts. Quality control measurements were examined and three measurements were found to identify the greatest number of Mendelian errors. These included read depth, genotype quality score, and alternate allele ratio. Filtering the variants on these measurements removed ~95% of the Mendelian errors while retaining 80% of the called variants. These filters were applied independently. After filtering, the concordance between identical samples isolated from different sources was 99.99% as compared to 87% before filtering. This high concordance suggests that different sources of DNA can be used in trio studies without affecting the ability to identify causative polymorphisms. To facilitate analysis of next generation sequencing data, we developed the Cincinnati Analytical Suite for Sequencing Informatics (CASSI) to store sequencing files, metadata (eg. relatedness information), file versioning, data filtering, variant annotation, and identify candidate causative polymorphisms that follow either de novo, rare recessive homozygous or compound heterozygous inheritance models. We conclude the data cleaning process improves the signal to noise ratio in terms of variants and facilitates the identification of candidate disease causative polymorphisms.

Collapse

Affiliation(s)

Zubin H Patel Division of Rheumatology, Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA ; Medical Scientist Training Program, University of Cincinnati College of Medicine, Cincinnati OH, USA
Leah C Kottyan Division of Rheumatology, Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA ; Department of Veterans Affairs, Veterans Affairs Medical Center - Cincinnati, Cincinnati OH, USA
Sara Lazaro Division of Rheumatology, Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA ; Department of Veterans Affairs, Veterans Affairs Medical Center - Cincinnati, Cincinnati OH, USA
Marc S Williams Genomic Medicine Institute, Geisinger Health System, Danville PA, USA
David H Ledbetter Genomic Medicine Institute, Geisinger Health System, Danville PA, USA
Hbgerard Tromp Genomic Medicine Institute, Geisinger Health System, Danville PA, USA
Andrew Rupert Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
Mojtaba Kohram Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
Michael Wagner Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
Ammar Husami Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
Yaping Qian Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
C Alexander Valencia Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
Kejian Zhang Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
Margaret K Hostetter Division of Infectious Disease, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA
John B Harley Division of Rheumatology, Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA ; Department of Veterans Affairs, Veterans Affairs Medical Center - Cincinnati, Cincinnati OH, USA
Kenneth M Kaufman Division of Rheumatology, Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati OH, USA ; Department of Veterans Affairs, Veterans Affairs Medical Center - Cincinnati, Cincinnati OH, USA

Collapse

Moore CB, Wallace JR, Wolfe DJ, Frase AT, Pendergrass SA, Weiss KM, Ritchie MD. Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data. PLoS Genet 2013;9:e1003959. [PMID: 24385916 PMCID: PMC3873241 DOI: 10.1371/journal.pgen.1003959] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 10/01/2013] [Indexed: 12/13/2022] Open

Abstract

Analyses investigating low frequency variants have the potential for explaining additional genetic heritability of many complex human traits. However, the natural frequencies of rare variation between human populations strongly confound genetic analyses. We have applied a novel collapsing method to identify biological features with low frequency variant burden differences in thirteen populations sequenced by the 1000 Genomes Project. Our flexible collapsing tool utilizes expert biological knowledge from multiple publicly available database sources to direct feature selection. Variants were collapsed according to genetically driven features, such as evolutionary conserved regions, regulatory regions genes, and pathways. We have conducted an extensive comparison of low frequency variant burden differences (MAF<0.03) between populations from 1000 Genomes Project Phase I data. We found that on average 26.87% of gene bins, 35.47% of intergenic bins, 42.85% of pathway bins, 14.86% of ORegAnno regulatory bins, and 5.97% of evolutionary conserved regions show statistically significant differences in low frequency variant burden across populations from the 1000 Genomes Project. The proportion of bins with significant differences in low frequency burden depends on the ancestral similarity of the two populations compared and types of features tested. Even closely related populations had notable differences in low frequency burden, but fewer differences than populations from different continents. Furthermore, conserved or functionally relevant regions had fewer significant differences in low frequency burden than regions under less evolutionary constraint. This degree of low frequency variant differentiation across diverse populations and feature elements highlights the critical importance of considering population stratification in the new era of DNA sequencing and low frequency variant genomic analyses.

Low frequency variants are likely to play an important role in uncovering complex trait heritability; however, they are often continent or population specific. This specificity complicates genetic analyses investigating low frequency variants for two reasons: low frequency variant signals in an association test are often difficult to generalize beyond a single population or continental group, and there is an increase in false positive results in association analyses due to underlying population stratification. In order to reveal the magnitude of low frequency population stratification, we performed pairwise population comparisons using the 1000 Genomes Project Phase I data to investigate differences in low frequency variant burden across multiple biological features. We found that low frequency variant confounding is much more prevalent than one might expect, even within continental groups. The proportion of significant differences in low frequency variant burden was also dependent on the region of interest; for example, annotated regulatory regions showed fewer low frequency burden differences between populations than intergenic regions. Knowledge of population structure and the genomic landscape in a region of interest are important factors in determining the extent of confounding due to population stratification in a low frequency genomic analysis.

Collapse

Affiliation(s)

Carrie B. Moore Center for Human Genetic Research, Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
John R. Wallace Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Daniel J. Wolfe Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Alex T. Frase Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Sarah A. Pendergrass Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Kenneth M. Weiss Department of Anthropology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Marylyn D. Ritchie Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America * E-mail:

Collapse

Worthey EA. Analysis and annotation of whole-genome or whole-exome sequencing-derived variants for clinical diagnosis. CURRENT PROTOCOLS IN HUMAN GENETICS 2013;79:9.24.1-9.24.24. [PMID: 24510652 DOI: 10.1002/0471142905.hg0924s79] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Bromberg Y. Building a genome analysis pipeline to predict disease risk and prevent disease. J Mol Biol 2013;425:3993-4005. [PMID: 23928561 DOI: 10.1016/j.jmb.2013.07.038] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Revised: 07/26/2013] [Accepted: 07/28/2013] [Indexed: 12/24/2022]