Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lin YL, Chang PC, Hsu C, Hung MZ, Chien YH, Hwu WL, Lai F, Lee NC. Comparison of GATK and DeepVariant by trio sequencing. Sci Rep 2022;12:1809. [PMID: 35110657 DOI: 10.1038/s41598-022-05833-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 01/12/2022] [Indexed: 12/03/2022] Open

For:	Lin YL, Chang PC, Hsu C, Hung MZ, Chien YH, Hwu WL, Lai F, Lee NC. Comparison of GATK and DeepVariant by trio sequencing. Sci Rep 2022;12:1809. [PMID: 35110657 DOI: 10.1038/s41598-022-05833-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 01/12/2022] [Indexed: 12/03/2022] Open

Number

Cited by Other Article(s)

Kalleberg J, Rissman J, Schnabel RD. Overcoming Limitations to Deep Learning in Domesticated Animals with TrioTrain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.15.589602. [PMID: 38659907 PMCID: PMC11042298 DOI: 10.1101/2024.04.15.589602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]

Kosugi S, Terao C. Comparative evaluation of SNVs, indels, and structural variations detected with short- and long-read sequencing data. Hum Genome Var 2024;11:18. [PMID: 38632226 PMCID: PMC11024196 DOI: 10.1038/s41439-024-00276-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/12/2024] [Accepted: 03/20/2024] [Indexed: 04/19/2024] Open

Connor R, Shakya M, Yarmosh DA, Maier W, Martin R, Bradford R, Brister JR, Chain PSG, Copeland CA, di Iulio J, Hu B, Ebert P, Gunti J, Jin Y, Katz KS, Kochergin A, LaRosa T, Li J, Li PE, Lo CC, Rashid S, Maiorova ES, Xiao C, Zalunin V, Purcell L, Pruitt KD. Recommendations for Uniform Variant Calling of SARS-CoV-2 Genome Sequence across Bioinformatic Workflows. Viruses 2024;16:430. [PMID: 38543795 PMCID: PMC10975397 DOI: 10.3390/v16030430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 02/12/2024] [Accepted: 02/16/2024] [Indexed: 04/01/2024] Open

Affiliation(s)

Ryan Connor National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Migun Shakya Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
David A. Yarmosh American Type Culture Collection, Manassas, VA 20110, USA; (D.A.Y.); (R.B.); (S.R.) BEI Resources, Manassas, VA 20110, USA
Wolfgang Maier Galaxy Europe Team, University of Freiburg, 79085 Freiburg, Germany;
Ross Martin Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA; (R.M.); (J.L.); (E.S.M.)
Rebecca Bradford American Type Culture Collection, Manassas, VA 20110, USA; (D.A.Y.); (R.B.); (S.R.) BEI Resources, Manassas, VA 20110, USA
J. Rodney Brister National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Patrick S. G. Chain Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
Courtney A. Copeland Deloitte Consulting LLP, Rosslyn, VA 22209, USA; (C.A.C.); (T.L.)
Julia di Iulio Vir Biotechnology Inc., San Francisco, CA 94158, USA; (J.d.I.); (L.P.)
Bin Hu Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
Philip Ebert Eli Lilly and Company, Indianapolis, IN 46225, USA;
Jonathan Gunti National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Yumi Jin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Kenneth S. Katz National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Andrey Kochergin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Tré LaRosa Deloitte Consulting LLP, Rosslyn, VA 22209, USA; (C.A.C.); (T.L.)
Jiani Li Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA; (R.M.); (J.L.); (E.S.M.)
Po-E Li Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
Chien-Chi Lo Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; (M.S.); (P.S.G.C.); (B.H.); (P.-E.L.); (C.-C.L.)
Sujatha Rashid American Type Culture Collection, Manassas, VA 20110, USA; (D.A.Y.); (R.B.); (S.R.)
Evguenia S. Maiorova Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA; (R.M.); (J.L.); (E.S.M.)
Chunlin Xiao National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Vadim Zalunin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)
Lisa Purcell Vir Biotechnology Inc., San Francisco, CA 94158, USA; (J.d.I.); (L.P.)
Kim D. Pruitt National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (R.C.); (J.R.B.); (J.G.); (Y.J.); (K.S.K.); (A.K.); (C.X.); (V.Z.)

Collapse

Chafai N, Bonizzi L, Botti S, Badaoui B. Emerging applications of machine learning in genomic medicine and healthcare. Crit Rev Clin Lab Sci 2024;61:140-163. [PMID: 37815417 DOI: 10.1080/10408363.2023.2259466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 09/12/2023] [Indexed: 10/11/2023]

Wang L, Yan X, Wu H, Wang F, Zhong Z, Zheng G, Xiao Q, Wu K, Na W. Selection Signal Analysis Reveals Hainan Yellow Cattle Are Being Selectively Bred for Heat Tolerance. Animals (Basel) 2024;14:775. [PMID: 38473160 DOI: 10.3390/ani14050775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Revised: 02/24/2024] [Accepted: 02/27/2024] [Indexed: 03/14/2024] Open

Zhou Y, Kathiresan N, Yu Z, Rivera LF, Yang Y, Thimma M, Manickam K, Chebotarov D, Mauleon R, Chougule K, Wei S, Gao T, Green CD, Zuccolo A, Xie W, Ware D, Zhang J, McNally KL, Wing RA. A high-performance computational workflow to accelerate GATK SNP detection across a 25-genome dataset. BMC Biol 2024;22:13. [PMID: 38273258 PMCID: PMC10809545 DOI: 10.1186/s12915-024-01820-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/09/2024] [Indexed: 01/27/2024] Open

Abstract

BACKGROUND

Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable.

RESULTS

Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy with comparable results with previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~ 3000 resequenced rice accessions. The entire process took ~ 16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~ 2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq).

CONCLUSIONS

This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.

Collapse

Affiliation(s)

Yong Zhou Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
Nagarajan Kathiresan KAUST Supercomputing Laboratory (KSL), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Zhichao Yu Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
Luis F Rivera Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Yujian Yang National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
Manjula Thimma Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Keerthana Manickam Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Dmytro Chebotarov International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
Ramil Mauleon International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
Kapeel Chougule Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
Sharon Wei Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
Tingting Gao National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
Carl D Green Information Technology Department, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Andrea Zuccolo Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia Crop Science Research Center (CSRC), Scuola Superiore Sant'Anna, Pisa, 56127, Italy
Weibo Xie National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
Doreen Ware Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA USDA ARS NEA Plant, Soil & Nutrition Laboratory Research Unit, Ithaca, NY, 14853, USA
Jianwei Zhang National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
Kenneth L McNally International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
Rod A Wing Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia. Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA. International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines.

Collapse

Kawai Y, Watanabe Y, Omae Y, Miyahara R, Khor SS, Noiri E, Kitajima K, Shimanuki H, Gatanaga H, Hata K, Hattori K, Iida A, Ishibashi-Ueda H, Kaname T, Kanto T, Matsumura R, Miyo K, Noguchi M, Ozaki K, Sugiyama M, Takahashi A, Tokuda H, Tomita T, Umezawa A, Watanabe H, Yoshida S, Goto YI, Maruoka Y, Matsubara Y, Niida S, Mizokami M, Tokunaga K. Exploring the genetic diversity of the Japanese population: Insights from a large-scale whole genome sequencing analysis. PLoS Genet 2023;19:e1010625. [PMID: 38060463 PMCID: PMC10703243 DOI: 10.1371/journal.pgen.1010625] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Accepted: 10/24/2023] [Indexed: 12/18/2023] Open

Affiliation(s)

Yosuke Kawai Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Yusuke Watanabe Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Yosuke Omae Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan Central Biobank, National Center Biobank Network, Shinjuku-ku, Tokyo, Japan
Reiko Miyahara Central Biobank, National Center Biobank Network, Shinjuku-ku, Tokyo, Japan
Seik-Soon Khor Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Eisei Noiri Central Biobank, National Center Biobank Network, Shinjuku-ku, Tokyo, Japan
Koji Kitajima Central Biobank, National Center Biobank Network, Shinjuku-ku, Tokyo, Japan Department of Data Science Center for Clinical Sciences, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Hideyuki Shimanuki Central Biobank, National Center Biobank Network, Shinjuku-ku, Tokyo, Japan Department of Data Science Center for Clinical Sciences, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Hiroyuki Gatanaga AIDS Clinical Center, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Kenichiro Hata Department of Maternal-Fetal Biology, National Center for Child Health and Development, Setagaya-ku, Tokyo, Japan
Kotaro Hattori Department of Bioresources, Medical Genome Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
Aritoshi Iida Department of Clinical Genome Analysis, Medical Genome Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
Hatsue Ishibashi-Ueda NCVC Biobank, National Cerebral and Cardiovascular Center, Suita, Osaka, Japan
Tadashi Kaname Department of Genome Medicine, National Center for Child Health and Development, Setagaya-ku, Tokyo, Japan
Tatsuya Kanto Department of Liver Disease, Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Ichikawa, Chiba, Japan
Ryo Matsumura Department of Bioresources, Medical Genome Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
Kengo Miyo Center for Medical Informatics Intelligence, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Michio Noguchi NCVC Biobank, National Cerebral and Cardiovascular Center, Suita, Osaka, Japan
Kouichi Ozaki Medical Genome Center, Research Institute, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
Masaya Sugiyama Department of Viral Pathogenesis and Controls, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Ayako Takahashi NCVC Biobank, National Cerebral and Cardiovascular Center, Suita, Osaka, Japan
Haruhiko Tokuda Core Facility Administration, Research Institute, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan Department of Metabolic Research, Research Institute, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan Department of Clinical Laboratory, Hospital, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan
Tsutomu Tomita NCVC Biobank, National Cerebral and Cardiovascular Center, Suita, Osaka, Japan
Akihiro Umezawa Center for Regenerative Medicine, Research Institute, National Center for Child Health and Development, Setagaya-ku, Tokyo, Japan
Hiroshi Watanabe Core Facility Administration, Research Institute, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan Innovation Center for Translational Research, Hospital, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan
Sumiko Yoshida Department of Bioresources, Medical Genome Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
Yu-ichi Goto Medical Genome Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
Yutaka Maruoka Department of Oral and Maxillofacial Surgery, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
Yoichi Matsubara National Center for Child Health and Development, Setagaya-ku, Tokyo, Japan
Shumpei Niida Core Facility Administration, Research Institute, National Center for Geriatrics and Gerontology, Obu, Aichi, Japan
Masashi Mizokami Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Ichikawa, Chiba, Japan
Katsushi Tokunaga Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan Central Biobank, National Center Biobank Network, Shinjuku-ku, Tokyo, Japan

Collapse

Rioux B, Chong M, Walker R, McGlasson S, Rannikmäe K, McCartney D, McCabe J, Brown R, Crow YJ, Hunt D, Whiteley W. Phenotypes associated with genetic determinants of type I interferon regulation in the UK Biobank: a protocol. Wellcome Open Res 2023;8:550. [PMID: 38855722 PMCID: PMC11162527 DOI: 10.12688/wellcomeopenres.20385.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/14/2023] [Indexed: 06/11/2024] Open

Abstract

Background

Type I interferons are cytokines involved in innate immunity against viruses. Genetic disorders of type I interferon regulation are associated with a range of autoimmune and cerebrovascular phenotypes. Carriers of pathogenic variants involved in genetic disorders of type I interferons are generally considered asymptomatic. Preliminary data suggests, however, that genetically determined dysregulation of type I interferon responses is associated with autoimmunity, and may also be relevant to sporadic cerebrovascular disease and dementia. We aim to determine whether functional variants in genes involved in type I interferon regulation and signalling are associated with the risk of autoimmunity, stroke, and dementia in a population cohort.

Methods

We will perform a hypothesis-driven candidate pathway association study of type I interferon-related genes using rare variants in the UK Biobank (UKB). We will manually curate type I interferon regulation and signalling genes from a literature review and Gene Ontology, followed by clinical and functional filtering. Variants of interest will be included based on pre-defined clinical relevance and functional annotations (using LOFTEE, M-CAP and a minor allele frequency <0.1%). The association of variants with 15 clinical and three neuroradiological phenotypes will be assessed with a rare variant genetic risk score and gene-level tests, using a Bonferroni-corrected p-value threshold from the number of genetic units and phenotypes tested. We will explore the association of significant genetic units with 196 additional health-related outcomes to help interpret their relevance and explore the clinical spectrum of genetic perturbations of type I interferon.

Ethics and dissemination

The UKB has received ethical approval from the North West Multicentre Research Ethics Committee, and all participants provided written informed consent at recruitment. This research will be conducted using the UKB Resource under application number 93160. We expect to disseminate our results in a peer-reviewed journal and at an international cardiovascular conference.

Collapse

Zhang YJ, Luo Z, Sun Y, Liu J, Chen Z. From beasts to bytes: Revolutionizing zoological research with artificial intelligence. Zool Res 2023;44:1115-1131. [PMID: 37933101 PMCID: PMC10802096 DOI: 10.24272/j.issn.2095-8137.2023.263] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 10/30/2023] [Indexed: 11/08/2023] Open

Steyaert W, Haer-Wigman L, Pfundt R, Hellebrekers D, Steehouwer M, Hampstead J, de Boer E, Stegmann A, Yntema H, Kamsteeg EJ, Brunner H, Hoischen A, Gilissen C. Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation. Nat Commun 2023;14:6845. [PMID: 37891200 PMCID: PMC10611741 DOI: 10.1038/s41467-023-42531-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 10/13/2023] [Indexed: 10/29/2023] Open

Affiliation(s)

Wouter Steyaert Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands Radboud Institute for Molecular Life Sciences, Nijmegen, Netherlands
Lonneke Haer-Wigman Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
Rolph Pfundt Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
Debby Hellebrekers Maastricht University Medical Center + , Department of Clinical Genetics, Maastricht, Netherlands
Marloes Steehouwer Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
Juliet Hampstead Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
Elke de Boer Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, Netherlands
Alexander Stegmann Maastricht University Medical Center + , Department of Clinical Genetics, Maastricht, Netherlands
Helger Yntema Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
Erik-Jan Kamsteeg Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
Han Brunner Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands Maastricht University Medical Center + , Department of Clinical Genetics, Maastricht, Netherlands
Alexander Hoischen Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands Radboud Institute for Molecular Life Sciences, Nijmegen, Netherlands Radboud University Medical Center, Department of Internal Medicine and Radboud Center for Infectious Diseases (RCI), Nijmegen, Netherlands
Christian Gilissen Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands. Radboud Institute for Molecular Life Sciences, Nijmegen, Netherlands.

Collapse

Guhlin J, Le Lec MF, Wold J, Koot E, Winter D, Biggs PJ, Galla SJ, Urban L, Foster Y, Cox MP, Digby A, Uddstrom LR, Eason D, Vercoe D, Davis T, Howard JT, Jarvis ED, Robertson FE, Robertson BC, Gemmell NJ, Steeves TE, Santure AW, Dearden PK. Species-wide genomics of kākāpō provides tools to accelerate recovery. Nat Ecol Evol 2023;7:1693-1705. [PMID: 37640765 DOI: 10.1038/s41559-023-02165-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 07/11/2023] [Indexed: 08/31/2023]

Affiliation(s)

Joseph Guhlin Genomics Aotearoa, Biochemistry Department, School of Biomedical Sciences, University of Otago, Dunedin, Aotearoa New Zealand
Marissa F Le Lec Genomics Aotearoa, Biochemistry Department, School of Biomedical Sciences, University of Otago, Dunedin, Aotearoa New Zealand
Jana Wold School of Biological Sciences, University of Canterbury, Christchurch, Aotearoa New Zealand
Emily Koot The New Zealand Institute for Plant and Food Research Ltd, Palmerston North, Aotearoa New Zealand
David Winter School of Natural Sciences, Massey University, Palmerston North, Aotearoa New Zealand
Patrick J Biggs School of Natural Sciences, Massey University, Palmerston North, Aotearoa New Zealand School of Veterinary Science, Massey University, Palmerston North, Aotearoa New Zealand
Stephanie J Galla School of Biological Sciences, University of Canterbury, Christchurch, Aotearoa New Zealand Department of Biological Sciences, Boise State University, Boise, ID, USA
Lara Urban Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, Aotearoa New Zealand Helmholtz Pioneer Campus, Helmholtz Zentrum Muenchen, Neuherberg, Germany Helmholtz AI, Helmholtz Zentrum Muenchen, Neuherberg, Germany School of Life Sciences, Technical University of Munich, Freising, Germany
Yasmin Foster Department of Zoology, University of Otago, Dunedin, Aotearoa New Zealand
Murray P Cox School of Natural Sciences, Massey University, Palmerston North, Aotearoa New Zealand Department of Statistics, University of Auckland, Auckland, Aotearoa New Zealand
Andrew Digby Kākāpō Recovery Programme, Department of Conservation, Invercargill, Aotearoa New Zealand
Lydia R Uddstrom Kākāpō Recovery Programme, Department of Conservation, Invercargill, Aotearoa New Zealand
Daryl Eason Kākāpō Recovery Programme, Department of Conservation, Invercargill, Aotearoa New Zealand
Deidre Vercoe Kākāpō Recovery Programme, Department of Conservation, Invercargill, Aotearoa New Zealand
Tāne Davis Rakiura Tītī Islands Administering Body, Invercargill, Aotearoa New Zealand
Jason T Howard Neurogenetics of Language Lab, The Rockefeller University, New York, NY, USA Mirxes, Cambridge, MA, USA
Erich D Jarvis The Rockefeller University, New York, NY, USA Howard Hughes Medical Institute, Chevy Chase, MD, USA
Fiona E Robertson Department of Zoology, University of Otago, Dunedin, Aotearoa New Zealand
Bruce C Robertson Department of Zoology, University of Otago, Dunedin, Aotearoa New Zealand
Neil J Gemmell Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, Aotearoa New Zealand
Tammy E Steeves School of Biological Sciences, University of Canterbury, Christchurch, Aotearoa New Zealand
Anna W Santure School of Biological Sciences, University of Auckland, Auckland, Aotearoa New Zealand
Peter K Dearden Genomics Aotearoa, Biochemistry Department, School of Biomedical Sciences, University of Otago, Dunedin, Aotearoa New Zealand.

Collapse

Xu X, Chen B, Zhang J, Lan S, Wu S. Whole-genome resequencing analysis of the medicinal plant Gardenia jasminoides. PeerJ 2023;11:e16056. [PMID: 37744244 PMCID: PMC10512932 DOI: 10.7717/peerj.16056] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 08/17/2023] [Indexed: 09/26/2023] Open

Nakamichi K, Van Gelder RN, Chao JR, Mustafi D. Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients. Sci Rep 2023;13:8535. [PMID: 37237007 PMCID: PMC10219926 DOI: 10.1038/s41598-023-35791-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 05/24/2023] [Indexed: 05/28/2023] Open

Lloret-Villas A, Pausch H, Leonard AS. The size and composition of haplotype reference panels impact the accuracy of imputation from low-pass sequencing in cattle. Genet Sel Evol 2023;55:33. [PMID: 37170101 PMCID: PMC10173671 DOI: 10.1186/s12711-023-00809-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 05/02/2023] [Indexed: 05/13/2023] Open

Abstract

BACKGROUND

Low-pass sequencing followed by sequence variant genotype imputation is an alternative to the routine microarray-based genotyping in cattle. However, the impact of haplotype reference panels and their interplay with the coverage of low-pass whole-genome sequencing data have not been sufficiently explored in typical livestock settings where only a small number of reference samples is available.

METHODS

Sequence variant genotyping accuracy was compared between two variant callers, GATK and DeepVariant, in 50 Brown Swiss cattle with sequencing coverages ranging from 4- to 63-fold. Haplotype reference panels of varying sizes and composition were built with DeepVariant based on 501 individuals from nine breeds. High-coverage sequence data for 24 Brown Swiss cattle were downsampled to between 0.01- and 4-fold to mimic low-pass sequencing. GLIMPSE was used to infer sequence variant genotypes from the low-pass sequencing data using different haplotype reference panels. The accuracy of the sequence variant genotypes that were inferred from low-pass sequencing data was compared with sequence variant genotypes called from high-coverage data.

RESULTS

DeepVariant was used to establish bovine haplotype reference panels because it outperformed GATK in all evaluations. Within-breed haplotype reference panels were more accurate and efficient to impute sequence variant genotypes from low-pass sequencing than equally-sized multibreed haplotype reference panels for all target sample coverages and allele frequencies. F1 scores greater than 0.9, which indicate high harmonic means of recall and precision of called genotypes, were achieved with 0.25-fold sequencing coverage when large breed-specific haplotype reference panels (n = 150) were used. In absence of such large within-breed haplotype panels, variant genotyping accuracy from low-pass sequencing could be increased either by adding non-related samples to the haplotype reference panel or by increasing the coverage of the low-pass sequencing data. Sequence variant genotyping from low-pass sequencing was substantially less accurate when the reference panel lacked individuals from the target breed.

CONCLUSIONS

Variant genotyping is more accurate with DeepVariant than GATK. DeepVariant is therefore suitable to establish bovine haplotype reference panels. Medium-sized breed-specific haplotype reference panels and large multibreed haplotype reference panels enable accurate imputation of low-pass sequencing data in a typical cattle breed.

Collapse

Harvey WT, Ebert P, Ebler J, Audano PA, Munson KM, Hoekzema K, Porubsky D, Beck CR, Marschall T, Garimella K, Eichler EE. Whole-genome long-read sequencing downsampling and its effect on variant calling precision and recall. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.04.539448. [PMID: 37205567 PMCID: PMC10187267 DOI: 10.1101/2023.05.04.539448] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]

Ruperao P, Gandham P, Odeny DA, Mayes S, Selvanayagam S, Thirunavukkarasu N, Das RR, Srikanda M, Gandhi H, Habyarimana E, Manyasa E, Nebie B, Deshpande SP, Rathore A. Exploring the sorghum race level diversity utilizing 272 sorghum accessions genomic resources. FRONTIERS IN PLANT SCIENCE 2023;14:1143512. [PMID: 37008459 PMCID: PMC10063887 DOI: 10.3389/fpls.2023.1143512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 02/22/2023] [Indexed: 06/19/2023]

Population Structure and Genetic Diversity Analysis of “Yufen 1” H Line Chickens Using Whole-Genome Resequencing. Life (Basel) 2023;13:life13030793. [PMID: 36983948 PMCID: PMC10059704 DOI: 10.3390/life13030793] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 03/13/2023] [Accepted: 03/13/2023] [Indexed: 03/17/2023] Open

Betschart RO, Thiéry A, Aguilera-Garcia D, Zoche M, Moch H, Twerenbold R, Zeller T, Blankenberg S, Ziegler A. Comparison of calling pipelines for whole genome sequencing: an empirical study demonstrating the importance of mapping and alignment. Sci Rep 2022;12:21502. [PMID: 36513709 PMCID: PMC9748128 DOI: 10.1038/s41598-022-26181-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 12/12/2022] [Indexed: 12/14/2022] Open

Affiliation(s)

Raphael O Betschart Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265, Davos Wolfgang, Switzerland
Alexandre Thiéry Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265, Davos Wolfgang, Switzerland
Domingo Aguilera-Garcia Institute of Pathology and Molecular Pathology, University Hospital Zurich, Schmelzbergstrasse 12, 8091, Zurich, Switzerland
Martin Zoche Institute of Pathology and Molecular Pathology, University Hospital Zurich, Schmelzbergstrasse 12, 8091, Zurich, Switzerland
Holger Moch Institute of Pathology and Molecular Pathology, University Hospital Zurich, Schmelzbergstrasse 12, 8091, Zurich, Switzerland
Raphael Twerenbold Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany University Center of Cardiovascular Research Hamburg, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Tanja Zeller Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany University Center of Cardiovascular Research Hamburg, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Stefan Blankenberg Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265, Davos Wolfgang, Switzerland Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany University Center of Cardiovascular Research Hamburg, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Andreas Ziegler Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265, Davos Wolfgang, Switzerland. Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251, Hamburg, Germany. School Mathematics, Statistics and Computer Science, Scottsville, Private Bag X01, Pietermaritzburg, 3209, South Africa.

Collapse

Li J, Wang T, Liu W, Yin D, Lai Z, Zhang G, Zhang K, Ji J, Yin S. A high-quality chromosome-level genome assembly of Pelteobagrus vachelli provides insights into its environmental adaptation and population history. Front Genet 2022;13:1050192. [DOI: 10.3389/fgene.2022.1050192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 11/01/2022] [Indexed: 11/16/2022] Open

Connor R, Yarmosh DA, Maier W, Shakya M, Martin R, Bradford R, Brister JR, Chain PS, Copeland CA, di Iulio J, Hu B, Ebert P, Gunti J, Jin Y, Katz KS, Kochergin A, LaRosa T, Li J, Li PE, Lo CC, Rashid S, Maiorova ES, Xiao C, Zalunin V, Pruitt KD. Towards increased accuracy and reproducibility in SARS-CoV-2 next generation sequence analysis for public health surveillance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.11.03.515010. [PMID: 36380755 PMCID: PMC9645426 DOI: 10.1101/2022.11.03.515010] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Affiliation(s)

Ryan Connor National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
David A Yarmosh American Type Culture Collection, 10807 University Blvd, Manassas, VA 20110, USA BEI Resources
Wolfgang Maier Galaxy Europe Team, University of Freiburg, Freiburg, Germany
Migun Shakya Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
Ross Martin Clinical Virology Department, Gilead Sciences, 333 Lakeside Dr, Foster City, CA 94404, USA
Rebecca Bradford American Type Culture Collection, 10807 University Blvd, Manassas, VA 20110, USA BEI Resources
J Rodney Brister National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Patrick Sg Chain Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
Courtney A Copeland Deloitte Consulting LLP, 1919 North Lynn St, Suite 1500, Rosslyn, VA 22209 USA
Julia di Iulio Vir Biotechnology Inc., San Francisco, CA, USA
Bin Hu Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
Philip Ebert Eli Lilly and Company, Indianapolis, IN
Jonathan Gunti National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Yumi Jin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Kenneth S Katz National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Andrey Kochergin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Tré LaRosa Deloitte Consulting LLP, 1919 North Lynn St, Suite 1500, Rosslyn, VA 22209 USA
Jiani Li Clinical Virology Department, Gilead Sciences, 333 Lakeside Dr, Foster City, CA 94404, USA
Po-E Li Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
Chien-Chi Lo Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545 USA
Sujatha Rashid American Type Culture Collection, 10807 University Blvd, Manassas, VA 20110, USA
Evguenia S Maiorova Clinical Virology Department, Gilead Sciences, 333 Lakeside Dr, Foster City, CA 94404, USA
Chunlin Xiao National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Vadim Zalunin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Kim D Pruitt National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA

Collapse