Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hehir-Kwa JY, Marschall T, Kloosterman WP, Francioli LC, Baaijens JA, Dijkstra LJ, Abdellaoui A, Koval V, Thung DT, Wardenaar R, Renkens I, Coe BP, Deelen P, de Ligt J, Lameijer EW, van Dijk F, Hormozdiari F, Uitterlinden AG, van Duijn CM, Eichler EE, de Bakker PI, Swertz MA, Wijmenga C, van Ommen GB, Slagboom PE, Boomsma DI, Schönhuth A, Ye K, Guryev V; Genome of the Netherlands Consortium. A high-quality human reference panel reveals the complexity and distribution of genomic structural variants. Nat Commun 2016;7:12989. [PMID: 27708267 DOI: 10.1038/ncomms12989] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2016] [Accepted: 08/24/2016] [Indexed: 02/06/2023] Open

For:	Hehir-Kwa JY, Marschall T, Kloosterman WP, Francioli LC, Baaijens JA, Dijkstra LJ, Abdellaoui A, Koval V, Thung DT, Wardenaar R, Renkens I, Coe BP, Deelen P, de Ligt J, Lameijer EW, van Dijk F, Hormozdiari F, Uitterlinden AG, van Duijn CM, Eichler EE, de Bakker PI, Swertz MA, Wijmenga C, van Ommen GB, Slagboom PE, Boomsma DI, Schönhuth A, Ye K, Guryev V; Genome of the Netherlands Consortium. A high-quality human reference panel reveals the complexity and distribution of genomic structural variants. Nat Commun 2016;7:12989. [PMID: 27708267 DOI: 10.1038/ncomms12989] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2016] [Accepted: 08/24/2016] [Indexed: 02/06/2023] Open

Number

Cited by Other Article(s)

Schreiber M, Jayakodi M, Stein N, Mascher M. Plant pangenomes for crop improvement, biodiversity and evolution. Nat Rev Genet 2024;25:563-577. [PMID: 38378816 DOI: 10.1038/s41576-024-00691-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/14/2023] [Indexed: 02/22/2024]

Sarwal V, Lee S, Yang J, Sankararaman S, Chaisson M, Eskin E, Mangul S. VISTA: an integrated framework for structural variant discovery. Brief Bioinform 2024;25:bbae462. [PMID: 39297879 PMCID: PMC11411772 DOI: 10.1093/bib/bbae462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 08/27/2024] [Accepted: 09/07/2024] [Indexed: 09/26/2024] Open

Abstract

Structural variation (SV) refers to insertions, deletions, inversions, and duplications in human genomes. SVs are present in approximately 1.5% of the human genome. Still, this small subset of genetic variation has been implicated in the pathogenesis of psoriasis, Crohn's disease and other autoimmune disorders, autism spectrum and other neurodevelopmental disorders, and schizophrenia. Since identifying structural variants is an important problem in genetics, several specialized computational techniques have been developed to detect structural variants directly from sequencing data. With advances in whole-genome sequencing (WGS) technologies, a plethora of SV detection methods have been developed. However, dissecting SVs from WGS data remains a challenge, with the majority of SV detection methods prone to a high false-positive rate, and no existing method able to precisely detect a full range of SVs present in a sample. Previous studies have shown that none of the existing SV callers can maintain high accuracy across various SV lengths and genomic coverages. Here, we report an integrated structural variant calling framework, Variant Identification and Structural Variant Analysis (VISTA), that leverages the results of individual callers using a novel and robust filtering and merging algorithm. In contrast to existing consensus-based tools which ignore the length and coverage, VISTA overcomes this limitation by executing various combinations of top-performing callers based on variant length and genomic coverage to generate SV events with high accuracy. We evaluated the performance of VISTA on comprehensive gold-standard datasets across varying organisms and coverage. We benchmarked VISTA using the Genome-in-a-Bottle gold standard SV set, haplotype-resolved de novo assemblies from the Human Pangenome Reference Consortium, along with an in-house polymerase chain reaction (PCR)-validated mouse gold standard set. VISTA maintained the highest F1 score among top consensus-based tools measured using a comprehensive gold standard across both mouse and human genomes. VISTA also has an optimized mode, where the calls can be optimized for precision or recall. VISTA-optimized can attain 100% precision and the highest sensitivity among other variant callers. In conclusion, VISTA represents a significant advancement in structural variant calling, offering a robust and accurate framework that outperforms existing consensus-based tools and sets a new standard for SV detection in genomic research.

Collapse

Cheng PL, Wang H, Dombroski BA, Farrell JJ, Horng I, Chung T, Tosto G, Kunkle BW, Bush WS, Vardarajan B, Schellenberg GD, Lee WP. A Specialized Reference Panel with Structural Variants Integration for Improving Genotype Imputation in Alzheimer's Disease and Related Dementias (ADRD). MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.07.22.24310827. [PMID: 39108532 PMCID: PMC11302603 DOI: 10.1101/2024.07.22.24310827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/12/2024]

Affiliation(s)

Po-Liang Cheng Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Hui Wang Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Beth A Dombroski Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
John J Farrell Biomedical Genetics, Department of Medicine, Boston University Medical School, Boston, MA, USA
Iris Horng Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Tingting Chung Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Giuseppe Tosto Taub Institute for Research on Alzheimer's Disease and the Aging Brain, College of Physicians and Surgeons, Columbia University, NY 10032, USA Department of Neurology, College of Physicians and Surgeons, Columbia University and the New York Presbyterian Hospital, NY 10032, USA
Brian W Kunkle John P Hussman Institute for Human Genomics, Miami, FL, USA John T Macdonald Department of Human Genetics, Miami, FL, USA
William S Bush Cleveland Institute for Computational Biology, Cleveland, OH, USA Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA
Badri Vardarajan Taub Institute for Research on Alzheimer's Disease and the Aging Brain, College of Physicians and Surgeons, Columbia University, NY 10032, USA Department of Neurology, College of Physicians and Surgeons, Columbia University and the New York Presbyterian Hospital, NY 10032, USA
Gerard D Schellenberg Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Wan-Ping Lee Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA

Collapse

Patel-Tupper D, Kelikian A, Leipertz A, Maryn N, Tjahjadi M, Karavolias NG, Cho MJ, Niyogi KK. Multiplexed CRISPR-Cas9 mutagenesis of rice PSBS1 noncoding sequences for transgene-free overexpression. SCIENCE ADVANCES 2024;10:eadm7452. [PMID: 38848363 PMCID: PMC11160471 DOI: 10.1126/sciadv.adm7452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Accepted: 05/03/2024] [Indexed: 06/09/2024]

Xue Z, Zhou A, Zhu X, Li L, Zhu H, Jin X, Wang J. NIPT-PG: empowering non-invasive prenatal testing to learn from population genomics through an incremental pan-genomic approach. Brief Bioinform 2024;25:bbae266. [PMID: 38836702 DOI: 10.1093/bib/bbae266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 05/03/2024] [Accepted: 05/21/2024] [Indexed: 06/06/2024] Open

Shi J, Jia Z, Sun J, Wang X, Zhao X, Zhao C, Liang F, Song X, Guan J, Jia X, Yang J, Chen Q, Yu K, Jia Q, Wu J, Wang D, Xiao Y, Xu X, Liu Y, Wu S, Zhong Q, Wu J, Cui S, Bo X, Wu Z, Park M, Kellis M, He K. Structural variants involved in high-altitude adaptation detected using single-molecule long-read sequencing. Nat Commun 2023;14:8282. [PMID: 38092772 PMCID: PMC10719358 DOI: 10.1038/s41467-023-44034-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 11/27/2023] [Indexed: 12/17/2023] Open

Affiliation(s)

Jinlong Shi Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China Beijing Key Laboratory for Precision Medicine of Chronic Heart Failure, Chinese PLA General Hospital, Beijing, China
Zhilong Jia Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China Beijing Key Laboratory for Precision Medicine of Chronic Heart Failure, Chinese PLA General Hospital, Beijing, China Medical Artificial Intelligence Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China
Jinxiu Sun Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China
Xiaoreng Wang Laboratory of Nuclear and Radiation Injury, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China State Key Laboratory of Experimental Hematology, Beijing, 100853, China
Xiaojing Zhao Beijing Key Laboratory for Precision Medicine of Chronic Heart Failure, Chinese PLA General Hospital, Beijing, China Translational Medicine Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China
Chenghui Zhao Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China Research Center for Biomedical Engineering, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China
Fan Liang NextOmics Biosciences Inc, Wuhan, 430000, China
Xinyu Song Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China Medical Artificial Intelligence Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China
Jiawei Guan Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China
Xue Jia Laboratory of Nuclear and Radiation Injury, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China
Jing Yang Laboratory of Nuclear and Radiation Injury, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China
Qi Chen Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China
Kang Yu Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China
Qian Jia Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China
Jing Wu Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China
Depeng Wang NextOmics Biosciences Inc, Wuhan, 430000, China
Yuhui Xiao NextOmics Biosciences Inc, Wuhan, 430000, China
Xiaoman Xu NextOmics Biosciences Inc, Wuhan, 430000, China
Yinzhe Liu NextOmics Biosciences Inc, Wuhan, 430000, China
Shijing Wu Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China
Qin Zhong Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China
Jue Wu Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China
Saijia Cui Beijing Key Laboratory for Precision Medicine of Chronic Heart Failure, Chinese PLA General Hospital, Beijing, China
Xiaochen Bo Beijing Institute of Radiation Medicine, Beijing, 100850, China
Zhenzhou Wu BioMind Inc, Beijing, 101300, China
Minsung Park NextOmics Biosciences Inc, Wuhan, 430000, China
Manolis Kellis Massachusetts Institute of Technology; MIT Computer Science and Artificial Intelligence Laboratory, Broad Institute of MIT and Harvard, Cambridge, 02139, MA, USA
Kunlun He Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, 100853, China. National Engineering Research Center of Medical Big Data, Chinese PLA General Hospital, Beijing, 100853, China. Key Laboratory of Biomedical Engineering and Translational Medicine, Ministry of Industry and Information Technology, Chinese PLA General Hospital, Beijing, 100853, China. Beijing Key Laboratory for Precision Medicine of Chronic Heart Failure, Chinese PLA General Hospital, Beijing, China.

Collapse

Antinucci M, Comas D, Calafell F. Population history modulates the fitness effects of Copy Number Variation in the Roma. Hum Genet 2023;142:1327-1343. [PMID: 37311904 PMCID: PMC10449987 DOI: 10.1007/s00439-023-02579-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 06/02/2023] [Indexed: 06/15/2023]

Soto DC, Uribe-Salazar JM, Shew CJ, Sekar A, McGinty S, Dennis MY. Genomic structural variation: A complex but important driver of human evolution. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2023;181 Suppl 76:118-144. [PMID: 36794631 PMCID: PMC10329998 DOI: 10.1002/ajpa.24713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 01/21/2023] [Accepted: 02/05/2023] [Indexed: 02/17/2023]

Wen S, Wang M, Qian X, Li Y, Wang K, Choi J, Pennesi ME, Yang P, Marra M, Koenekoop RK, Lopez I, Matynia A, Gorin M, Sui R, Yao F, Goetz K, Porto FBO, Chen R. Systematic assessment of the contribution of structural variants to inherited retinal diseases. Hum Mol Genet 2023;32:2005-2015. [PMID: 36811936 PMCID: PMC10244226 DOI: 10.1093/hmg/ddad032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 01/03/2023] [Accepted: 02/11/2023] [Indexed: 02/24/2023] Open

Affiliation(s)

Shu Wen Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Meng Wang Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Xinye Qian Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Yumei Li Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Keqing Wang Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Jongsu Choi Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Mark E Pennesi Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, Portland, OR 97239, USA
Paul Yang Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, Portland, OR 97239, USA
Molly Marra Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, Portland, OR 97239, USA
Robert K Koenekoop McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, Montreal, Quebec, H4A 3S5, Canada
Irma Lopez McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, Montreal, Quebec, H4A 3S5, Canada
Anna Matynia Jules Stein Eye Institute, Los Angeles, CA 90095, USA Ophthalmology, University of California Los Angeles David Geffen School of Medicine, Los Angeles, CA 90095, USA
Michael Gorin Jules Stein Eye Institute, Los Angeles, CA 90095, USA Ophthalmology, University of California Los Angeles David Geffen School of Medicine, Los Angeles, CA 90095, USA
Ruifang Sui Department of Ophthalmology, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, 100005, China
Fengxia Yao Medical Research Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, 100005, China
Kerry Goetz Office of the Director, National Eye Institute/National Institutes of Health, Bethesda, MD 20892, USA
Fernanda Belga Ottoni Porto INRET Clínica e Centro de Pesquisa, Belo Horizonte, Minas Gerais, 30150270, Brazil Department of Ophthalmology, Santa Casa de Misericórdia de Belo Horizonte, Belo Horizonte, Minas Gerais, 30150221, Brazil Centro Oftalmológico de Minas Gerais, Belo Horizonte, Minas Gerais, 30180070, Brazil
Rui Chen Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA

Collapse

Wen S, Wang M, Qian X, Li Y, Wang K, Choi J, Pennesi ME, Yang P, Marra M, Koenekoop RK, Lopez I, Matynia A, Gorin M, Sui R, Yao F, Goetz K, Porto FBO, Chen R. Systematic assessment of the contribution of structural variants to inherited retinal diseases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.02.522522. [PMID: 36789417 PMCID: PMC9928032 DOI: 10.1101/2023.01.02.522522] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Li Q, Yan B, Lam TW, Luo R. Assembly-free discovery of human novel sequences using long reads. DNA Res 2022;29:dsac039. [PMID: 36308393 PMCID: PMC9700288 DOI: 10.1093/dnares/dsac039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Revised: 10/19/2022] [Accepted: 10/27/2022] [Indexed: 09/10/2024] Open

Schuy J, Grochowski CM, Carvalho CMB, Lindstrand A. Complex genomic rearrangements: an underestimated cause of rare diseases. Trends Genet 2022;38:1134-1146. [PMID: 35820967 PMCID: PMC9851044 DOI: 10.1016/j.tig.2022.06.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 05/12/2022] [Accepted: 06/06/2022] [Indexed: 01/24/2023]

Otsuki A, Okamura Y, Ishida N, Tadaka S, Takayama J, Kumada K, Kawashima J, Taguchi K, Minegishi N, Kuriyama S, Tamiya G, Kinoshita K, Katsuoka F, Yamamoto M. Construction of a trio-based structural variation panel utilizing activated T lymphocytes and long-read sequencing technology. Commun Biol 2022;5:991. [PMID: 36127505 PMCID: PMC9489684 DOI: 10.1038/s42003-022-03953-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Accepted: 09/06/2022] [Indexed: 11/13/2022] Open

Affiliation(s)

Akihito Otsuki Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Department of Medical Biochemistry, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan
Yasunobu Okamura Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Noriko Ishida Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Shu Tadaka Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Jun Takayama Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Statistical Genetics Team, RIKEN Center for Advanced Intelligence Project, Nihonbashi 1-chome Mitsui Building 15 F, 1-4-1 Nihonbashi, Chuo-ku, Tokyo, 103-0027, Japan.,Department of AI and Innovative Medicine, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan
Kazuki Kumada Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Junko Kawashima Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Keiko Taguchi Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Department of Medical Biochemistry, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan.,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Naoko Minegishi Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Shinichi Kuriyama Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Gen Tamiya Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Statistical Genetics Team, RIKEN Center for Advanced Intelligence Project, Nihonbashi 1-chome Mitsui Building 15 F, 1-4-1 Nihonbashi, Chuo-ku, Tokyo, 103-0027, Japan.,Department of AI and Innovative Medicine, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan
Kengo Kinoshita Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Department of Medical Biochemistry, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan.,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Graduate School of Information Sciences, Tohoku University, 6-3-09 Aramaki Aza-Aoba, Aoba-ku, Sendai, Miyagi, 980-8579, Japan
Fumiki Katsuoka Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan
Masayuki Yamamoto Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan. .,Department of Medical Biochemistry, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8575, Japan. .,Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.

Collapse

Zhou Y, Yang L, Han X, Han J, Hu Y, Li F, Xia H, Peng L, Boschiero C, Rosen BD, Bickhart DM, Zhang S, Guo A, Van Tassell CP, Smith TPL, Yang L, Liu GE. Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history. Genome Res 2022;32:1585-1601. [PMID: 35977842 PMCID: PMC9435747 DOI: 10.1101/gr.276550.122] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 07/21/2022] [Indexed: 02/03/2023]

Affiliation(s)

Yang Zhou Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Lv Yang Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Xiaotao Han Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Jiazheng Han Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Yan Hu Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Fan Li Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Han Xia Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Lingwei Peng Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Clarissa Boschiero Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, Maryland 20705, USA
Benjamin D Rosen Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, Maryland 20705, USA
Derek M Bickhart Dairy Forage Research Center, ARS USDA, Madison, Wisconsin 53706, USA
Shujun Zhang Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
Aizhen Guo The State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
Curtis P Van Tassell Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, Maryland 20705, USA
Timothy P L Smith U.S. Meat Animal Research Center, ARS USDA, Clay Center, Nebraska 68933, USA
Liguo Yang Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
George E Liu Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, Maryland 20705, USA

Collapse

Li Z, Jiang X, Fang M, Bai Y, Liu S, Huang S, Jin X. CMDB: the comprehensive population genome variation database of China. Nucleic Acids Res 2022;51:D890-D895. [PMID: 35871305 PMCID: PMC9825573 DOI: 10.1093/nar/gkac638] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 07/22/2022] [Indexed: 01/30/2023] Open

Sarwal V, Niehus S, Ayyala R, Kim M, Sarkar A, Chang S, Lu A, Rajkumar N, Darci-Maher N, Littman R, Chhugani K, Soylev A, Comarova Z, Wesel E, Castellanos J, Chikka R, Distler MG, Eskin E, Flint J, Mangul S. A comprehensive benchmarking of WGS-based deletion structural variant callers. Brief Bioinform 2022;23:bbac221. [PMID: 35753701 PMCID: PMC9294411 DOI: 10.1093/bib/bbac221] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 04/30/2022] [Accepted: 05/11/2022] [Indexed: 01/10/2023] Open

Affiliation(s)

Varuni Sarwal Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA Indian Institute of Technology Delhi, Hauz Khas, New Delhi, Delhi 110016, India
Sebastian Niehus Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Str. 2, 10178 Berlin, Germany Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Charitéplatz 1, 10117 Berlin, Germany
Ram Ayyala Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Minyoung Kim Department of Quantitative and Computational Biology, University of Southern California, 1050 Childs Way, Los Angeles, CA 90089
Aditya Sarkar School of Computing and Electrical Engineering, Indian Institute of Technology Mandi, Kamand, Mandi, Himachal Pradesh 175001, India
Sei Chang Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Angela Lu Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Neha Rajkumar Department of Bioengineering, Department of Bioengineering, University of California Los Angeles, Los Angeles, CA, 90095
Nicholas Darci-Maher Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Russell Littman Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Karishma Chhugani Department of Clinical Pharmacy, School of Pharmacy, University of Southern California 1985 Zonal Avenue Los Angeles, CA 90089-9121
Arda Soylev Department of Computer Engineering, Konya Food and Agriculture University, Konya, Turkey
Zoia Comarova Department Civil and Environmental Engineering, University of Southern California, Los Angeles, CA, United States
Emily Wesel Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Jacqueline Castellanos Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Rahul Chikka Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Margaret G Distler Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA
Eleazar Eskin Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, USA Department of Human Genetics, David Geffen School of Medicine at UCLA, 695 Charles E. Young Drive South, Box 708822, Los Angeles, CA, 90095, USA Department of Computational Medicine, David Geffen School of Medicine at UCLA, 73-235 CHS, Los Angeles, CA, 90095, USA
Jonathan Flint Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, 760 Westwood Plaza, Los Angeles, CA 90095, USA
Serghei Mangul Department of Clinical Pharmacy, School of Pharmacy, University of Southern California 1985 Zonal Avenue Los Angeles, CA 90089-9121

Collapse

Lian Q, Chen Y, Chang F, Fu Y, Qi J. inGAP-family: Accurate Detection of Meiotic Recombination Loci and Causal Mutations by Filtering Out Artificial Variants due to Genome Complexities. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022;20:524-535. [PMID: 33711466 PMCID: PMC9801030 DOI: 10.1016/j.gpb.2019.11.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 09/04/2019] [Accepted: 11/08/2019] [Indexed: 01/26/2023]

The Thousand Polish Genomes-A Database of Polish Variant Allele Frequencies. Int J Mol Sci 2022;23:ijms23094532. [PMID: 35562925 PMCID: PMC9104289 DOI: 10.3390/ijms23094532] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 04/13/2022] [Accepted: 04/14/2022] [Indexed: 02/05/2023] Open

Lei Y, Meng Y, Guo X, Ning K, Bian Y, Li L, Hu Z, Anashkina AA, Jiang Q, Dong Y, Zhu X. Overview of structural variation calling: Simulation, identification, and visualization. Comput Biol Med 2022;145:105534. [DOI: 10.1016/j.compbiomed.2022.105534] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Revised: 04/09/2022] [Accepted: 04/14/2022] [Indexed: 12/11/2022]

Smetana J, Brož P. National Genome Initiatives in Europe and the United Kingdom in the Era of Whole-Genome Sequencing: A Comprehensive Review. Genes (Basel) 2022;13:556. [PMID: 35328109 PMCID: PMC8953625 DOI: 10.3390/genes13030556] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 03/17/2022] [Accepted: 03/18/2022] [Indexed: 12/04/2022] Open

Whole-genome sequencing of 1,171 elderly admixed individuals from São Paulo, Brazil. Nat Commun 2022;13:1004. [PMID: 35246524 PMCID: PMC8897431 DOI: 10.1038/s41467-022-28648-3] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 01/21/2022] [Indexed: 02/07/2023] Open

Valls-Margarit J, Galván-Femenía I, Matías-Sánchez D, Blay N, Puiggròs M, Carreras A, Salvoro C, Cortés B, Amela R, Farre X, Lerga-Jaso J, Puig M, Sánchez-Herrero J, Moreno V, Perucho M, Sumoy L, Armengol L, Delaneau O, Cáceres M, de Cid R, Torrents D. GCAT|Panel, a comprehensive structural variant haplotype map of the Iberian population from high-coverage whole-genome sequencing. Nucleic Acids Res 2022;50:2464-2479. [PMID: 35176773 PMCID: PMC8934637 DOI: 10.1093/nar/gkac076] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 12/24/2021] [Accepted: 02/09/2022] [Indexed: 11/17/2022] Open

Affiliation(s)

Jordi Valls-Margarit
Iván Galván-Femenía
Daniel Matías-Sánchez
Natalia Blay Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
Montserrat Puiggròs Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
Anna Carreras Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
Cecilia Salvoro Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
Beatriz Cortés Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
Ramon Amela Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
Xavier Farre Genomes for Life-GCAT lab Group, Institute for Health Science Research Germans Trias i Pujol (IGTP), Badalona 08916, Spain
Jon Lerga-Jaso Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
Marta Puig Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
Jose Francisco Sánchez-Herrero High Content Genomics and Bioinformatics Unit, Institute for Health Science Research Germans Trias i Pujol (IGTP), 08916 Badalona, Spain
Victor Moreno Catalan Institute of Oncology, Hospitalet del Llobregat, 08908, Spain Bellvitge Biomedical Research Institute (IDIBELL), Hospitalet del Llobregat, 08908, Spain CIBER Epidemiología y Salud Pública (CIBERESP), Madrid 28029, Spain Universitat de Barcelona (UB), Barcelona 08007, Spain
Manuel Perucho Sanford Burnham Prebys Medical Discovery Institute (SBP), La Jolla, CA 92037, USA Cancer Genetics and Epigenetics, Program of Predictive and Personalized Medicine of Cancer (PMPPC), Health Science Research Institute Germans Trias i Pujol (IGTP), Badalona 08916, Spain
Lauro Sumoy High Content Genomics and Bioinformatics Unit, Institute for Health Science Research Germans Trias i Pujol (IGTP), 08916 Badalona, Spain
Lluís Armengol Quantitative Genomic Medicine Laboratories (qGenomics), Esplugues del Llobregat, 08950, Spain
Olivier Delaneau Department of Computational Biology, University of Lausanne, Génopode, 1015 Lausanne, Switzerland Swiss Institute of Bioinformatics (SIB), University of Lausanne, Quartier Sorge – Batiment Amphipole, 1015 Lausanne, Switzerland
Mario Cáceres Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain ICREA, Barcelona 08010, Spain
Rafael de Cid Correspondence may also be addressed to Rafael de Cid. Tel: +34 930330542;
David Torrents To whom correspondence should be addressed. Tel: +34 934134074;

Collapse

Henriksen RA, Jenjaroenpun P, Sjøstrøm IB, Jensen KR, Prada-Luengo I, Wongsurawat T, Nookaew I, Regenberg B. Circular DNA in the human germline and its association with recombination. Mol Cell 2022;82:209-217.e7. [PMID: 34951964 PMCID: PMC10707452 DOI: 10.1016/j.molcel.2021.11.027] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 08/24/2021] [Accepted: 11/23/2021] [Indexed: 12/24/2022]

The correctness of large scale analysis of genomic data. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES 2021. [DOI: 10.2478/fcds-2021-0024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Zhang T, Li Q, Dong B, Liang X, Jia M, Bai J, Yu J, Fu S. Genetic Polymorphism of Drug Metabolic Gene CYPs, VKORC1, NAT2, DPYD and CHST3 of Five Ethnic Minorities in Heilongjiang Province, Northeast China. Pharmgenomics Pers Med 2021;14:1537-1547. [PMID: 34876832 PMCID: PMC8643223 DOI: 10.2147/pgpm.s339854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 11/05/2021] [Indexed: 11/23/2022] Open

Abstract

Introduction

Genetic variability in genes encoding drug-metabolizing enzymes may contribute to the heterogeneity of drug responses in different populations. Extensive research in pharmacogenomics in major populations around the world provides us with a great deal of information about drug-related genetic polymorphisms.

Objective

The purpose of this study was to detect the genetic variation of drug-metabolism-related genes in the five ethnic minorities Daur, Hezhen, Ewenki, Mongolian and Manchu in China, and to analyze the distribution differences among ethnic groups.

Methods

We genotyped 32 SNPs of drug metabolism genes in 882 healthy Chinese volunteers from five ethnic groups. The genotype frequency and allele frequency of the five ethnic groups were calculated, and the different variants among the five ethnic groups were compared by chi-square test. Genetic parameters were analyzed using Popgene software. The genetic structure of five ethnic minorities was analyzed by principal component analysis, and compared with 26 populations.

Results

We found that SNPs of genes related to drug metabolism existed diversity in different populations. Among them, rs8192766 and rs9419082 in CYP2E1 showed statistical differences between Daur and Manchu, and NAT2 rs1801280 showed statistical differences between Hezhen and Mongolian. In addition, the five populations we studied had the smallest differences with EAS populations. There was haplotype diversity in CHST3, VKORC1, CYP1A2 and CYP2E1 genes in the five ethnic minorities, and these haplotype polymorphisms were related to the use of corresponding drug doses. Cluster analysis shows that the five ethnic minorities in Heilongjiang Province are clustered together with the EAS populations.

Conclusion

These results suggest that understanding the diversity of drug-related genetic markers is critical for individualized drug gene therapy programs in ethnic minorities in China as well as in populations highly mixed with these ethnic groups.

Collapse

Affiliation(s)

Tingting Zhang Laboratory of Medical Genetics, Harbin Medical University, Harbin, People's Republic of China.,Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China
Qiuyan Li Laboratory of Medical Genetics, Harbin Medical University, Harbin, People's Republic of China.,Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China.,Editorial Department of International Journal of Genetics, Harbin Medical University, Harbin, People's Republic of China
Bonan Dong Laboratory of Medical Genetics, Harbin Medical University, Harbin, People's Republic of China.,Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China
Xiao Liang Laboratory of Medical Genetics, Harbin Medical University, Harbin, People's Republic of China.,Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China
Mansha Jia Scientific Research Centre, The Second Affiliated Hospital of Harbin Medical University, Harbin, People's Republic of China
Jing Bai Laboratory of Medical Genetics, Harbin Medical University, Harbin, People's Republic of China.,Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China
Jingcui Yu Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China.,Scientific Research Centre, The Second Affiliated Hospital of Harbin Medical University, Harbin, People's Republic of China
Songbin Fu Laboratory of Medical Genetics, Harbin Medical University, Harbin, People's Republic of China.,Key Laboratory of Preservation of Human Genetic Resources and Disease Control in China (Harbin Medical University), Ministry of Education, Harbin, People's Republic of China

Collapse

Zverinova S, Guryev V. Variant calling: Considerations, practices, and developments. Hum Mutat 2021;43:976-985. [PMID: 34882898 PMCID: PMC9545713 DOI: 10.1002/humu.24311] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 11/02/2021] [Accepted: 12/03/2021] [Indexed: 11/10/2022]

Krannich T, White WTJ, Niehus S, Holley G, Halldórsson BV, Kehr B. Population-scale detection of non-reference sequence variants using colored de Bruijn graphs. Bioinformatics 2021;38:604-611. [PMID: 34726732 PMCID: PMC8756200 DOI: 10.1093/bioinformatics/btab749] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 09/27/2021] [Accepted: 10/28/2021] [Indexed: 02/03/2023] Open

Miga KH, Wang T. The Need for a Human Pangenome Reference Sequence. Annu Rev Genomics Hum Genet 2021;22:81-102. [PMID: 33929893 PMCID: PMC8410644 DOI: 10.1146/annurev-genom-120120-081921] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Li Q, Tian S, Yan B, Liu CM, Lam TW, Li R, Luo R. Building a Chinese pan-genome of 486 individuals. Commun Biol 2021;4:1016. [PMID: 34462542 PMCID: PMC8405635 DOI: 10.1038/s42003-021-02556-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Accepted: 08/13/2021] [Indexed: 02/07/2023] Open

McDonald TL, Zhou W, Castro CP, Mumm C, Switzenberg JA, Mills RE, Boyle AP. Cas9 targeted enrichment of mobile elements using nanopore sequencing. Nat Commun 2021;12:3586. [PMID: 34117247 PMCID: PMC8196195 DOI: 10.1038/s41467-021-23918-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 05/25/2021] [Indexed: 02/05/2023] Open

Giles HH, Hegde MR, Lyon E, Stanley CM, Kerr ID, Garlapow ME, Eggington JM. The Science and Art of Clinical Genetic Variant Classification and Its Impact on Test Accuracy. Annu Rev Genomics Hum Genet 2021;22:285-307. [PMID: 33900788 DOI: 10.1146/annurev-genom-121620-082709] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Charon C, Allodji R, Meyer V, Deleuze JF. Impact of pre- and post-variant filtration strategies on imputation. Sci Rep 2021;11:6214. [PMID: 33737531 PMCID: PMC7973508 DOI: 10.1038/s41598-021-85333-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 02/22/2021] [Indexed: 01/04/2023] Open

Feng X, Li H. Higher Rates of Processed Pseudogene Acquisition in Humans and Three Great Apes Revealed by Long-Read Assemblies. Mol Biol Evol 2021;38:2958-2966. [PMID: 33681998 DOI: 10.1093/molbev/msab062] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Torkamaneh D, Belzile F. Accurate Imputation of Untyped Variants from Deep Sequencing Data. Methods Mol Biol 2021;2243:271-281. [PMID: 33606262 DOI: 10.1007/978-1-0716-1103-6_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]

Chen L, Pryce JE, Hayes BJ, Daetwyler HD. Investigating the Effect of Imputed Structural Variants from Whole-Genome Sequence on Genome-Wide Association and Genomic Prediction in Dairy Cattle. Animals (Basel) 2021;11:ani11020541. [PMID: 33669735 PMCID: PMC7922624 DOI: 10.3390/ani11020541] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 02/09/2021] [Accepted: 02/12/2021] [Indexed: 02/06/2023] Open

Abstract

Simple Summary

Structural variants are large changes to the DNA sequences that differ from individual to individual. We discovered and quality-controlled a set of 24,908 structural variants and used a technique called imputation to infer them into 35,588 Holstein and Jersey cattle. We then investigated whether the structural variants affected key dairy cattle traits such as milk production, fertility and overall conformation. Structural variants explained generally less than 10 percent of the phenotypic variation in these traits. Four of the structural variants were significantly associated with dairy cattle production traits. However, the inclusion of the structural variants in the genomic prediction model did not increase genomic prediction accuracy.

Abstract

Structural variations (SVs) are large DNA segments of deletions, duplications, copy number variations, inversions and translocations in a re-sequenced genome compared to a reference genome. They have been found to be associated with several complex traits in dairy cattle and could potentially help to improve genomic prediction accuracy of dairy traits. Imputation of SVs was performed in individuals genotyped with single-nucleotide polymorphism (SNP) panels without the expense of sequencing them. In this study, we generated 24,908 high-quality SVs in a total of 478 whole-genome sequenced Holstein and Jersey cattle. We imputed 4489 SVs with R2 > 0.5 into 35,568 Holstein and Jersey dairy cattle with 578,999 SNPs with two pipelines, FImpute and Eagle2.3-Minimac3. Genome-wide association studies for production, fertility and overall type with these 4489 SVs revealed four significant SVs, of which two were highly linked to significant SNP. We also estimated the variance components for SNP and SV models for these traits using genomic best linear unbiased prediction (GBLUP). Furthermore, we assessed the effect on genomic prediction accuracy of adding SVs to GBLUP models. The estimated percentage of genetic variance captured by SVs for production traits was up to 4.57% for milk yield in bulls and 3.53% for protein yield in cows. Finally, no consistent increase in genomic prediction accuracy was observed when including SVs in GBLUP.

Collapse

Y chromosome structural variation in infertile men detected by targeted next-generation sequencing. J Assist Reprod Genet 2021;38:941-948. [PMID: 33454900 DOI: 10.1007/s10815-020-02031-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 12/08/2020] [Indexed: 01/21/2023] Open

Lee YG, Lee JY, Kim J, Kim YJ. Insertion variants missing in the human reference genome are widespread among human populations. BMC Biol 2020;18:167. [PMID: 33187521 PMCID: PMC7666470 DOI: 10.1186/s12915-020-00894-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 10/09/2020] [Indexed: 01/07/2023] Open

Abstract

Background

Structural variants comprise diverse genomic arrangements including deletions, insertions, inversions, and translocations, which can generally be detected in humans through sequence comparison to the reference genome. Among structural variants, insertions are the least frequently identified variants, mainly due to ascertainment bias in the reference genome, lack of previous sequence knowledge, and low complexity of typical insertion sequences. Though recent developments in long-read sequencing deliver promise in annotating individual non-reference insertions, population-level catalogues on non-reference insertion variants have not been identified and the possible functional roles of these hidden variants remain elusive.

Results

To detect non-reference insertion variants, we developed a pipeline, InserTag, which generates non-reference contigs by local de novo assembly and then infers the full-sequence of insertion variants by tracing contigs from non-human primates and other human genome assemblies. Application of the pipeline to data from 2535 individuals of the 1000 Genomes Project helped identify 1696 non-reference insertion variants and re-classify the variants as retention of ancestral sequences or novel sequence insertions based on the ancestral state. Genotyping of the variants showed that individuals had, on average, 0.92-Mbp sequences missing from the reference genome, 92% of the variants were common (allele frequency > 5%) among human populations, and more than half of the variants were major alleles. Among human populations, African populations were the most divergent and had the most non-reference sequences, which was attributed to the greater prevalence of high-frequency insertion variants. The subsets of insertion variants were in high linkage disequilibrium with phenotype-associated SNPs and showed signals of recent continent-specific selection.

Conclusions

Non-reference insertion variants represent an important type of genetic variation in the human population, and our developed pipeline, InserTag, provides the frameworks for the detection and genotyping of non-reference sequences missing from human populations.

Supplementary information

Supplementary information accompanies this paper at 10.1186/s12915-020-00894-1.

Collapse

Crysnanto D, Pausch H. Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery. Genome Biol 2020;21:184. [PMID: 32718320 PMCID: PMC7385871 DOI: 10.1186/s13059-020-02105-0%0a%0a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Accepted: 07/14/2020] [Indexed: 09/28/2023] Open

Abstract

BACKGROUND

The current bovine genomic reference sequence was assembled from a Hereford cow. The resulting linear assembly lacks diversity because it does not contain allelic variation, a drawback of linear references that causes reference allele bias. High nucleotide diversity and the separation of individuals by hundreds of breeds make cattle ideally suited to investigate the optimal composition of variation-aware references.

RESULTS

We augment the bovine linear reference sequence (ARS-UCD1.2) with variants filtered for allele frequency in dairy (Brown Swiss, Holstein) and dual-purpose (Fleckvieh, Original Braunvieh) cattle breeds to construct either breed-specific or pan-genome reference graphs using the vg toolkit. We find that read mapping is more accurate to variation-aware than linear references if pre-selected variants are used to construct the genome graphs. Graphs that contain random variants do not improve read mapping over the linear reference sequence. Breed-specific augmented and pan-genome graphs enable almost similar mapping accuracy improvements over the linear reference. We construct a whole-genome graph that contains the Hereford-based reference sequence and 14 million alleles that have alternate allele frequency greater than 0.03 in the Brown Swiss cattle breed. Our novel variation-aware reference facilitates accurate read mapping and unbiased sequence variant genotyping for SNPs and Indels.

CONCLUSIONS

We develop the first variation-aware reference graph for an agricultural animal ( https://doi.org/10.5281/zenodo.3759712 ). Our novel reference structure improves sequence read mapping and variant genotyping over the linear reference. Our work is a first step towards the transition from linear to variation-aware reference structures in species with high genetic diversity and many sub-populations.

Collapse

Crysnanto D, Pausch H. Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery. Genome Biol 2020;21:184. [PMID: 32718320 PMCID: PMC7385871 DOI: 10.1186/s13059-020-02105-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Accepted: 07/14/2020] [Indexed: 12/19/2022] Open

Abstract

BACKGROUND

RESULTS

CONCLUSIONS

Collapse

Abel HJ, Larson DE, Regier AA, Chiang C, Das I, Kanchi KL, Layer RM, Neale BM, Salerno WJ, Reeves C, Buyske S, Matise TC, Muzny DM, Zody MC, Lander ES, Dutcher SK, Stitziel NO, Hall IM. Mapping and characterization of structural variation in 17,795 human genomes. Nature 2020;583:83-89. [PMID: 32460305 PMCID: PMC7547914 DOI: 10.1038/s41586-020-2371-0] [Citation(s) in RCA: 159] [Impact Index Per Article: 39.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Accepted: 05/18/2020] [Indexed: 12/18/2022]

Affiliation(s)

Haley J Abel McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
David E Larson McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
Allison A Regier McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
Colby Chiang McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
Indraniel Das McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
Krishna L Kanchi McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
Ryan M Layer BioFrontiers Institute, University of Colorado, Boulder, CO, USA Department of Computer Science, University of Colorado, Boulder, CO, USA
Benjamin M Neale Broad Institute of MIT and Harvard, Cambridge, MA, USA Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
William J Salerno Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Catherine Reeves New York Genome Center, New York, NY, USA
Steven Buyske Department of Statistics, Rutgers University, Piscataway, NJ, USA
Tara C Matise Department of Genetics, Rutgers University, Piscataway, NJ, USA
Donna M Muzny Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Michael C Zody New York Genome Center, New York, NY, USA
Eric S Lander Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA Department of Systems Biology, Harvard Medical School, Boston, MA, USA
Susan K Dutcher McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
Nathan O Stitziel McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA Department of Genetics, Washington University School of Medicine, St Louis, MO, USA Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
Ira M Hall McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA. Department of Genetics, Washington University School of Medicine, St Louis, MO, USA. Department of Medicine, Washington University School of Medicine, St Louis, MO, USA.

Collapse

Louzada S, Algady W, Weyell E, Zuccherato LW, Brajer P, Almalki F, Scliar MO, Naslavsky MS, Yamamoto GL, Duarte YAO, Passos-Bueno MR, Zatz M, Yang F, Hollox EJ. Structural variation of the malaria-associated human glycophorin A-B-E region. BMC Genomics 2020;21:446. [PMID: 32600246 PMCID: PMC7325229 DOI: 10.1186/s12864-020-06849-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 06/18/2020] [Indexed: 12/18/2022] Open

Abstract

BACKGROUND

Approximately 5% of the human genome shows common structural variation, which is enriched for genes involved in the immune response and cell-cell interactions. A well-established region of extensive structural variation is the glycophorin gene cluster, comprising three tandemly-repeated regions about 120 kb in length and carrying the highly homologous genes GYPA, GYPB and GYPE. Glycophorin A (encoded by GYPA) and glycophorin B (encoded by GYPB) are glycoproteins present at high levels on the surface of erythrocytes, and they have been suggested to act as decoy receptors for viral pathogens. They are receptors for the invasion of the protist parasite Plasmodium falciparum, a causative agent of malaria. A particular complex structural variant, called DUP4, creates a GYPB-GYPA fusion gene known to confer resistance to malaria. Many other structural variants exist across the glycophorin gene cluster, and they remain poorly characterised.

RESULTS

Here, we analyse sequences from 3234 diploid genomes from across the world for structural variation at the glycophorin locus, confirming 15 variants in the 1000 Genomes project cohort, discovering 9 new variants, and characterising a selection of these variants using fibre-FISH and breakpoint mapping at the sequence level. We identify variants predicted to create novel fusion genes and a common inversion duplication variant at appreciable frequencies in West Africans. We show that almost all variants can be explained by non-allelic homologous recombination and by comparing the structural variant breakpoints with recombination hotspot maps, confirm the importance of a particular meiotic recombination hotspot on structural variant formation in this region.

CONCLUSIONS

We identify and validate large structural variants in the human glycophorin A-B-E gene cluster which may be associated with different clinical aspects of malaria.

Collapse

Affiliation(s)

Sandra Louzada Wellcome Sanger Institute, Hinxton, Cambridge, UK Present address: Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology, University of Trás-os-Montes and Alto Douro (UTAD), Vila Real, Portugal Present address: BioISI - Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, Lisbon, Portugal
Walid Algady Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
Eleanor Weyell Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
Luciana W Zuccherato Department of Pathology, Faculty of Medicine, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Paulina Brajer Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
Faisal Almalki Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
Marilia O Scliar Human Genome and Stem Cell Research Center, Department of Genetics and Evolutionary Biology, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
Michel S Naslavsky Human Genome and Stem Cell Research Center, Department of Genetics and Evolutionary Biology, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
Guilherme L Yamamoto Human Genome and Stem Cell Research Center, Department of Genetics and Evolutionary Biology, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
Yeda A O Duarte School of Nursing, Universidade de São Paulo, São Paulo, Brazil
Maria Rita Passos-Bueno Human Genome and Stem Cell Research Center, Department of Genetics and Evolutionary Biology, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
Mayana Zatz Human Genome and Stem Cell Research Center, Department of Genetics and Evolutionary Biology, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
Fengtang Yang Wellcome Sanger Institute, Hinxton, Cambridge, UK
Edward J Hollox Department of Genetics and Genome Biology, University of Leicester, Leicester, UK.

Collapse

Jakubosky D, Smith EN, D'Antonio M, Jan Bonder M, Young Greenwald WW, D'Antonio-Chronowska A, Matsui H, Stegle O, Montgomery SB, DeBoever C, Frazer KA. Discovery and quality analysis of a comprehensive set of structural variants and short tandem repeats. Nat Commun 2020;11:2928. [PMID: 32522985 PMCID: PMC7287045 DOI: 10.1038/s41467-020-16481-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 05/05/2020] [Indexed: 02/07/2023] Open

Linder RA, Majumder A, Chakraborty M, Long A. Two Synthetic 18-Way Outcrossed Populations of Diploid Budding Yeast with Utility for Complex Trait Dissection. Genetics 2020;215:323-342. [PMID: 32241804 PMCID: PMC7268983 DOI: 10.1534/genetics.120.303202] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Accepted: 03/31/2020] [Indexed: 02/07/2023] Open

Abstract

Advanced-generation multiparent populations (MPPs) are a valuable tool for dissecting complex traits, having more power than genome-wide association studies to detect rare variants and higher resolution than F2 linkage mapping. To extend the advantages of MPPs in budding yeast, we describe the creation and characterization of two outbred MPPs derived from 18 genetically diverse founding strains. We carried out de novo assemblies of the genomes of the 18 founder strains, such that virtually all variation segregating between these strains is known, and represented those assemblies as Santa Cruz Genome Browser tracks. We discovered complex patterns of structural variation segregating among the founders, including a large deletion within the vacuolar ATPase VMA1, several different deletions within the osmosensor MSB2, a series of deletions and insertions at PRM7 and the adjacent BSC1, as well as copy number variation at the dehydrogenase ALD2 Resequenced haploid recombinant clones from the two MPPs have a median unrecombined block size of 66 kb, demonstrating that the population is highly recombined. We pool-sequenced the two MPPs to 3270× and 2226× coverage and demonstrated that we can accurately estimate local haplotype frequencies using pooled data. We further downsampled the pool-sequenced data to ∼20-40× and showed that local haplotype frequency estimates remained accurate, with median error rates 0.8 and 0.6% at 20× and 40×, respectively. Haplotypes frequencies are estimated much more accurately than SNP frequencies obtained directly from the same data. Deep sequencing of the two populations revealed that 10 or more founders are present at a detectable frequency for > 98% of the genome, validating the utility of this resource for the exploration of the role of standing variation in the architecture of complex traits.

Collapse

Mapping and characterization of structural variation in 17,795 human genomes. Nature 2020. [PMID: 32460305 DOI: 10.1038/s41586‐020‐2371‐0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Collins RL, Brand H, Karczewski KJ, Zhao X, Alföldi J, Francioli LC, Khera AV, Lowther C, Gauthier LD, Wang H, Watts NA, Solomonson M, O'Donnell-Luria A, Baumann A, Munshi R, Walker M, Whelan CW, Huang Y, Brookings T, Sharpe T, Stone MR, Valkanas E, Fu J, Tiao G, Laricchia KM, Ruano-Rubio V, Stevens C, Gupta N, Cusick C, Margolin L, Taylor KD, Lin HJ, Rich SS, Post WS, Chen YDI, Rotter JI, Nusbaum C, Philippakis A, Lander E, Gabriel S, Neale BM, Kathiresan S, Daly MJ, Banks E, MacArthur DG, Talkowski ME. A structural variation reference for medical and population genetics. Nature 2020;581:444-451. [PMID: 32461652 PMCID: PMC7334194 DOI: 10.1038/s41586-020-2287-8] [Citation(s) in RCA: 516] [Impact Index Per Article: 129.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Accepted: 03/31/2020] [Indexed: 12/16/2022]

Abstract

Structural variants (SVs) rearrange large segments of DNA¹ and can have profound consequences in evolution and human disease^2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)⁴ have become integral in the interpretation of single-nucleotide variants (SNVs)⁵. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25–29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage⁶. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings⁷. This SV resource is freely distributed via the gnomAD browser⁸ and will have broad utility in population genetics, disease-association studies, and diagnostic screening.

A large empirical assessment of sequence-resolved structural variants from 14,891 genomes across diverse global populations in the Genome Aggregation Database (gnomAD) provides a reference map for disease-association studies, population genetics, and diagnostic screening.

Collapse

Affiliation(s)

Ryan L Collins Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
Harrison Brand Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Konrad J Karczewski Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Xuefang Zhao Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Jessica Alföldi Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Laurent C Francioli Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Department of Medicine, Harvard Medical School, Boston, MA, USA
Amit V Khera Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Chelsea Lowther Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Laura D Gauthier Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Harold Wang Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Nicholas A Watts Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Matthew Solomonson Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Anne O'Donnell-Luria Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Alexander Baumann Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Ruchi Munshi Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Mark Walker Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Christopher W Whelan Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Yongqing Huang Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Ted Brookings Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Ted Sharpe Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Matthew R Stone Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Elise Valkanas Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
Jack Fu Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Grace Tiao Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Kristen M Laricchia Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Valentin Ruano-Rubio Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Christine Stevens Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Namrata Gupta Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Caroline Cusick Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Lauren Margolin Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA


Kent D Taylor The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
Henry J Lin The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
Stephen S Rich Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
Wendy S Post Johns Hopkins University School of Medicine, Baltimore, MD, USA
Yii-Der Ida Chen The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
Jerome I Rotter The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
Chad Nusbaum Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Cellarity Inc., Cambridge, MA, USA
Anthony Philippakis Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Eric Lander Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Systems Biology, Harvard Medical School, Boston, MA, USA.,Department of Biology, MIT, Cambridge, MA, USA
Stacey Gabriel Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Benjamin M Neale Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Sekar Kathiresan Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Department of Medicine, Harvard Medical School, Boston, MA, USA.,Division of Cardiology, Massachusetts General Hospital, Boston, MA, USA
Mark J Daly Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Eric Banks Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Daniel G MacArthur Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.,Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Department of Medicine, Harvard Medical School, Boston, MA, USA.,Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, Australia.,Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia
Michael E Talkowski Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA. .,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA. .,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA. .,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Collapse

Eizenga JM, Novak AM, Sibbesen JA, Heumos S, Ghaffaari A, Hickey G, Chang X, Seaman JD, Rounthwaite R, Ebler J, Rautiainen M, Garg S, Paten B, Marschall T, Sirén J, Garrison E. Pangenome Graphs. Annu Rev Genomics Hum Genet 2020;21:139-162. [PMID: 32453966 DOI: 10.1146/annurev-genom-120219-080406] [Citation(s) in RCA: 100] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Affiliation(s)

Jordan M Eizenga Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Adam M Novak Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Jonas A Sibbesen Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Simon Heumos Quantitative Biology Center, University of Tübingen, 72076 Tübingen, Germany
Ali Ghaffaari Center for Bioinformatics, Saarland University, 66123 Saarbrücken, Germany.,Max Planck Institute for Informatics, 66123 Saarbrücken, Germany.,Saarbrücken Graduate School for Computer Science, Saarland University, 66123 Saarbrücken, Germany
Glenn Hickey Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Xian Chang Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Josiah D Seaman Royal Botanic Gardens, Kew, Richmond TW9 3AB, United Kingdom.,School of Biological and Chemical Sciences, Queen Mary University of London, London E1 4NS, United Kingdom
Robin Rounthwaite Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Jana Ebler Center for Bioinformatics, Saarland University, 66123 Saarbrücken, Germany.,Max Planck Institute for Informatics, 66123 Saarbrücken, Germany.,Saarbrücken Graduate School for Computer Science, Saarland University, 66123 Saarbrücken, Germany
Mikko Rautiainen Center for Bioinformatics, Saarland University, 66123 Saarbrücken, Germany.,Max Planck Institute for Informatics, 66123 Saarbrücken, Germany.,Saarbrücken Graduate School for Computer Science, Saarland University, 66123 Saarbrücken, Germany
Shilpa Garg Departments of Genetics and Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02215, USA.,Department of Data Sciences, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
Benedict Paten Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Tobias Marschall Center for Bioinformatics, Saarland University, 66123 Saarbrücken, Germany.,Max Planck Institute for Informatics, 66123 Saarbrücken, Germany
Jouni Sirén Genomics Institute, University of California, Santa Cruz, California 95064, USA;
Erik Garrison Genomics Institute, University of California, Santa Cruz, California 95064, USA;

Collapse

Determining the impact of uncharacterized inversions in the human genome by droplet digital PCR. Genome Res 2020;30:724-735. [PMID: 32424072 PMCID: PMC7263195 DOI: 10.1101/gr.255273.119] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 04/17/2020] [Indexed: 12/20/2022]

Sherman RM, Salzberg SL. Pan-genomics in the human genome era. Nat Rev Genet 2020;21:243-254. [PMID: 32034321 PMCID: PMC7752153 DOI: 10.1038/s41576-020-0210-7] [Citation(s) in RCA: 147] [Impact Index Per Article: 36.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/02/2020] [Indexed: 12/25/2022]

Ho SS, Urban AE, Mills RE. Structural variation in the sequencing era. Nat Rev Genet 2020;21:171-189. [PMID: 31729472 PMCID: PMC7402362 DOI: 10.1038/s41576-019-0180-9] [Citation(s) in RCA: 280] [Impact Index Per Article: 70.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2019] [Indexed: 12/13/2022]

Alanko J, Bannai H, Cazaux B, Peterlongo P, Stoye J. Finding all maximal perfect haplotype blocks in linear time. Algorithms Mol Biol 2020;15:2. [PMID: 32055252 PMCID: PMC7008532 DOI: 10.1186/s13015-020-0163-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 01/28/2020] [Indexed: 11/10/2022] Open