1
|
Ungar RA, Goddard PC, Jensen TD, Degalez F, Smith KS, Jin CA, Bonner DE, Bernstein JA, Wheeler MT, Montgomery SB. Impact of genome build on RNA-seq interpretation and diagnostics. Am J Hum Genet 2024; 111:1282-1300. [PMID: 38834072 PMCID: PMC11267525 DOI: 10.1016/j.ajhg.2024.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 05/04/2024] [Accepted: 05/06/2024] [Indexed: 06/06/2024] Open
Abstract
Transcriptomics is a powerful tool for unraveling the molecular effects of genetic variants and disease diagnosis. Prior studies have demonstrated that choice of genome build impacts variant interpretation and diagnostic yield for genomic analyses. To identify the extent genome build also impacts transcriptomics analyses, we studied the effect of the hg19, hg38, and CHM13 genome builds on expression quantification and outlier detection in 386 rare disease and familial control samples from both the Undiagnosed Diseases Network and Genomics Research to Elucidate the Genetics of Rare Disease Consortium. Across six routinely collected biospecimens, 61% of quantified genes were not influenced by genome build. However, we identified 1,492 genes with build-dependent quantification, 3,377 genes with build-exclusive expression, and 9,077 genes with annotation-specific expression across six routinely collected biospecimens, including 566 clinically relevant and 512 known OMIM genes. Further, we demonstrate that between builds for a given gene, a larger difference in quantification is well correlated with a larger change in expression outlier calling. Combined, we provide a database of genes impacted by build choice and recommend that transcriptomics-guided analyses and diagnoses are cross referenced with these data for robustness.
Collapse
Affiliation(s)
- Rachel A Ungar
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA; Department of Pathology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Pagé C Goddard
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA; Department of Pathology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Tanner D Jensen
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA; Department of Pathology, School of Medicine, Stanford University, Stanford, CA, USA
| | | | - Kevin S Smith
- Department of Pathology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Christopher A Jin
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA
| | - Devon E Bonner
- Department of Pediatrics, School of Medicine, Stanford University, Stanford, CA, USA; Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, USA
| | - Jonathan A Bernstein
- Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, USA
| | - Matthew T Wheeler
- Department of Cardiovascular Medicine, School of Medicine, Stanford University, Stanford, CA, USA
| | - Stephen B Montgomery
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA; Department of Pathology, School of Medicine, Stanford University, Stanford, CA, USA; Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
| |
Collapse
|
2
|
Ungar RA, Goddard PC, Jensen TD, Degalez F, Smith KS, Jin CA, Bonner DE, Bernstein JA, Wheeler MT, Montgomery SB. Impact of genome build on RNA-seq interpretation and diagnostics. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.01.11.24301165. [PMID: 38260490 PMCID: PMC10802764 DOI: 10.1101/2024.01.11.24301165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Transcriptomics is a powerful tool for unraveling the molecular effects of genetic variants and disease diagnosis. Prior studies have demonstrated that choice of genome build impacts variant interpretation and diagnostic yield for genomic analyses. To identify the extent genome build also impacts transcriptomics analyses, we studied the effect of the hg19, hg38, and CHM13 genome builds on expression quantification and outlier detection in 386 rare disease and familial control samples from both the Undiagnosed Diseases Network (UDN) and Genomics Research to Elucidate the Genetics of Rare Disease (GREGoR) Consortium. We identified 2,800 genes with build-dependent quantification across six routinely-collected biospecimens, including 1,391 protein-coding genes and 341 known rare disease genes. We further observed multiple genes that only have detectable expression in a subset of genome builds. Finally, we characterized how genome build impacts the detection of outlier transcriptomic events. Combined, we provide a database of genes impacted by build choice, and recommend that transcriptomics-guided analyses and diagnoses are cross-referenced with these data for robustness.
Collapse
Affiliation(s)
- Rachel A. Ungar
- Department of Genetics, School of Medicine, Stanford University
- Department of Pathology, School of Medicine, Stanford University
| | - Pagé C. Goddard
- Department of Genetics, School of Medicine, Stanford University
- Department of Pathology, School of Medicine, Stanford University
| | - Tanner D. Jensen
- Department of Genetics, School of Medicine, Stanford University
- Department of Pathology, School of Medicine, Stanford University
| | | | - Kevin S. Smith
- Department of Pathology, School of Medicine, Stanford University
| | | | | | - Devon E. Bonner
- Department of Pediatrics, School of Medicine, Stanford University
- Stanford Center for Undiagnosed Diseases, Stanford University
| | | | - Matthew T. Wheeler
- Department of Cardiovascular Medicine, School of Medicine, Stanford University
| | - Stephen B. Montgomery
- Department of Genetics, School of Medicine, Stanford University
- Department of Pathology, School of Medicine, Stanford University
- Department of Biomedical Data Science, Stanford University
| |
Collapse
|
3
|
Dubowsky JG, Estevez JJ, Craig JE, Appukuttan B, Carr JM. Disease profiles in the Indigenous Australian population are suggestive of a common complement control haplotype. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2023:105453. [PMID: 37245779 DOI: 10.1016/j.meegid.2023.105453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 05/05/2023] [Accepted: 05/23/2023] [Indexed: 05/30/2023]
Abstract
Aboriginal and Torres Strait Islander People (respectfully referred to as Indigenous Australians herein) are disparately burdened by many infectious and chronic diseases relative to Australians with European genetic ancestry. Some of these diseases are described in other populations to be influenced by the inherited profile of complement genes. These include complement factor B, H, I and complement factor H-related (CFHR) genes that can contribute to a polygenic complotype. Here the focus is on the combined deletion of CFHR1 and 3 to form a common haplotype (CFHR3-1Δ). The prevalence of CFHR3-1Δ is high in people with Nigerian and African American genetic ancestry and correlates to a higher frequency and severity of systemic lupus erythematosus (SLE) but a lower prevalence of age-related macular degeneration (AMD) and IgA-nephropathy (IgAN). This pattern of disease is similarly observed among Indigenous Australian communities. Additionally, the CFHR3-1Δ complotype is also associated with increased susceptibility to infection with pathogens, such as Neisseria meningitidis and Streptococcus pyogenes, which also have high incidences in Indigenous Australian communities. The prevalence of these diseases, while likely influenced by social, political, environmental and biological factors, including variants in other components of the complement system, may also be suggestive of the CFHR3-1Δ haplotype in Indigenous Australians. These data highlight a need to define the Indigenous Australian complotypes, which may lead to the discovery of new risk factors for common diseases and progress towards precision medicines for treating complement-associated diseases in Indigenous and non-Indigenous populations. Herein, the disease profiles suggestive of a common complement CFHR3-1Δ control haplotype are examined.
Collapse
Affiliation(s)
- Joshua G Dubowsky
- Microbiology and Infectious Diseases, College of Medicine and Public Health, and Flinders Health and Medical Research Institute, Flinders University, Bedford Park, South Australia, Australia
| | - Jose J Estevez
- Wardliparingga Aboriginal Health Equity Theme, South Australia Health and Medical Research Institute, Adelaide, South Australia, Australia; Flinders Centre for Ophthalmology, Eye and Vision Research, Department of Ophthalmology, Flinders University, Bedford Park, South Australia, Australia; Caring Futures Institute, College of Nursing and Health Sciences, Optometry and Vision Science, Flinders University, Adelaide, Australia
| | - Jamie E Craig
- Flinders Centre for Ophthalmology, Eye and Vision Research, Department of Ophthalmology, Flinders University, Bedford Park, South Australia, Australia
| | - Binoy Appukuttan
- Molecular Medical Science, College of Medicine and Public Health, Flinders University, Bedford Park, South Australia, Australia
| | - Jillian M Carr
- Microbiology and Infectious Diseases, College of Medicine and Public Health, and Flinders Health and Medical Research Institute, Flinders University, Bedford Park, South Australia, Australia.
| |
Collapse
|