1
|
Kwon S, Safer J, Nguyen DT, Hoksza D, May P, Arbesfeld JA, Rubin AF, Campbell AJ, Burgin A, Iqbal S. Genomics 2 Proteins portal: A resource and discovery tool for linking genetic screening outputs to protein sequences and structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.02.573913. [PMID: 38260256 PMCID: PMC10802383 DOI: 10.1101/2024.01.02.573913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics technologies have enabled the detection and generation of variants at an unprecedented scale. However, efficient tools and resources are needed to link these two disparate data types - to "map" variants onto protein structures, to better understand how the variation causes disease and thereby design therapeutics. Here we present the Genomics 2 Proteins Portal (G2P; g2p.broadinstitute.org/): a human proteome-wide resource that maps 19,996,443 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the G2P portal generalizes the capability of linking genomics to proteins beyond databases by allowing users to interactively upload protein residue-wise annotations (variants, scores, etc.) as well as the protein structure to establish the connection. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotype.
Collapse
|
2
|
Loftus SK, Gillis MF, Lundh L, Baxter LL, Wedel JC, Watkins-Chow DE, Donovan FX, Sergeev YV, Oetting WS, Pavan WJ, Adams DR. Haplotype-based analysis resolves missing heritability in oculocutaneous albinism type 1B. Am J Hum Genet 2023; 110:1123-1137. [PMID: 37327787 PMCID: PMC10357474 DOI: 10.1016/j.ajhg.2023.05.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Revised: 05/19/2023] [Accepted: 05/22/2023] [Indexed: 06/18/2023] Open
Abstract
Oculocutaneous albinism (OCA) is a rare disorder of pigment production. Affected individuals have variably decreased global pigmentation and visual-developmental changes that lead to low vision. OCA is notable for significant missing heritability, particularly among individuals with residual pigmentation. Tyrosinase (TYR) is the rate-limiting enzyme in melanin pigment biosynthesis and mutations that decrease enzyme function are one of the most common causes of OCA. We present the analysis of high-depth short-read TYR sequencing data for a cohort of 352 OCA probands, ∼50% of whom were previously sequenced without yielding a definitive diagnostic result. Our analysis identified 66 TYR single-nucleotide variants (SNVs) and small insertion/deletions (indels), 3 structural variants, and a rare haplotype comprised of two common frequency variants (p.Ser192Tyr and p.Arg402Gln) in cis-orientation, present in 149/352 OCA probands. We further describe a detailed analysis of the disease-causing haplotype, p.[Ser192Tyr; Arg402Gln] ("cis-YQ"). Haplotype analysis suggests that the cis-YQ allele arose by recombination and that multiple cis-YQ haplotypes are segregating in OCA-affected individuals and control populations. The cis-YQ allele is the most common disease-causing allele in our cohort, representing 19.1% (57/298) of TYR pathogenic alleles in individuals with type 1 (TYR-associated) OCA. Finally, among the 66 TYR variants, we found several additional alleles defined by a cis-oriented combination of minor, potentially hypomorph-producing alleles at common variant sites plus a second, rare pathogenic variant. Together, these results suggest that identification of phased variants for the full TYR locus are required for an exhaustive assessment for potentially disease-causing alleles.
Collapse
Affiliation(s)
- Stacie K Loftus
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA; Genetic Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| | - Meredith F Gillis
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Linnea Lundh
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Laura L Baxter
- Genetic Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Julia C Wedel
- Genetic Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Dawn E Watkins-Chow
- Genetic Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Frank X Donovan
- Cancer Genomics Unit, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Yuri V Sergeev
- Ophthalmic Genetics and Visual Function Branch, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - William S Oetting
- Department of Experimental and Clinical Pharmacology, University of Minnesota, Minneapolis, MN 55455, USA
| | - William J Pavan
- Genetic Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - David R Adams
- Office of the Clinical Director, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| |
Collapse
|
3
|
Association of TYR SNP rs1042602 with Melanoma Risk and Prognosis. LIFE (BASEL, SWITZERLAND) 2022; 12:life12122004. [PMID: 36556369 PMCID: PMC9785037 DOI: 10.3390/life12122004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 11/26/2022] [Accepted: 11/28/2022] [Indexed: 12/02/2022]
Abstract
Cutaneous melanoma is the most aggressive of skin tumors. In order to discover new biomarkers that could help us improve prognostic prediction in melanoma patients, we have searched for germline DNA variants associated with melanoma progression. Thus, after exome sequencing of a set of melanoma patients and healthy control individuals, we identified rs1042602, an SNP within TYR, as a good candidate. After genotyping rs1042602 in 1025 patients and 773 healthy donors, we found that the rs1042602-A allele was significantly associated with susceptibility to melanoma (CATT test: p = 0.0035). Interestingly, we also observed significant differences between patients with good and bad prognosis (5 years of follow-up) (n = 664) (CATT test for all samples p = 0.0384 and for men alone p = 0.0054). Disease-free-survival (DFS) analyses also showed that patients with the A allele had shorter DFS periods. In men, the association remained significant even in a multivariate Cox Proportional-hazards model, which was adjusted for age at diagnosis, Breslow thickness, ulceration and melanoma subtype (HR 0.4; 95% confidence interval (CI) 0.20-0.83; p = 0.0139). Based on our results, we propose that rs1042602-A is a risk allele for melanoma, which also seems to be responsible for a poorer prognosis of the disease, particularly in men.
Collapse
|