Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lu Q, Hu Y, Sun J, Cheng Y, Cheung KH, Zhao H. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data. Sci Rep 2015;5:10576. [PMID: 26015273 DOI: 10.1038/srep10576] [Citation(s) in RCA: 112] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 04/20/2015] [Indexed: 12/16/2022] Open

For:	Lu Q, Hu Y, Sun J, Cheng Y, Cheung KH, Zhao H. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data. Sci Rep 2015;5:10576. [PMID: 26015273 DOI: 10.1038/srep10576] [Citation(s) in RCA: 112] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 04/20/2015] [Indexed: 12/16/2022] Open

Number

Cited by Other Article(s)

Giovannetti A, Lazzari S, Mangoni M, Traversa A, Mazza T, Parisi C, Caputo V. Exploring non-coding genetic variability in ACE2: Functional annotation and in vitro validation of regulatory variants. Gene 2024;915:148422. [PMID: 38570058 DOI: 10.1016/j.gene.2024.148422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 02/23/2024] [Accepted: 03/13/2024] [Indexed: 04/05/2024]

Tabet DR, Kuang D, Lancaster MC, Li R, Liu K, Weile J, Coté AG, Wu Y, Hegele RA, Roden DM, Roth FP. Benchmarking computational variant effect predictors by their ability to infer human traits. Genome Biol 2024;25:172. [PMID: 38951922 PMCID: PMC11218265 DOI: 10.1186/s13059-024-03314-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 06/17/2024] [Indexed: 07/03/2024] Open

Affiliation(s)

Daniel R Tabet Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Da Kuang Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Megan C Lancaster Division of Cardiovascular Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Roujia Li Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Karen Liu Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Jochen Weile Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Atina G Coté Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Yingzhou Wu Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Robert A Hegele Department of Medicine, Department of Biochemistry, Schulich School of Medicine and Dentistry, Robarts Research Institute, Western University, London, ON, Canada
Dan M Roden Division of Cardiovascular Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA Department of Pharmacology, Vanderbilt University Medical Centre, Nashville, TN, USA Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
Frederick P Roth Donnelly Centre, University of Toronto, Toronto, ON, Canada. Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada. Department of Computer Science, University of Toronto, Toronto, ON, Canada. Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada. Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA.

Collapse

Lin YJ, Menon AS, Hu Z, Brenner SE. Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600283. [PMID: 38979289 PMCID: PMC11230257 DOI: 10.1101/2024.06.25.600283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]

Rastogi R, Chung R, Li S, Li C, Lee K, Woo J, Kim DW, Keum C, Babbi G, Martelli PL, Savojardo C, Casadio R, Chennen K, Weber T, Poch O, Ancien F, Cia G, Pucci F, Raimondi D, Vranken W, Rooman M, Marquet C, Olenyi T, Rost B, Andreoletti G, Kamandula A, Peng Y, Bakolitsa C, Mort M, Cooper DN, Bergquist T, Pejaver V, Liu X, Radivojac P, Brenner SE, Ioannidis NM. Critical assessment of missense variant effect predictors on disease-relevant variant data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.06.597828. [PMID: 38895200 PMCID: PMC11185644 DOI: 10.1101/2024.06.06.597828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]

Abstract

Regular, systematic, and independent assessment of computational tools used to predict the pathogenicity of missense variants is necessary to evaluate their clinical and research utility and suggest directions for future improvement. Here, as part of the sixth edition of the Critical Assessment of Genome Interpretation (CAGI) challenge, we assess missense variant effect predictors (or variant impact predictors) on an evaluation dataset of rare missense variants from disease-relevant databases. Our assessment evaluates predictors submitted to the CAGI6 Annotate-All-Missense challenge, predictors commonly used by the clinical genetics community, and recently developed deep learning methods for variant effect prediction. To explore a variety of settings that are relevant for different clinical and research applications, we assess performance within different subsets of the evaluation data and within high-specificity and high-sensitivity regimes. We find strong performance of many predictors across multiple settings. Meta-predictors tend to outperform their constituent individual predictors; however, several individual predictors have performance similar to that of commonly used meta-predictors. The relative performance of predictors differs in high-specificity and high-sensitivity regimes, suggesting that different methods may be best suited to different use cases. We also characterize two potential sources of bias. Predictors that incorporate allele frequency as a predictive feature tend to have reduced performance when distinguishing pathogenic variants from very rare benign variants, and predictors supervised on pathogenicity labels from curated variant databases often learn label imbalances within genes. Overall, we find notable advances over the oldest and most cited missense variant effect predictors and continued improvements among the most recently developed tools, and the CAGI Annotate-All-Missense challenge (also termed the Missense Marathon) will continue to assess state-of-the-art methods as the field progresses. Together, our results help illuminate the current clinical and research utility of missense variant effect predictors and identify potential areas for future development.

Collapse

Zhou Y, Pirmann S, Lauschke VM. APF2: an improved ensemble method for pharmacogenomic variant effect prediction. THE PHARMACOGENOMICS JOURNAL 2024;24:17. [PMID: 38802404 PMCID: PMC11129946 DOI: 10.1038/s41397-024-00338-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/26/2024] [Accepted: 05/15/2024] [Indexed: 05/29/2024]

Abstract

Lack of efficacy or adverse drug response are common phenomena in pharmacological therapy causing considerable morbidity and mortality. It is estimated that 20-30% of this variability in drug response stems from variations in genes encoding drug targets or factors involved in drug disposition. Leveraging such pharmacogenomic information for the preemptive identification of patients who would benefit from dose adjustments or alternative medications thus constitutes an important frontier of precision medicine. Computational methods can be used to predict the functional effects of variant of unknown significance. However, their performance on pharmacogenomic variant data has been lackluster. To overcome this limitation, we previously developed an ensemble classifier, termed APF, specifically designed for pharmacogenomic variant prediction. Here, we aimed to further improve predictions by leveraging recent key advances in the prediction of protein folding based on deep neural networks. Benchmarking of 28 variant effect predictors on 530 pharmacogenetic missense variants revealed that structural predictions using AlphaMissense were most specific, whereas APF exhibited the most balanced performance. We then developed a new tool, APF2, by optimizing algorithm parametrization of the top performing algorithms for pharmacogenomic variations and aggregating their predictions into a unified ensemble score. Importantly, APF2 provides quantitative variant effect estimates that correlate well with experimental results (R2 = 0.91, p = 0.003) and predicts the functional impact of pharmacogenomic variants with higher accuracy than previous methods, particularly for clinically relevant variations with actionable pharmacogenomic guidelines. We furthermore demonstrate better performance (92% accuracy) on an independent test set of 146 variants across 61 pharmacogenes not used for model training or validation. Application of APF2 to population-scale sequencing data from over 800,000 individuals revealed drastic ethnogeographic differences with important implications for pharmacotherapy. We thus think that APF2 holds the potential to improve the translation of genetic information into pharmacogenetic recommendations, thereby facilitating the use of Next-Generation Sequencing data for stratified medicine.

Collapse

Ginete C, Delgadinho M, Santos B, Miranda A, Silva C, Guerreiro P, Chimusa ER, Brito M. Genetic Modifiers of Sickle Cell Anemia Phenotype in a Cohort of Angolan Children. Genes (Basel) 2024;15:469. [PMID: 38674403 PMCID: PMC11049512 DOI: 10.3390/genes15040469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 04/04/2024] [Accepted: 04/05/2024] [Indexed: 04/28/2024] Open

Wei X, Li H, Zhu T, Sun Z, Sui R. Genotype-Phenotype Associations in an X-Linked Retinoschisis Patient Cohort: The Molecular Dynamic Insight and a Promising SD-OCT Indicator. Invest Ophthalmol Vis Sci 2024;65:17. [PMID: 38324300 PMCID: PMC10854265 DOI: 10.1167/iovs.65.2.17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 01/23/2024] [Indexed: 02/08/2024] Open

Abstract

Purpose

This study investigated a three-dimensional indicator in spectral-domain optical coherence tomography (SD-OCT) and established phenotype-genotype correlation in X-linked retinoschisis (XLRS).

Methods

Thirty-seven patients with XLRS underwent comprehensive ophthalmic examinations, including visual acuity (VA), fundus examination, electroretinogram (ERG), and SD-OCT. SD-OCT parameters of central foveal thickness (CFT), cyst cavity volume (CCV), and photoreceptor outer segment length were assessed. CCV was defined as the sum of the areas of cyst cavities in uential B-scans, measured automatically by self-developed software (OCT-CCSEG). Structural changes of the protein associated with missense variants were quantified by molecular dynamics (MD). The correlation between genotype and phenotype was analyzed.

Results

Twenty-seven different RS1 variants were identified, including a novel variant c.336_337insT(p.L113Sfs*8). The average age of onset was 14.76 ± 15.75 years, and the mean VA was 0.84 ± 0.43 logMAR. The mean CCV was 1.69 ± 1.87 mm3, correlating significantly with CFT (R = 0.66; P < 0.01). In the genotype-phenotype analysis of missense variants, CCV significantly correlated with the structural effect on the protein of mutational changes referred to as wild type, including root-mean-square deviation (R = 0.34; P = 0.04), solvent accessible surface area (R = 0.38; P = 0.02), and surface hydrophobic area (R = 0.37; P = 0.03). The amplitude of scotopic 3.0 ERG a-waves and b-waves significantly correlated with the percentage change of the β-strand in the secondary structure (a-wave: R = -0.58, P < 0.01; b-wave: R = -0.53, P < 0.01).

Conclusions

CCV is a promising indicator to quantify the structural disorganization of XLRS retina. The OCT-CCSEG software calculated CCV automatically, potentially facilitating prognosis assessment and development of personalized treatment. Moreover, MD-involved genotype-phenotype analysis suggests an association between protein structural alterations and XLRS severity measured by CCV and ERG.

Collapse

Nourbakhsh M, Degn K, Saksager A, Tiberti M, Papaleo E. Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks. Brief Bioinform 2024;25:bbad519. [PMID: 38261338 PMCID: PMC10805075 DOI: 10.1093/bib/bbad519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 11/27/2023] [Accepted: 12/11/2023] [Indexed: 01/24/2024] Open

Wang Z, Zhao G, Zhu Z, Wang Y, Xiang X, Zhang S, Luo T, Zhou Q, Qiu J, Tang B, Xia K, Li B, Li J. VarCards2: an integrated genetic and clinical database for ACMG-AMP variant-interpretation guidelines in the human whole genome. Nucleic Acids Res 2024;52:D1478-D1489. [PMID: 37956311 PMCID: PMC10767961 DOI: 10.1093/nar/gkad1061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/21/2023] [Accepted: 10/25/2023] [Indexed: 11/15/2023] Open

Affiliation(s)

Zheng Wang National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Department of Neurology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Hunan Key Laboratory of Molecular Precision Medicine, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Guihu Zhao National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Department of Neurology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Bioinformatics Center, Furong Laboratory & Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Zhaopo Zhu Center for Medical Genetics & Hunan Key Laboratory, School of Life Sciences, Central South University, Changsha, Hunan 410008, China
Yijing Wang National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Bioinformatics Center, Furong Laboratory & Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Xudong Xiang National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Shiyu Zhang Xiangya School of Medicine, Central South University, Changsha, Hunan 410013, China
Tengfei Luo Center for Medical Genetics & Hunan Key Laboratory, School of Life Sciences, Central South University, Changsha, Hunan 410008, China
Qiao Zhou National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Bioinformatics Center, Furong Laboratory & Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Jian Qiu National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Department of Neurology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Hunan Key Laboratory of Molecular Precision Medicine, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Beisha Tang National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Department of Neurology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Department of Neurology, & Multi-Omics Research Center for Brain Disorders, The First Affiliated Hospital, University of South China, Hengyang, Hunan, China
Kun Xia Center for Medical Genetics & Hunan Key Laboratory, School of Life Sciences, Central South University, Changsha, Hunan 410008, China
Bin Li National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Department of Neurology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Bioinformatics Center, Furong Laboratory & Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
Jinchen Li National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Center for Medical Genetics & Hunan Key Laboratory, School of Life Sciences, Central South University, Changsha, Hunan 410008, China Department of Neurology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China Bioinformatics Center, Furong Laboratory & Xiangya Hospital, Central South University, Changsha, Hunan 410008, China

Collapse

Tan HJ, Deng ZH, Shen H, Deng HW, Xiao HM. Single-cell RNA-seq identified novel genes involved in primordial follicle formation. Front Endocrinol (Lausanne) 2023;14:1285667. [PMID: 38149096 PMCID: PMC10750415 DOI: 10.3389/fendo.2023.1285667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 11/27/2023] [Indexed: 12/28/2023] Open

Abstract

Introduction

The number of primordial follicles (PFs) in mammals determines the ovarian reserve, and impairment of primordial follicle formation (PFF) will cause premature ovarian insufficiency (POI).

Methods

By analyzing public single-cell RNA sequencing performed during PFF on mice and human ovaries, we identified novel functional genes and novel ligand-receptor interaction during PFF. Based on immunofluorescence and in vitro ovarian culture, we confirmed mechanisms of genes and ligand-receptor interaction in PFF. We also applied whole exome sequencing (WES) in 93 cases with POI and whole genome sequencing (WGS) in 465 controls. Variants in POI patients were further investigated by in silico analysis and functional verification.

Results

We revealed ANXA7 (annexin A7) and GTF2F1 (general transcription factor IIF subunit 1) in germ cells to be novel potentially genes in promoting PFF. Ligand Mdk (midkine) in germ cells and its receptor Sdc1 (syndecan 1) in granulosa cells are novel interaction crucial for PFF. Based on immunofluorescence, we confirmed significant up-regulation of ANXA7 in PFs compared with germline cysts, and uniform expression of GTF2F1, MDK and SDC1 during PFF, in 25 weeks human fetal ovary. In vitro investigation indicated that Anxa7 and Gtf2f1 are vital for mice PFF by regulating Jak/Stat3 and Jnk signaling pathways, respectively. Ligand-receptor (Mdk-Sdc1) are crucial for PFF by regulating Pi3k-akt signaling pathway. Two heterozygous variants in GTF2F1, and one heterozygous variants in SDC1 were identified in cases, but no variant were identified in controls. The protein level of GTF2F1 or SDC1 in POI cases are significantly lower than that of controls, indicating the pathogenic effects of the two genes on ovarian function were dosage dependent.

Discussion

Our study identified novel genes and novel ligand-receptor interaction during PFF, and further expanding the genetic architecture of POI.

Collapse

Stein D, Kars ME, Wu Y, Bayrak ÇS, Stenson PD, Cooper DN, Schlessinger A, Itan Y. Genome-wide prediction of pathogenic gain- and loss-of-function variants from ensemble learning of a diverse feature set. Genome Med 2023;15:103. [PMID: 38037155 PMCID: PMC10688473 DOI: 10.1186/s13073-023-01261-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 11/16/2023] [Indexed: 12/02/2023] Open

Jorge SD, Chi YI, Mazaba JL, Haque N, Wagenknecht J, Smith BC, Volkman BF, Mathison AJ, Lomberk G, Zimmermann MT, Urrutia R. Deep computational phenotyping of genomic variants impacting the SET domain of KMT2C reveal molecular mechanisms for their dysfunction. Front Genet 2023;14:1291307. [PMID: 38090150 PMCID: PMC10715303 DOI: 10.3389/fgene.2023.1291307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 11/17/2023] [Indexed: 12/29/2023] Open

Abstract

Introduction: Kleefstra Syndrome type 2 (KLEFS-2) is a genetic, neurodevelopmental disorder characterized by intellectual disability, infantile hypotonia, severe expressive language delay, and characteristic facial appearance, with a spectrum of other distinct clinical manifestations. Pathogenic mutations in the epigenetic modifier type 2 lysine methyltransferase KMT2C have been identified to be causative in KLEFS-2 individuals. Methods: This work reports a translational genomic study that applies a multidimensional computational approach for deep variant phenotyping, combining conventional genomic analyses, advanced protein bioinformatics, computational biophysics, biochemistry, and biostatistics-based modeling. We use standard variant annotation, paralog annotation analyses, molecular mechanics, and molecular dynamics simulations to evaluate damaging scores and provide potential mechanisms underlying KMT2C variant dysfunction. Results: We integrated data derived from the structure and dynamics of KMT2C to classify variants into SV (Structural Variant), DV (Dynamic Variant), SDV (Structural and Dynamic Variant), and VUS (Variant of Uncertain Significance). When compared with controls, these variants show values reflecting alterations in molecular fitness in both structure and dynamics. Discussion: We demonstrate that our 3D models for KMT2C variants suggest distinct mechanisms that lead to their imbalance and are not predictable from sequence alone. Thus, the missense variants studied here cause destabilizing effects on KMT2C function by different biophysical and biochemical mechanisms which we adeptly describe. This new knowledge extends our understanding of how variations in the KMT2C gene cause the dysfunction of its methyltransferase enzyme product, thereby bearing significant biomedical relevance for carriers of KLEFS2-associated genomic mutations.

Collapse

Affiliation(s)

Salomão Dória Jorge Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States
Young-In Chi Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, United States
Jose Lizarraga Mazaba Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States
Neshatul Haque Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States
Jessica Wagenknecht Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States
Brian C. Smith Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, United States
Brian F. Volkman Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, United States
Angela J. Mathison Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, United States
Gwen Lomberk Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, United States Department of Pharmacology and Toxicology, Medical College of Wisconsin, Milwaukee, WI, United States
Michael T. Zimmermann Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, United States Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, WI, United States
Raul Urrutia Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, United States Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, United States Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, United States

Collapse

Ge F, Arif M, Yan Z, Alahmadi H, Worachartcheewan A, Yu DJ, Shoombuatong W. MMPatho: Leveraging Multilevel Consensus and Evolutionary Information for Enhanced Missense Mutation Pathogenic Prediction. J Chem Inf Model 2023;63:7239-7257. [PMID: 37947586 PMCID: PMC10685454 DOI: 10.1021/acs.jcim.3c00950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 10/21/2023] [Accepted: 10/23/2023] [Indexed: 11/12/2023]

Abstract

Understanding the pathogenicity of missense mutation (MM) is essential for shed light on genetic diseases, gene functions, and individual variations. In this study, we propose a novel computational approach, called MMPatho, for enhancing missense mutation pathogenic prediction. First, we established a large-scale nonredundant MM benchmark data set based on the entire Ensembl database, complemented by a focused blind test set specifically for pathogenic GOF/LOF MM. Based on this data set, for each mutation, we utilized Ensembl VEP v104 and dbNSFP v4.1a to extract variant-level, amino acid-level, individuals' outputs, and genome-level features. Additionally, protein sequences were generated using ENSP identifiers with the Ensembl API, and then encoded. The mutant sites' ESM-1b and ProtTrans-T5 embeddings were subsequently extracted. Then, our model group (MMPatho) was developed by leveraging upon these efforts, which comprised ConsMM and EvoIndMM. To be specific, ConsMM employs individuals' outputs and XGBoost with SHAP explanation analysis, while EvoIndMM investigates the potential enhancement of predictive capability by incorporating evolutionary information from ESM-1b and ProtT5-XL-U50, large protein language embeddings. Through rigorous comparative experiments, both ConsMM and EvoIndMM were capable of achieving remarkable AUROC (0.9836 and 0.9854) and AUPR (0.9852 and 0.9902) values on the blind test set devoid of overlapping variations and proteins from the training data, thus highlighting the superiority of our computational approach in the prediction of MM pathogenicity. Our Web server, available at http://csbio.njust.edu.cn/bioinf/mmpatho/, allows researchers to predict the pathogenicity (alongside the reliability index score) of MMs using the ConsMM and EvoIndMM models and provides extensive annotations for user input. Additionally, the newly constructed benchmark data set and blind test set can be accessed via the data page of our web server.

Collapse

Moore A, Marks JA, Quach BC, Guo Y, Bierut LJ, Gaddis NC, Hancock DB, Page GP, Johnson EO. Evaluating 17 methods incorporating biological function with GWAS summary statistics to accelerate discovery demonstrates a tradeoff between high sensitivity and high positive predictive value. Commun Biol 2023;6:1199. [PMID: 38001305 PMCID: PMC10673847 DOI: 10.1038/s42003-023-05413-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 10/03/2023] [Indexed: 11/26/2023] Open

Tao LR, Ye Y, Zhao H. Early breast cancer risk detection: a novel framework leveraging polygenic risk scores and machine learning. J Med Genet 2023;60:960-964. [PMID: 37055164 DOI: 10.1136/jmg-2022-108582] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Accepted: 03/27/2023] [Indexed: 04/15/2023]

Abstract

BACKGROUND

Breast cancer (BC) is the most common cancer and the second leading cause of cancer death in women; an estimated one in eight women in the USA will develop BC during her lifetime. However, current methods of BC screening, including clinical breast exams, mammograms, biopsies and others, are often underused due to limited access, expense and a lack of risk awareness, causing 30% (up to 80% in low-income and middle-income countries) of patients with BC to miss the precious early detection phase.

METHODS

This study creates a key step to supplement the current BC diagnostic pipeline: a prescreening platform, prior to traditional detection and diagnostic steps. We have developed BREast CAncer Risk Detection Application (BRECARDA), a novel framework that personalises BC risk assessment using artificial intelligence neural networks to incorporate relevant genetic and non-genetic risk factors. A polygenic risk score (PRS) was enhanced by employing AnnoPred and validated by fivefolds cross-validation, outperforming three existing state-of-the-art PRS methods.

RESULTS

We used data from 97 597 female participants of the UK BioBank to train our algorithm. Using the enhanced PRS thus trained together with non-genetic information, BRECARDA was evaluated in a testing dataset with 48 074 UK Biobank female participants and achieved a high accuracy of 94.28% and area under the curve of 0.7861. Our optimised AnnoPred outperformed other state-of-the-art methods on quantifying genetic risk, indicating its potential for supplementing the current BC detection tests, population screening and risk evaluation.

CONCLUSION

BRECARDA can enhance disease risk prediction, identify high-risk individuals for BC screening, facilitate disease diagnosis and improve population-level screening efficiency. It can serve as a valuable and supplemental platform to assist doctors in BC diagnosis and evaluation.

Collapse

He Q, Keding TJ, Zhang Q, Miao J, Russell JD, Herringa RJ, Lu Q, Travers BG, Li JJ. Neurogenetic mechanisms of risk for ADHD: Examining associations of polygenic scores and brain volumes in a population cohort. J Neurodev Disord 2023;15:30. [PMID: 37653373 PMCID: PMC10469494 DOI: 10.1186/s11689-023-09498-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 08/21/2023] [Indexed: 09/02/2023] Open

Ye Y, Noche RB, Szejko N, Both CP, Acosta JN, Leasure AC, Brown SC, Sheth KN, Gill TM, Zhao H, Falcone GJ. A genome-wide association study of frailty identifies significant genetic correlation with neuropsychiatric, cardiovascular, and inflammation pathways. GeroScience 2023;45:2511-2523. [PMID: 36928559 PMCID: PMC10651618 DOI: 10.1007/s11357-023-00771-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Accepted: 03/10/2023] [Indexed: 03/18/2023] Open

Shi FY, Wang Y, Huang D, Liang Y, Liang N, Chen XW, Gao G. Computational Assessment of the Expression-modulating Potential for Non-coding Variants. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023;21:662-673. [PMID: 34890839 PMCID: PMC10787178 DOI: 10.1016/j.gpb.2021.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 10/13/2021] [Accepted: 11/01/2021] [Indexed: 06/13/2023]

Wang Z, Zhao G, Li B, Fang Z, Chen Q, Wang X, Luo T, Wang Y, Zhou Q, Li K, Xia L, Zhang Y, Zhou X, Pan H, Zhao Y, Wang Y, Wang L, Guo J, Tang B, Xia K, Li J. Performance Comparison of Computational Methods for the Prediction of the Function and Pathogenicity of Non-coding Variants. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023;21:649-661. [PMID: 35272052 PMCID: PMC10787016 DOI: 10.1016/j.gpb.2022.02.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 12/28/2021] [Accepted: 02/27/2022] [Indexed: 06/14/2023]

Abstract

Non-coding variants in the human genome significantly influence human traits and complex diseases via their regulation and modification effects. Hence, an increasing number of computational methods are developed to predict the effects of variants in human non-coding sequences. However, it is difficult for inexperienced users to select appropriate computational methods from dozens of available methods. To solve this issue, we assessed 12 performance metrics of 24 methods on four independent non-coding variant benchmark datasets: (1) rare germline variants from clinical relevant sequence variants (ClinVar), (2) rare somatic variants from Catalogue Of Somatic Mutations In Cancer (COSMIC), (3) common regulatory variants from curated expression quantitative trait locus (eQTL) data, and (4) disease-associated common variants from curated genome-wide association studies (GWAS). All 24 tested methods performed differently under various conditions, indicating varying strengths and weaknesses under different scenarios. Importantly, the performance of existing methods was acceptable for rare germline variants from ClinVar with the area under the receiver operating characteristic curve (AUROC) of 0.4481-0.8033 and poor for rare somatic variants from COSMIC (AUROC = 0.4984-0.7131), common regulatory variants from curated eQTL data (AUROC = 0.4837-0.6472), and disease-associated common variants from curated GWAS (AUROC = 0.4766-0.5188). We also compared the prediction performance of 24 methods for non-coding de novo mutations in autism spectrum disorder, and found that the combined annotation-dependent depletion (CADD) and context-dependent tolerance score (CDTS) methods showed better performance. Summarily, we assessed the performance of 24 computational methods under diverse scenarios, providing preliminary advice for proper tool selection and guiding the development of new techniques in interpreting non-coding variants.

Collapse

Affiliation(s)

Zheng Wang National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China
Guihu Zhao National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China; Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Bin Li National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China; Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Zhenghuan Fang Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Qian Chen National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China
Xiaomeng Wang Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Tengfei Luo Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Yijing Wang Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Qiao Zhou National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China
Kuokuo Li Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Lu Xia Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Yi Zhang National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China
Xun Zhou National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China
Hongxu Pan Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Yuwen Zhao Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Yige Wang Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Lin Wang Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China; Reproductive Medicine Center, Xiangya Hospital, Central South University, Changsha 410008, China
Jifeng Guo National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China; Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Beisha Tang National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China; Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China
Kun Xia Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China
Jinchen Li National Clinical Research Centre for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha 410008, China; Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China; Centre for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha 410008, China.

Collapse

Johnson EC, Kapoor M, Hatoum AS, Zhou H, Polimanti R, Wendt FR, Walters RK, Lai D, Kember RL, Hartz S, Meyers JL, Peterson RE, Ripke S, Bigdeli TB, Fanous AH, Pato CN, Pato MT, Goate AM, Kranzler HR, O'Donovan MC, Walters JTR, Gelernter J, Edenberg HJ, Agrawal A. Investigation of convergent and divergent genetic influences underlying schizophrenia and alcohol use disorder. Psychol Med 2023;53:1196-1204. [PMID: 34231451 PMCID: PMC8738774 DOI: 10.1017/s003329172100266x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

Affiliation(s)

Emma C Johnson Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO, USA
Manav Kapoor Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Alexander S Hatoum Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO, USA
Hang Zhou Department of Psychiatry, Division of Human Genetics, Yale University School of Medicine, New Haven, CT, USA Department of Psychiatry, Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
Renato Polimanti Department of Psychiatry, Division of Human Genetics, Yale University School of Medicine, New Haven, CT, USA Department of Psychiatry, Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
Frank R Wendt Department of Psychiatry, Division of Human Genetics, Yale University School of Medicine, New Haven, CT, USA Department of Psychiatry, Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
Raymond K Walters Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Dongbing Lai Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
Rachel L Kember Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA VISN 4 MIRECC, Crescenz VAMC, Philadelphia, PA, USA
Sarah Hartz Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO, USA
Jacquelyn L Meyers Department of Psychiatry and Behavioral Sciences, SUNY Downstate Health Sciences University, Brooklyn, NY, USA Henri Begleiter Neurodynamics Laboratory, SUNY Downstate Health Sciences University, Brooklyn, NY, USA
Roseann E Peterson Department of Psychiatry, Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA
Stephan Ripke Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, Germany
Tim B Bigdeli Department of Psychiatry and Behavioral Sciences, SUNY Downstate Health Sciences University, Brooklyn, NY, USA
Ayman H Fanous Department of Psychiatry and Behavioral Sciences, SUNY Downstate Health Sciences University, Brooklyn, NY, USA
Carlos N Pato Department of Psychiatry and Behavioral Sciences, SUNY Downstate Health Sciences University, Brooklyn, NY, USA
Michele T Pato Department of Psychiatry and Behavioral Sciences, SUNY Downstate Health Sciences University, Brooklyn, NY, USA
Alison M Goate Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Henry R Kranzler Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA VISN 4 MIRECC, Crescenz VAMC, Philadelphia, PA, USA
Michael C O'Donovan Division of Psychological Medicine and Clinical Neurosciences, MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University School of Medicine, Cardiff, UK
James T R Walters Division of Psychological Medicine and Clinical Neurosciences, MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University School of Medicine, Cardiff, UK
Joel Gelernter Department of Psychiatry, Division of Human Genetics, Yale University School of Medicine, New Haven, CT, USA Department of Psychiatry, Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA Department of Genetics, Yale University School of Medicine, New Haven, CT, USA Department of Neuroscience, Yale University School of Medicine, New Haven, CT, USA
Howard J Edenberg Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, USA
Arpana Agrawal Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO, USA

Collapse

Zhang J, Zhao H. eQTL Studies: from Bulk Tissues to Single Cells. ARXIV 2023:arXiv:2302.11662v1. [PMID: 36866231 PMCID: PMC9980190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]

Molecular Dynamic Simulation Analysis of a Novel Missense Variant in CYB5R3 Gene in Patients with Methemoglobinemia. MEDICINA (KAUNAS, LITHUANIA) 2023;59:medicina59020379. [PMID: 36837579 PMCID: PMC9967277 DOI: 10.3390/medicina59020379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/13/2023] [Accepted: 02/14/2023] [Indexed: 02/18/2023]

Abstract

Background and Objective: Mutations in the CYB5R3 gene cause reduced NADH-dependent cytochrome b5 reductase enzyme function and consequently lead to recessive congenital methemoglobinemia (RCM). RCM exists as RCM type I (RCM1) and RCM type II (RCM2). RCM1 leads to higher methemoglobin levels causing only cyanosis, while in RCM2, neurological complications are also present along with cyanosis. Materials and Methods: In the current study, a consanguineous Pakistani family with three individuals showing clinical manifestations of cyanosis, chest pain radiating to the left arm, dyspnea, orthopnea, and hemoptysis was studied. Following clinical assessment, a search for the causative gene was performed using whole exome sequencing (WES) and Sanger sequencing. Various variant effect prediction tools and ACMG criteria were applied to interpret the pathogenicity of the prioritized variants. Molecular dynamic simulation studies of wild and mutant systems were performed to determine the stability of the mutant CYB5R3 protein. Results: Data analysis of WES revealed a novel homozygous missense variant NM_001171660.2: c.670A > T: NP_001165131.1: p.(Ile224Phe) in exon 8 of the CYB5R3 gene located on chromosome 22q13.2. Sanger sequencing validated the segregation of the identified variant with the disease phenotype within the family. Bioinformatics prediction tools and ACMG guidelines predicted the identified variant p.(Ile224Phe) as disease-causing and likely pathogenic, respectively. Molecular dynamics study revealed that the variant p.(Ile224Phe) in the CYB5R3 resides in the NADH domain of the protein, the aberrant function of which is detrimental. Conclusions: The present study expanded the variant spectrum of the CYB5R3 gene. This will facilitate genetic counselling of the same and other similar families carrying mutations in the CYB5R3 gene.

Collapse

Li RY, Huang Y, Zhao Z, Qin ZS. Comprehensive 100-bp resolution genome-wide epigenomic profiling data for the hg38 human reference genome. Data Brief 2022;46:108827. [PMID: 36582986 PMCID: PMC9792340 DOI: 10.1016/j.dib.2022.108827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 11/21/2022] [Accepted: 12/09/2022] [Indexed: 12/15/2022] Open

Garcia FADO, de Andrade ES, Palmero EI. Insights on variant analysis in silico tools for pathogenicity prediction. Front Genet 2022;13:1010327. [PMID: 36568376 PMCID: PMC9774026 DOI: 10.3389/fgene.2022.1010327] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 11/14/2022] [Indexed: 12/03/2022] Open

He Z, Liu L, Belloy ME, Le Guen Y, Sossin A, Liu X, Qi X, Ma S, Gyawali PK, Wyss-Coray T, Tang H, Sabatti C, Candès E, Greicius MD, Ionita-Laza I. GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies. Nat Commun 2022;13:7209. [PMID: 36418338 PMCID: PMC9684164 DOI: 10.1038/s41467-022-34932-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 11/09/2022] [Indexed: 11/27/2022] Open

Multi-omics approach dissects cis-regulatory mechanisms underlying North Carolina macular dystrophy, a retinal enhanceropathy. Am J Hum Genet 2022;109:2029-2048. [PMID: 36243009 PMCID: PMC9674966 DOI: 10.1016/j.ajhg.2022.09.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 09/28/2022] [Indexed: 01/26/2023] Open

Exploration of Tools for the Interpretation of Human Non-Coding Variants. Int J Mol Sci 2022;23:ijms232112977. [PMID: 36361767 PMCID: PMC9654743 DOI: 10.3390/ijms232112977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 10/17/2022] [Accepted: 10/23/2022] [Indexed: 02/01/2023] Open

Li C, Zhi D, Wang K, Liu X. MetaRNN: differentiating rare pathogenic and rare benign missense SNVs and InDels using deep learning. Genome Med 2022;14:115. [PMID: 36209109 PMCID: PMC9548151 DOI: 10.1186/s13073-022-01120-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 09/22/2022] [Indexed: 11/22/2022] Open

Huang YS, Hsu C, Chune YC, Liao IC, Wang H, Lin YL, Hwu WL, Lee NC, Lai F. Diagnosis of a Single-Nucleotide Variant in Whole-Exome Sequencing Data for Patients With Inherited Diseases: Machine Learning Study Using Artificial Intelligence Variant Prioritization. JMIR BIOINFORMATICS AND BIOTECHNOLOGY 2022;3:e37701. [PMID: 38935959 PMCID: PMC11168239 DOI: 10.2196/37701] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 07/29/2022] [Accepted: 08/22/2022] [Indexed: 06/29/2024]

Abstract

BACKGROUND

In recent years, thanks to the rapid development of next-generation sequencing (NGS) technology, an entire human genome can be sequenced in a short period. As a result, NGS technology is now being widely introduced into clinical diagnosis practice, especially for diagnosis of hereditary disorders. Although the exome data of single-nucleotide variant (SNV) can be generated using these approaches, processing the DNA sequence data of a patient requires multiple tools and complex bioinformatics pipelines.

OBJECTIVE

This study aims to assist physicians to automatically interpret the genetic variation information generated by NGS in a short period. To determine the true causal variants of a patient with genetic disease, currently, physicians often need to view numerous features on every variant manually and search for literature in different databases to understand the effect of genetic variation.

METHODS

We constructed a machine learning model for predicting disease-causing variants in exome data. We collected sequencing data from whole-exome sequencing (WES) and gene panel as training set, and then integrated variant annotations from multiple genetic databases for model training. The model built ranked SNVs and output the most possible disease-causing candidates. For model testing, we collected WES data from 108 patients with rare genetic disorders in National Taiwan University Hospital. We applied sequencing data and phenotypic information automatically extracted by a keyword extraction tool from patient's electronic medical records into our machine learning model.

RESULTS

We succeeded in locating 92.5% (124/134) of the causative variant in the top 10 ranking list among an average of 741 candidate variants per person after filtering. AI Variant Prioritizer was able to assign the target gene to the top rank for around 61.1% (66/108) of the patients, followed by Variant Prioritizer, which assigned it for 44.4% (48/108) of the patients. The cumulative rank result revealed that our AI Variant Prioritizer has the highest accuracy at ranks 1, 5, 10, and 20. It also shows that AI Variant Prioritizer presents better performance than other tools. After adopting the Human Phenotype Ontology (HPO) terms by looking up the databases, the top 10 ranking list can be increased to 93.5% (101/108).

CONCLUSIONS

We successfully applied sequencing data from WES and free-text phenotypic information of patient's disease automatically extracted by the keyword extraction tool for model training and testing. By interpreting our model, we identified which features of variants are important. Besides, we achieved a satisfactory result on finding the target variant in our testing data set. After adopting the HPO terms by looking up the databases, the top 10 ranking list can be increased to 93.5% (101/108). The performance of the model is similar to that of manual analysis, and it has been used to help National Taiwan University Hospital with a genetic diagnosis.

Collapse

Integrating variant functional annotation scores have varied abilities to improve power of genome-wide association studies. Sci Rep 2022;12:10720. [PMID: 35750789 PMCID: PMC9232605 DOI: 10.1038/s41598-022-14924-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Accepted: 06/15/2022] [Indexed: 11/12/2022] Open

Chimusa ER, Alosaimi S, Bope CD. Dissecting Generalizability and Actionability of Disease-Associated Genes From 20 Worldwide Ethnolinguistic Cultural Groups. Front Genet 2022;13:835713. [PMID: 35812734 PMCID: PMC9263835 DOI: 10.3389/fgene.2022.835713] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 04/29/2022] [Indexed: 11/30/2022] Open

Abstract

Findings resulting from whole-genome sequencing (WGS) have markedly increased due to the massive evolvement of sequencing methods and have led to further investigations such as clinical actionability of genes, as documented by the American College of Medical Genetics and Genomics (ACMG). ACMG's actionable genes (ACGs) may not necessarily be clinically actionable across all populations worldwide. It is critical to examine the actionability of these genes in different populations. Here, we have leveraged a combined WES from the African Genome Variation and 1000 Genomes Project to examine the generalizability of ACG and potential actionable genes from four diseases: high-burden malaria, TB, HIV/AIDS, and sickle cell disease. Our results suggest that ethnolinguistic cultural groups from Africa, particularly Bantu and Khoesan, have high genetic diversity, high proportion of derived alleles at low minor allele frequency (0.0-0.1), and the highest proportion of pathogenic variants within HIV, TB, malaria, and sickle cell diseases. In contrast, ethnolinguistic cultural groups from the non-Africa continent, including Latin American, Afro-related, and European-related groups, have a high proportion of pathogenic variants within ACG than most of the ethnolinguistic cultural groups from Africa. Overall, our results show high genetic diversity in the present actionable and known disease-associated genes of four African high-burden diseases, suggesting the limitation of transferability or generalizability of ACG. This supports the use of personalized medicine as beneficial to the worldwide population as well as actionable gene list recommendation to further foster equitable global healthcare. The results point out the bias in the knowledge about the frequency distribution of these phenotypes and genetic variants associated with some diseases, especially in African and African ancestry populations.

Collapse

Chen D, Wang X, Huang T, Jia J. Sleep and Late-Onset Alzheimer's Disease: Shared Genetic Risk Factors, Drug Targets, Molecular Mechanisms, and Causal Effects. Front Genet 2022;13:794202. [PMID: 35656316 PMCID: PMC9152224 DOI: 10.3389/fgene.2022.794202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 03/23/2022] [Indexed: 12/30/2022] Open

Abstract

Late-onset Alzheimer's disease (AD) is associated with sleep-related phenotypes (SRPs). The fact that whether they share a common genetic etiology remains largely unknown. We explored the shared genetics and causality between AD and SRPs by using high-definition likelihood (HDL), cross-phenotype association study (CPASSOC), transcriptome-wide association study (TWAS), and bidirectional Mendelian randomization (MR) in summary-level data for AD (N = 455,258) and summary-level data for seven SRPs (sample size ranges from 359,916 to 1,331,010). AD shared a strong genetic basis with insomnia (r _g = 0.20; p = 9.70 × 10^-5), snoring (r _g = 0.13; p = 2.45 × 10^-3), and sleep duration (r _g = -0.11; p = 1.18 × 10^-3). The CPASSOC identifies 31 independent loci shared between AD and SRPs, including four novel shared loci. Functional analysis and the TWAS showed shared genes were enriched in liver, brain, breast, and heart tissues and highlighted the regulatory roles of immunological disorders, very-low-density lipoprotein particle clearance, triglyceride-rich lipoprotein particle clearance, chylomicron remnant clearance, and positive regulation of T-cell-mediated cytotoxicity pathways. Protein-protein interaction analysis identified three potential drug target genes (APOE, MARK4, and HLA-DRA) that interacted with known FDA-approved drug target genes. The CPASSOC and TWAS demonstrated three regions 11p11.2, 6p22.3, and 16p11.2 may account for the shared basis between AD and sleep duration or snoring. MR showed insomnia had a causal effect on AD (OR_IVW = 1.02, P _IVW = 6.7 × 10^-6), and multivariate MR suggested a potential role of sleep duration and major depression in this association. Our findings provide strong evidence of shared genetics and causation between AD and sleep abnormalities and advance our understanding of the genetic overlap between them. Identifying shared drug targets and molecular pathways can be beneficial for treating AD and sleep disorders more efficiently.

Collapse

Katsonis P, Wilhelm K, Williams A, Lichtarge O. Genome interpretation using in silico predictors of variant impact. Hum Genet 2022;141:1549-1577. [PMID: 35488922 PMCID: PMC9055222 DOI: 10.1007/s00439-022-02457-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 04/17/2022] [Indexed: 02/06/2023]

Chen L, Wang Y, Zhao F. Exploiting deep transfer learning for the prediction of functional non-coding variants using genomic sequence. Bioinformatics 2022;38:3164-3172. [PMID: 35389435 PMCID: PMC9890318 DOI: 10.1093/bioinformatics/btac214] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 03/04/2022] [Accepted: 04/06/2022] [Indexed: 02/04/2023] Open

Analysis of missense variants in the human genome reveals widespread gene-specific clustering and improves prediction of pathogenicity. Am J Hum Genet 2022;109:457-470. [PMID: 35120630 PMCID: PMC8948164 DOI: 10.1016/j.ajhg.2022.01.006] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 01/11/2022] [Indexed: 12/11/2022] Open

Li X, Yung G, Zhou H, Sun R, Li Z, Hou K, Zhang MJ, Liu Y, Arapoglou T, Wang C, Ionita-Laza I, Lin X. A multi-dimensional integrative scoring framework for predicting functional variants in the human genome. Am J Hum Genet 2022;109:446-456. [PMID: 35216679 PMCID: PMC8948160 DOI: 10.1016/j.ajhg.2022.01.017] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 01/26/2022] [Indexed: 12/26/2022] Open

Anderson D, Lassmann T. An expanded phenotype centric benchmark of variant prioritisation tools. Hum Mutat 2022;43:539-546. [PMID: 35224813 PMCID: PMC9313608 DOI: 10.1002/humu.24362] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 01/18/2022] [Accepted: 02/23/2022] [Indexed: 11/17/2022]

DVPred: a disease-specific prediction tool for variant pathogenicity classification for hearing loss. Hum Genet 2022;141:401-411. [PMID: 35182233 DOI: 10.1007/s00439-022-02440-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 02/06/2022] [Indexed: 02/08/2023]

Abstract

Numerous computational prediction tools have been introduced to estimate the functional impact of variants in the human genome based on evolutionary constraints and biochemical metrics. However, their implementation in diagnostic settings to classify variants faced challenges with accuracy and validity. Most existing tools are pan-genome and pan-diseases, which neglected gene- and disease-specific properties and limited the accessibility of curated data. As a proof-of-concept, we developed a disease-specific prediction tool named Deafness Variant deleteriousness Prediction tool (DVPred) that focused on the 157 genes reportedly causing genetic hearing loss (HL). DVPred applied the gradient boosting decision tree (GBDT) algorithm to the dataset consisting of expert-curated pathogenic and benign variants from a large in-house HL patient cohort and public databases. With the incorporation of variant-level and gene-level features, DVPred outperformed the existing universal tools. It boasts an area under the curve (AUC) of 0.98, and showed consistent performance (AUC = 0.985) in an independent assessment dataset. We further demonstrated that multiple gene-level metrics, including low complexity genomic regions and substitution intolerance scores, were the top features of the model. A comprehensive analysis of missense variants showed a gene-specific ratio of predicted deleterious and neutral variants, implying varied tolerance or intolerance to variation in different genes. DVPred explored the utility of disease-specific strategy in improving the deafness variant prediction tool. It can improve the prioritization of pathogenic variants among massive variants identified by high-throughput sequencing on HL genes. It also shed light on the development of variant prediction tools for other genetic disorders.

Collapse

Garcia FADO, de Andrade ES, de Campos Reis Galvão H, da Silva Sábato C, Campacci N, de Paula AE, Evangelista AF, Santana IVV, Melendez ME, Reis RM, Palmero EI. New insights on familial colorectal cancer type X syndrome. Sci Rep 2022;12:2846. [PMID: 35181726 PMCID: PMC8857274 DOI: 10.1038/s41598-022-06782-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 12/17/2021] [Indexed: 12/22/2022] Open

Affiliation(s)

Felipe Antonio de Oliveira Garcia Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil
Edilene Santos de Andrade Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil
Henrique de Campos Reis Galvão Oncogenetics Department, Barretos Cancer Hospital, Barretos, São Paulo, Brazil
Cristina da Silva Sábato Center of Molecular Diagnosis, Barretos Cancer Hospital, Barretos, São Paulo, Brazil
Natália Campacci Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil
Andre Escremin de Paula Center of Molecular Diagnosis, Barretos Cancer Hospital, Barretos, São Paulo, Brazil
Adriane Feijó Evangelista Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil
Iara Viana Vidigal Santana Pathology Department, Barretos Cancer Hospital, Barretos, São Paulo, Brazil
Matias Eliseo Melendez Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil.,Department of Molecular Carcinogenesis, Brazilian National Cancer Institute, Rio de Janeiro, Brazil
Rui Manuel Reis Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil.,Center of Molecular Diagnosis, Barretos Cancer Hospital, Barretos, São Paulo, Brazil.,Life and Health Sciences Research Institute (ICVS), Medical School, University of Minho, Braga, Portugal.,ICVS/3B's-PT Government Associate Laboratory, Braga/Guimarães, Portugal
Edenir Inez Palmero Molecular Oncology Research Center, Barretos Cancer Hospital, Antenor Duarte Villela Street, 1331, Barretos, São Paulo, CEP 14784-400, Brazil. .,Department of Genetics, Brazilian National Cancer Institute, Rio de Janeiro, Brazil.

Collapse

Khatiwada A, Wolf BJ, Yilmaz AS, Ramos PS, Pietrzak M, Lawson A, Hunt KJ, Kim HJ, Chung D. GPA-Tree: statistical approach for functional-annotation-tree-guided prioritization of GWAS results. Bioinformatics 2022;38:1067-1074. [PMID: 34849578 PMCID: PMC10060690 DOI: 10.1093/bioinformatics/btab802] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 10/09/2021] [Accepted: 11/23/2021] [Indexed: 02/03/2023] Open

Computational Resources for the Interpretation of Variations in Cancer. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022;1361:177-198. [DOI: 10.1007/978-3-030-91836-1_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Cao Z, Huang Y, Duan R, Jin P, Qin ZS, Zhang S. Disease category-specific annotation of variants using an ensemble learning framework. Brief Bioinform 2021;23:6394995. [PMID: 34643213 DOI: 10.1093/bib/bbab438] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 09/03/2021] [Accepted: 09/22/2021] [Indexed: 02/01/2023] Open

Zhou Q, Wang J, Xia L, Li R, Zhang Q, Pan S. SYN1 Mutation Causes X-Linked Toothbrushing Epilepsy in a Chinese Family. Front Neurol 2021;12:736977. [PMID: 34616357 PMCID: PMC8488375 DOI: 10.3389/fneur.2021.736977] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 08/25/2021] [Indexed: 11/15/2022] Open

Wu Y, Liu H, Li R, Sun S, Weile J, Roth FP. Improved pathogenicity prediction for rare human missense variants. Am J Hum Genet 2021;108:1891-1906. [PMID: 34551312 PMCID: PMC8546039 DOI: 10.1016/j.ajhg.2021.08.012] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 08/18/2021] [Indexed: 01/01/2023] Open

Hutchinson A, Reales G, Willis T, Wallace C. Leveraging auxiliary data from arbitrary distributions to boost GWAS discovery with Flexible cFDR. PLoS Genet 2021;17:e1009853. [PMID: 34669738 PMCID: PMC8559959 DOI: 10.1371/journal.pgen.1009853] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 11/01/2021] [Accepted: 09/30/2021] [Indexed: 12/15/2022] Open

Jin Y, Jiang J, Wang R, Qin ZS. Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity. Front Genet 2021;12:667866. [PMID: 34567058 PMCID: PMC8458901 DOI: 10.3389/fgene.2021.667866] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 08/02/2021] [Indexed: 02/01/2023] Open

Kim HY, Jeon W, Kim D. An enhanced variant effect predictor based on a deep generative model and the Born-Again Networks. Sci Rep 2021;11:19127. [PMID: 34580383 PMCID: PMC8476491 DOI: 10.1038/s41598-021-98693-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 09/07/2021] [Indexed: 11/09/2022] Open

Fisher V, Sebastiani P, Cupples LA, Liu CT. ANNORE: Genetic fine mapping with functional annotation. Hum Mol Genet 2021;31:32-40. [PMID: 34302344 DOI: 10.1093/hmg/ddab210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 06/30/2021] [Accepted: 07/19/2021] [Indexed: 11/13/2022] Open

Huang Y, Sun X, Jiang H, Yu S, Robins C, Armstrong MJ, Li R, Mei Z, Shi X, Gerasimov ES, De Jager PL, Bennett DA, Wingo AP, Jin P, Wingo TS, Qin ZS. A machine learning approach to brain epigenetic analysis reveals kinases associated with Alzheimer's disease. Nat Commun 2021;12:4472. [PMID: 34294691 PMCID: PMC8298578 DOI: 10.1038/s41467-021-24710-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2020] [Accepted: 06/28/2021] [Indexed: 12/21/2022] Open

Seaby EG, Ennis S. Challenges in the diagnosis and discovery of rare genetic disorders using contemporary sequencing technologies. Brief Funct Genomics 2021;19:243-258. [PMID: 32393978 DOI: 10.1093/bfgp/elaa009] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open