1
|
Lin YJ, Menon AS, Hu Z, Brenner SE. Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors. Hum Genomics 2024; 18:90. [PMID: 39198917 PMCID: PMC11360829 DOI: 10.1186/s40246-024-00663-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Accepted: 08/19/2024] [Indexed: 09/01/2024] Open
Abstract
BACKGROUND Variant interpretation is essential for identifying patients' disease-causing genetic variants amongst the millions detected in their genomes. Hundreds of Variant Impact Predictors (VIPs), also known as Variant Effect Predictors (VEPs), have been developed for this purpose, with a variety of methodologies and goals. To facilitate the exploration of available VIP options, we have created the Variant Impact Predictor database (VIPdb). RESULTS The Variant Impact Predictor database (VIPdb) version 2 presents a collection of VIPs developed over the past three decades, summarizing their characteristics, ClinGen calibrated scores, CAGI assessment results, publication details, access information, and citation patterns. We previously summarized 217 VIPs and their features in VIPdb in 2019. Building upon this foundation, we identified and categorized an additional 190 VIPs, resulting in a total of 407 VIPs in VIPdb version 2. The majority of the VIPs have the capacity to predict the impacts of single nucleotide variants and nonsynonymous variants. More VIPs tailored to predict the impacts of insertions and deletions have been developed since the 2010s. In contrast, relatively few VIPs are dedicated to the prediction of splicing, structural, synonymous, and regulatory variants. The increasing rate of citations to VIPs reflects the ongoing growth in their use, and the evolving trends in citations reveal development in the field and individual methods. CONCLUSIONS VIPdb version 2 summarizes 407 VIPs and their features, potentially facilitating VIP exploration for various variant interpretation applications. VIPdb is available at https://genomeinterpretation.org/vipdb.
Collapse
Affiliation(s)
- Yu-Jen Lin
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
- Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
| | - Arul S Menon
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, CA, 94720, USA
| | - Zhiqiang Hu
- Department of Plant and Microbial Biology, University of California, 111 Koshland Hall #3102, Berkeley, CA, 94720-3102, USA
- Illumina, Foster City, CA, 94404, USA
| | - Steven E Brenner
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
- Center for Computational Biology, University of California, Berkeley, CA, 94720, USA.
- College of Computing, Data Science, and Society, University of California, Berkeley, CA, 94720, USA.
- Department of Plant and Microbial Biology, University of California, 111 Koshland Hall #3102, Berkeley, CA, 94720-3102, USA.
| |
Collapse
|
2
|
Villani RM, McKenzie ME, Davidson AL, Spurdle AB. Regional-specific calibration enables application of computational evidence for clinical classification of 5' cis-regulatory variants in Mendelian disease. Am J Hum Genet 2024; 111:1301-1315. [PMID: 38815586 PMCID: PMC11267523 DOI: 10.1016/j.ajhg.2024.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 05/02/2024] [Accepted: 05/03/2024] [Indexed: 06/01/2024] Open
Abstract
To date, clinical genetic testing for Mendelian disease variants has focused heavily on exonic coding and intronic gene regions. This multi-step study was undertaken to provide an evidence base for selecting and applying computational approaches for use in clinical classification of 5' cis-regulatory region variants. Curated datasets of clinically reported disease-causing 5' cis-regulatory region variants and variants from matched genomic regions in population controls were used to calibrate six bioinformatic tools as predictors of variant pathogenicity. Likelihood ratio estimates were aligned to code weights following ClinGen recommendations for application of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) classification scheme. Considering code assignment across all reference dataset variants, performance was best for CADD (81.2%) and REMM (81.5%). Optimized thresholds provided moderate evidence toward pathogenicity (CADD, REMM) and moderate (CADD) or supporting (REMM) evidence against pathogenicity. Both sensitivity and specificity of prediction were improved when further categorizing variants based on location in an EPDnew-defined promoter region. Combining predictions (CADD, REMM, and location in a promoter region) increased specificity at the expense of sensitivity. Importantly, the optimal CADD thresholds for assigning ACMG/AMP codes PP3 (≥10) and BP4 (≤8) were vastly different from recommendations for protein-coding variants (PP3 ≥25.3; BP4 ≤22.7); CADD <22.7 would incorrectly assign BP4 for >90% of reported disease-causing cis-regulatory region variants. Our results demonstrate the need to consider a tiered approach and tailored score thresholds to optimize bioinformatic impact prediction for clinical classification of 5' cis-regulatory region variants.
Collapse
Affiliation(s)
- Rehan M Villani
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Maddison E McKenzie
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Aimee L Davidson
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Amanda B Spurdle
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia; University of Queensland, Brisbane, Queensland, Australia.
| |
Collapse
|
3
|
Lin YJ, Menon AS, Hu Z, Brenner SE. Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600283. [PMID: 38979289 PMCID: PMC11230257 DOI: 10.1101/2024.06.25.600283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Background Variant interpretation is essential for identifying patients' disease-causing genetic variants amongst the millions detected in their genomes. Hundreds of Variant Impact Predictors (VIPs), also known as Variant Effect Predictors (VEPs), have been developed for this purpose, with a variety of methodologies and goals. To facilitate the exploration of available VIP options, we have created the Variant Impact Predictor database (VIPdb). Results The Variant Impact Predictor database (VIPdb) version 2 presents a collection of VIPs developed over the past 25 years, summarizing their characteristics, ClinGen calibrated scores, CAGI assessment results, publication details, access information, and citation patterns. We previously summarized 217 VIPs and their features in VIPdb in 2019. Building upon this foundation, we identified and categorized an additional 186 VIPs, resulting in a total of 403 VIPs in VIPdb version 2. The majority of the VIPs have the capacity to predict the impacts of single nucleotide variants and nonsynonymous variants. More VIPs tailored to predict the impacts of insertions and deletions have been developed since the 2010s. In contrast, relatively few VIPs are dedicated to the prediction of splicing, structural, synonymous, and regulatory variants. The increasing rate of citations to VIPs reflects the ongoing growth in their use, and the evolving trends in citations reveal development in the field and individual methods. Conclusions VIPdb version 2 summarizes 403 VIPs and their features, potentially facilitating VIP exploration for various variant interpretation applications. Availability VIPdb version 2 is available at https://genomeinterpretation.org/vipdb.
Collapse
Affiliation(s)
- Yu-Jen Lin
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Center for Computational Biology, University of California, Berkeley, California 94720, USA
| | - Arul S. Menon
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, USA
| | - Zhiqiang Hu
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
- Currently at: Illumina, Foster City, California 94404, USA
| | - Steven E. Brenner
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Center for Computational Biology, University of California, Berkeley, California 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
| |
Collapse
|
4
|
Huang DL, Zeng Q, Xiong Y, Liu S, Pang C, Xia M, Fang T, Ma Y, Qiang C, Zhang Y, Zhang Y, Li H, Yuan Y. A Combined Manual Annotation and Deep-Learning Natural Language Processing Study on Accurate Entity Extraction in Hereditary Disease Related Biomedical Literature. Interdiscip Sci 2024; 16:333-344. [PMID: 38340264 PMCID: PMC11289304 DOI: 10.1007/s12539-024-00605-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 01/02/2024] [Accepted: 01/03/2024] [Indexed: 02/12/2024]
Abstract
We report a combined manual annotation and deep-learning natural language processing study to make accurate entity extraction in hereditary disease related biomedical literature. A total of 400 full articles were manually annotated based on published guidelines by experienced genetic interpreters at Beijing Genomics Institute (BGI). The performance of our manual annotations was assessed by comparing our re-annotated results with those publicly available. The overall Jaccard index was calculated to be 0.866 for the four entity types-gene, variant, disease and species. Both a BERT-based large name entity recognition (NER) model and a DistilBERT-based simplified NER model were trained, validated and tested, respectively. Due to the limited manually annotated corpus, Such NER models were fine-tuned with two phases. The F1-scores of BERT-based NER for gene, variant, disease and species are 97.28%, 93.52%, 92.54% and 95.76%, respectively, while those of DistilBERT-based NER are 95.14%, 86.26%, 91.37% and 89.92%, respectively. Most importantly, the entity type of variant has been extracted by a large language model for the first time and a comparable F1-score with the state-of-the-art variant extraction model tmVar has been achieved.
Collapse
Affiliation(s)
- Dao-Ling Huang
- BGI Research, Shenzhen, 518083, China.
- Clinical Laboratory of BGI Health, BGI-Shenzhen, Shenzhen, 518083, China.
| | - Quanlei Zeng
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Yun Xiong
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Shuixia Liu
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Chaoqun Pang
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Menglei Xia
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Ting Fang
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Yanli Ma
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Cuicui Qiang
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Yi Zhang
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Yu Zhang
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Hong Li
- BGI-Wuhan Clinical Laboratories, BGI-Shenzhen, Wuhan, 430074, China
| | - Yuying Yuan
- Clinical Laboratory of BGI Health, BGI-Shenzhen, Shenzhen, 518083, China
| |
Collapse
|
5
|
Huang J, Osthushenrich T, MacNamara A, Mälarstig A, Brocchetti S, Bradberry S, Scarabottolo L, Ferrada E, Sosnin S, Digles D, Superti-Furga G, Ecker GF. ProteoMutaMetrics: machine learning approaches for solute carrier family 6 mutation pathogenicity prediction. RSC Adv 2024; 14:13083-13094. [PMID: 38655474 PMCID: PMC11034476 DOI: 10.1039/d4ra00748d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 03/25/2024] [Indexed: 04/26/2024] Open
Abstract
The solute carrier transporter family 6 (SLC6) is of key interest for their critical role in the transport of small amino acids or amino acid-like molecules. Their dysfunction is strongly associated with human diseases such as including schizophrenia, depression, and Parkinson's disease. Linking single point mutations to disease may support insights into the structure-function relationship of these transporters. This work aimed to develop a computational model for predicting the potential pathogenic effect of single point mutations in the SLC6 family. Missense mutation data was retrieved from UniProt, LitVar, and ClinVar, covering multiple protein-coding transcripts. As encoding approach, amino acid descriptors were used to calculate the average sequence properties for both original and mutated sequences. In addition to the full-sequence calculation, the sequences were cut into twelve domains. The domains are defined according to the transmembrane domains of the SLC6 transporters to analyse the regions' contributions to the pathogenicity prediction. Subsequently, several classification models, namely Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) with the hyperparameters optimized through grid search were built. For estimation of model performance, repeated stratified k-fold cross-validation was used. The accuracy values of the generated models are in the range of 0.72 to 0.80. Analysis of feature importance indicates that mutations in distinct regions of SLC6 transporters are associated with an increased risk for pathogenicity. When applying the model on an independent validation set, the performance in accuracy dropped to averagely 0.6 with high precision but low sensitivity scores.
Collapse
Affiliation(s)
- Jiahui Huang
- University of Vienna, Department of Pharmaceutical Sciences Vienna Austria
| | - Tanja Osthushenrich
- Bayer AG, Division Pharmaceuticals, Biomedical Data Science II Wuppertal Germany
| | - Aidan MacNamara
- Bayer AG, Division Pharmaceuticals, Biomedical Data Science II Wuppertal Germany
| | - Anders Mälarstig
- Emerging Science & Innovation, Pfizer Worldwide Research, Development and Medical Cambridge MA USA
| | | | | | | | - Evandro Ferrada
- CeMM, Research Center for Molecular Medicine of the Austrian Academy of Sciences Vienna Austria
| | - Sergey Sosnin
- University of Vienna, Department of Pharmaceutical Sciences Vienna Austria
| | - Daniela Digles
- University of Vienna, Department of Pharmaceutical Sciences Vienna Austria
| | - Giulio Superti-Furga
- CeMM, Research Center for Molecular Medicine of the Austrian Academy of Sciences Vienna Austria
| | - Gerhard F Ecker
- University of Vienna, Department of Pharmaceutical Sciences Vienna Austria
| |
Collapse
|
6
|
Wermers Z, Yoo S, Radenbaugh B, Douglass A, Biesecker LG, Johnston JJ. Comparison of literature mining tools for variant classification: Through the lens of 50 RYR1 variants. Genet Med 2024; 26:101083. [PMID: 38281099 DOI: 10.1016/j.gim.2024.101083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 01/19/2024] [Accepted: 01/22/2024] [Indexed: 01/29/2024] Open
Abstract
PURPOSE The American College of Medical Genetics and Genomics and the Association for Molecular Pathology have outlined a schema that allows for systematic classification of variant pathogenicity. Although gnomAD is generally accepted as a reliable source of population frequency data and ClinGen has provided guidance on the utility of specific bioinformatic predictors, there is no consensus source for identifying publications relevant to a variant. Multiple tools are available to aid in the identification of relevant variant literature, including manually curated databases and literature search engines. We set out to determine the utility of 4 literature mining tools used for ascertainment to inform the discussion of the use of these tools. METHODS Four literature mining tools including the Human Gene Mutation Database, Mastermind, ClinVar, and LitVar 2.0 were used to identify relevant variant literature for 50 RYR1 variants. Sensitivity and precision were determined for each tool. RESULTS Sensitivity among the 4 tools ranged from 0.332 to 0.687. Precision ranged from 0.389 to 0.906. No single tool retrieved all relevant publications. CONCLUSION At the current time, the use of multiple tools is necessary to completely identify the literature relevant to curate a variant.
Collapse
Affiliation(s)
- Zara Wermers
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
| | - Seeley Yoo
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
| | - Bailey Radenbaugh
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
| | - Amber Douglass
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
| | - Leslie G Biesecker
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
| | - Jennifer J Johnston
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD.
| |
Collapse
|
7
|
De Paoli F, Berardelli S, Limongelli I, Rizzo E, Zucca S. VarChat: the generative AI assistant for the interpretation of human genomic variations. Bioinformatics 2024; 40:btae183. [PMID: 38579245 PMCID: PMC11055464 DOI: 10.1093/bioinformatics/btae183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 03/05/2024] [Accepted: 04/04/2024] [Indexed: 04/07/2024] Open
Abstract
MOTIVATION In the modern era of genomic research, the scientific community is witnessing an explosive growth in the volume of published findings. While this abundance of data offers invaluable insights, it also places a pressing responsibility on genetic professionals and researchers to stay informed about the latest findings and their clinical significance. Genomic variant interpretation is currently facing a challenge in identifying the most up-to-date and relevant scientific papers, while also extracting meaningful information to accelerate the process from clinical assessment to reporting. Computer-aided literature search and summarization can play a pivotal role in this context. By synthesizing complex genomic findings into concise, interpretable summaries, this approach facilitates the translation of extensive genomic datasets into clinically relevant insights. RESULTS To bridge this gap, we present VarChat (varchat.engenome.com), an innovative tool based on generative AI, developed to find and summarize the fragmented scientific literature associated with genomic variants into brief yet informative texts. VarChat provides users with a concise description of specific genetic variants, detailing their impact on related proteins and possible effects on human health. In addition, VarChat offers direct links to related scientific trustable sources, and encourages deeper research. AVAILABILITY AND IMPLEMENTATION varchat.engenome.com.
Collapse
Affiliation(s)
| | - Silvia Berardelli
- enGenome srl, via Ferrata, 5, Pavia, 27100, Italy
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, via Ferrata, 5, Pavia, 27100, Italy
| | | | - Ettore Rizzo
- enGenome srl, via Ferrata, 5, Pavia, 27100, Italy
| | | |
Collapse
|
8
|
Rajcsanyi LS, Zheng Y, Herpertz-Dahlmann B, Seitz J, de Zwaan M, Herzog W, Ehrlich S, Zipfel S, Giel K, Egberts K, Burghardt R, Föcker M, Antel J, Fischer-Posovszky P, Hebebrand J, Hinney A. Unexpected identification of obesity-associated mutations in LEP and MC4R genes in patients with anorexia nervosa. Sci Rep 2024; 14:7067. [PMID: 38528040 PMCID: PMC10963783 DOI: 10.1038/s41598-024-57517-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 03/19/2024] [Indexed: 03/27/2024] Open
Abstract
Mutations leading to a reduced or loss of function in genes of the leptin-melanocortin system confer a risk for monogenic forms of obesity. Yet, gain of function variants in the melanocortin-4-receptor (MC4R) gene predispose to a lower BMI. In individuals with reduced body weight, we thus expected mutations leading to an enhanced function in the respective genes, like leptin (LEP) and MC4R. Therefore, we have Sanger sequenced the coding regions of LEP and MC4R in 462 female patients with anorexia nervosa (AN), and 445 healthy-lean controls. In total, we have observed four and eight variants in LEP and MC4R, respectively. Previous studies showed different functional in vitro effects for the detected frameshift and non-synonymous variants: (1) LEP: reduced/loss of function (p.Val94Met), (2) MC4R: gain of function (p.Val103Ile, p.Ile251Leu), reduced or loss of function (p.Thr112Met, p.Ser127Leu, p.Leu211fsX) and without functional in vitro data (p.Val50Leut). In LEP, the variant p.Val94Met was detected in one patient with AN. For MC4R variants, one patient with AN carried the frameshift variant p.Leu211fsX. One patient with AN was heterozygous for two variants at the MC4R (p.Val103Ile and p.Ser127Leu). All other functionally relevant variants were detected in similar frequencies in patients with AN and lean individuals.
Collapse
Affiliation(s)
- Luisa Sophie Rajcsanyi
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Virchowstraße 174, 45147, Essen, Germany.
- Center for Translational Neuro- and Behavioural Sciences, University Hospital Essen, Essen, Germany.
- Section for Molecular Genetics of Mental Disorders, University Hospital Essen, Essen, Germany.
- Institute of Sex- and Gender-Sensitive Medicine, University Hospital Essen, Essen, Germany.
| | - Yiran Zheng
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Virchowstraße 174, 45147, Essen, Germany
- Center for Translational Neuro- and Behavioural Sciences, University Hospital Essen, Essen, Germany
| | - Beate Herpertz-Dahlmann
- Department of Child and Adolescent Psychiatry and Psychotherapy, University Hospital of the RWTH Aachen, Aachen, Germany
| | - Jochen Seitz
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Virchowstraße 174, 45147, Essen, Germany
- Department of Child and Adolescent Psychiatry and Psychotherapy, University Hospital of the RWTH Aachen, Aachen, Germany
| | - Martina de Zwaan
- Department of Psychosomatic Medicine and Psychotherapy, Hannover Medical School, Hannover, Germany
| | - Wolfgang Herzog
- Department of Internal Medicine II, General Internal and Psychosomatic Medicine, University of Heidelberg, Heidelberg, Germany
| | - Stefan Ehrlich
- Eating Disorders Research and Treatment Center, Department of Child and Adolescent Psychiatry, Faculty of Medicine, TU Dresden, Dresden, Germany
- Translational Developmental Neuroscience Section, Division of Psychological and Social Medicine and Developmental Neurosciences, Faculty of Medicine, TU Dresden, Dresden, Germany
| | - Stephan Zipfel
- Department of Psychosomatic Medicine and Psychotherapy, Medical University Hospital Tübingen, Tübingen, Germany
- Center of Excellence in Eating Disorders KOMET, Tübingen, Germany
| | - Katrin Giel
- Department of Psychosomatic Medicine and Psychotherapy, Medical University Hospital Tübingen, Tübingen, Germany
- Center of Excellence in Eating Disorders KOMET, Tübingen, Germany
| | - Karin Egberts
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Würzburg, Würzburg, Germany
| | - Roland Burghardt
- Oberberg Clinic for Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, Fasanenkiez, Berlin, Germany
| | - Manuel Föcker
- Department of Child and Adolescent Psychiatry, University Hospital Münster, Munster, Germany
- LWL-University Hospital Hamm for Child and Adolescent Psychiatry, Ruhr-University Bochum, Hamm, Germany
| | - Jochen Antel
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Virchowstraße 174, 45147, Essen, Germany
- Center for Translational Neuro- and Behavioural Sciences, University Hospital Essen, Essen, Germany
| | | | - Johannes Hebebrand
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Virchowstraße 174, 45147, Essen, Germany
- Center for Translational Neuro- and Behavioural Sciences, University Hospital Essen, Essen, Germany
| | - Anke Hinney
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Virchowstraße 174, 45147, Essen, Germany
- Center for Translational Neuro- and Behavioural Sciences, University Hospital Essen, Essen, Germany
- Section for Molecular Genetics of Mental Disorders, University Hospital Essen, Essen, Germany
- Institute of Sex- and Gender-Sensitive Medicine, University Hospital Essen, Essen, Germany
| |
Collapse
|
9
|
Asadi-Pooya AA, Malekpour M, Taherifard E, Mallahzadeh A, Farjoud Kouhanjani M. Coexistence of temporal lobe epilepsy and idiopathic generalized epilepsy. Epilepsy Behav 2024; 151:109602. [PMID: 38160579 DOI: 10.1016/j.yebeh.2023.109602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 12/07/2023] [Accepted: 12/21/2023] [Indexed: 01/03/2024]
Abstract
OBJECTIVE We investigated the frequency of coexistence of temporal lobe epilepsy (TLE) and idiopathic generalized epilepsy (IGE) in a retrospective database study. We also explored the underlying pathomechanisms of the coexistence of TLE and IGE based on the available information, using bioinformatics tools. METHODS The first phase of the investigation was a retrospective study. All patients with an electro-clinical diagnosis of epilepsy were studied at the outpatient epilepsy clinic at Shiraz University of Medical Sciences, Shiraz, Iran, from 2008 until 2023. In the second phase, we searched the following databases for genetic variations (epilepsy-associated genetic polymorphisms) that are associated with TLE or syndromes of IGE: DisGeNET, genome-wide association study (GWAS) Catalog, epilepsy genetic association database (epiGAD), and UniProt. We also did a separate literature search using PubMed. RESULTS In total, 3760 patients with epilepsy were registered at our clinic; four patients with definitely mixed TLE and IGE were identified; 0.1% of all epilepsies. We could identify that rs1883415 of ALDH5A1, rs137852779 of EFHC1, rs211037 of GABRG2, rs1130183 of KCNJ10, and rs1045642 of ABCB1 genes are shared between TLE and syndromes of IGE. CONCLUSION While coexistence of TLE and IGE is a rare phenomenon, this could be explained by shared genetic variations.
Collapse
Affiliation(s)
- Ali A Asadi-Pooya
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran; Jefferson Comprehensive Epilepsy Center, Department of Neurology, Thomas Jefferson University, Philadelphia, PA, USA.
| | - Mahdi Malekpour
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Ehsan Taherifard
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Arashk Mallahzadeh
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | | |
Collapse
|
10
|
Jin Q, Leaman R, Lu Z. PubMed and beyond: biomedical literature search in the age of artificial intelligence. EBioMedicine 2024; 100:104988. [PMID: 38306900 PMCID: PMC10850402 DOI: 10.1016/j.ebiom.2024.104988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Revised: 01/14/2024] [Accepted: 01/15/2024] [Indexed: 02/04/2024] Open
Abstract
Biomedical research yields vast information, much of which is only accessible through the literature. Consequently, literature search is crucial for healthcare and biomedicine. Recent improvements in artificial intelligence (AI) have expanded functionality beyond keywords, but they might be unfamiliar to clinicians and researchers. In response, we present an overview of over 30 literature search tools tailored to common biomedical use cases, aiming at helping readers efficiently fulfill their information needs. We first discuss recent improvements and continued challenges of the widely used PubMed. Then, we describe AI-based literature search tools catering to five specific information needs: 1. Evidence-based medicine. 2. Precision medicine and genomics. 3. Searching by meaning, including questions. 4. Finding related articles with literature recommendation. 5. Discovering hidden associations through literature mining. Finally, we discuss the impacts of recent developments of large language models such as ChatGPT on biomedical information seeking.
Collapse
Affiliation(s)
- Qiao Jin
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Robert Leaman
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA.
| |
Collapse
|
11
|
Savage SR, Zhang Y, Jaehnig EJ, Liao Y, Shi Z, Pham HA, Xu H, Zhang B. IDPpub: Illuminating the Dark Phosphoproteome Through PubMed Mining. Mol Cell Proteomics 2024; 23:100682. [PMID: 37993103 PMCID: PMC10716774 DOI: 10.1016/j.mcpro.2023.100682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 10/25/2023] [Accepted: 11/14/2023] [Indexed: 11/24/2023] Open
Abstract
Global phosphoproteomics experiments quantify tens of thousands of phosphorylation sites. However, data interpretation is hampered by our limited knowledge on functions, biological contexts, or precipitating enzymes of the phosphosites. This study establishes a repository of phosphosites with associated evidence in biomedical abstracts, using deep learning-based natural language processing techniques. Our model for illuminating the dark phosphoproteome through PubMed mining (IDPpub) was generated by fine-tuning BioBERT, a deep learning tool for biomedical text mining. Trained using sentences containing protein substrates and phosphorylation site positions from 3000 abstracts, the IDPpub model was then used to extract phosphorylation sites from all MEDLINE abstracts. The extracted proteins were normalized to gene symbols using the National Center for Biotechnology Information gene query, and sites were mapped to human UniProt sequences using ProtMapper and mouse UniProt sequences by direct match. Precision and recall were calculated using 150 curated abstracts, and utility was assessed by analyzing the CPTAC (Clinical Proteomics Tumor Analysis Consortium) pan-cancer phosphoproteomics datasets and the PhosphoSitePlus database. Using 10-fold cross validation, pairs of correct substrates and phosphosite positions were extracted with an average precision of 0.93 and recall of 0.94. After entity normalization and site mapping to human reference sequences, an independent validation achieved a precision of 0.91 and recall of 0.77. The IDPpub repository contains 18,458 unique human phosphorylation sites with evidence sentences from 58,227 abstracts and 5918 mouse sites in 14,610 abstracts. This included evidence sentences for 1803 sites identified in CPTAC studies that are not covered by manually curated functional information in PhosphoSitePlus. Evaluation results demonstrate the potential of IDPpub as an effective biomedical text mining tool for collecting phosphosites. Moreover, the repository (http://idppub.ptmax.org), which can be automatically updated, can serve as a powerful complement to existing resources.
Collapse
Affiliation(s)
- Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Zhiao Shi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | - Hua Xu
- Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, Connecticut, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA.
| |
Collapse
|
12
|
Poli G, Demontis GC, Sodi A, Saba A, Rizzo S, Macchia M, Tuccinardi T. An in silico toolbox for the prediction of the potential pathogenic effects of missense mutations in the dimeric region of hRPE65. J Enzyme Inhib Med Chem 2023; 38:2162047. [PMID: 36629452 PMCID: PMC9848331 DOI: 10.1080/14756366.2022.2162047] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
hRPE65 is a fundamental enzyme of the retinoid visual cycle, and many missense mutations affecting its expression or function are associated with a wide range of diseases. Many hRPE65 missense mutations lack a clear pathogenicity classification or are labelled as VUS. In this context, we recently developed a protocol based on µs-long molecular dynamics simulations to study the potential pathogenic effect of hRPE65 missense mutations. In the present work, the structure-based protocol was integrated with a hRPE65-tailored consensus bioinformatics strategy, named ConPath, that showed high performance in predicting known pathogenic/benign hRPE65 missense mutations. The combined strategy was used to perform a multi-level evaluation of the potential pathogenicity of 13 different hRPE65 VUS, which were classified based on their likelihood of pathogenic effect. The obtained results provide information that may support the reclassification of these VUS and help clinicians evaluate the eligibility for gene therapy of patients diagnosed with such variants.
Collapse
Affiliation(s)
- Giulio Poli
- Department of Pharmacy, University of Pisa, Pisa, Italy
| | | | - Andrea Sodi
- Department of Neurosciences, Psychology, Drug Research and Child Health Eye Clinic, University of Florence, AOU Careggi, Florence, Italy
| | - Alessandro Saba
- Department of Surgical Pathology, Molecular Medicine and of the Critical Area, University of Pisa, Pisa, Italy
| | - Stanislao Rizzo
- Ophthalmology Unit, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy,Catholic University Sacro Cuore, Rome, Italy,Consiglio Nazionale delle Ricerche, Istituto di Neuroscienze, Pisa, Italy
| | - Marco Macchia
- Department of Pharmacy, University of Pisa, Pisa, Italy
| | - Tiziano Tuccinardi
- Department of Pharmacy, University of Pisa, Pisa, Italy,CONTACT Tiziano Tuccinardi Department of Pharmacy, University of Pisa, Via Bonanno 6, Pisa, 56126, Italy
| |
Collapse
|
13
|
Nezamuldeen L, Jafri MS. Protein-Protein Interaction Network Extraction Using Text Mining Methods Adds Insight into Autism Spectrum Disorder. BIOLOGY 2023; 12:1344. [PMID: 37887054 PMCID: PMC10604135 DOI: 10.3390/biology12101344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 10/02/2023] [Accepted: 10/12/2023] [Indexed: 10/28/2023]
Abstract
Text mining methods are being developed to assimilate the volume of biomedical textual materials that are continually expanding. Understanding protein-protein interaction (PPI) deficits would assist in explaining the genesis of diseases. In this study, we designed an automated system to extract PPIs from the biomedical literature that uses a deep learning sentence classification model, a pretrained word embedding, and a BiLSTM recurrent neural network with additional layers, a conditional random field (CRF) named entity recognition (NER) model, and shortest-dependency path (SDP) model using the SpaCy library in Python. The automated system ensures that it targets sentences that contain PPIs and not just these proteins mentioned in the framework of disease discovery or other context. Our first model achieved 13% greater precision on the Aimed/BioInfr benchmark corpus than the previous state-of-the-art BiLSTM neural network models. The NER model presented in this study achieved 98% precision on the Aimed/BioInfr corpus over previous models. In order to facilitate the production of an accurate representation of the PPI network, the processes were developed to systematically map the protein interactions in the texts. Overall, evaluating our system through the use of 6027 abstracts pertaining to seven proteins associated with Autism Spectrum Disorder completed the manually curated PPI network for these proteins. When it comes to complicated diseases, these networks would assist in understanding how PPI deficits contribute to disease development while also emphasizing the influence of interactions on protein function and biological processes.
Collapse
Affiliation(s)
- Leena Nezamuldeen
- School of Systems Biology, George Mason University, Fairfax, VA 22030, USA
- King Fahd Medical Research Centre, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
| | - Mohsin Saleet Jafri
- School of Systems Biology, George Mason University, Fairfax, VA 22030, USA
- Center for Biomedical Engineering and Technology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
14
|
Hasan MM, Nabi AN, Yasmin T. Comprehensive analysis predicting effects of deleterious SNPs of human progesterone receptor gene on its structure and functions: a computational approach. J Biomol Struct Dyn 2023; 41:8002-8017. [PMID: 36166622 DOI: 10.1080/07391102.2022.2127908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 09/17/2022] [Indexed: 10/14/2022]
Abstract
Progesterone receptor plays a crucial role in the development of the mammary gland and breast cancer. Single nucleotide polymorphisms (SNPs) within its gene, PGR, are associated with the risk of miscarriages and preterm birth as well as many cancers across different populations. The main aim of this work is to investigate the most deleterious SNPs in the PGR gene to identify potential biomarkers for various disease susceptibility and treatments. Both sequence and structure-based computational approaches were adopted and in total 11 nsSNPs have been filtered out of 674 nsSNPs along with seven non-coding SNPs. R740Q, I744T and D746E belonged to a mutation cluster. R740Q, D746E along with S865L altered H-bond interactions within the receptor. The same mutations have been found to be associated with several cancers including uterine and breast cancer among others. It is, therefore, possible that the high-risk SNPs associated with cancers may exert their effect by causing changes in the protein structure, particularly in its bonding patterns, and thus affecting its function. In addition, seven non-coding SNPs that were located in the UTR region created a new miRNA site while three SNPs disrupted a conserved miRNA site. These high-risk SNPs can play an instrumental role in generating a dataset of the PGR gene's SNPs. Thus, the present study may pave the way to design and develop novel therapeutics for overcoming the challenges associated with certain cancers and pregnancy that result from a change in the protein structure and function due to the SNP mutations in the PGR gene.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- M Mahbub Hasan
- Population Genetics Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh
| | - Ahm Nurun Nabi
- Population Genetics Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh
| | - Tahirah Yasmin
- Population Genetics Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh
| |
Collapse
|
15
|
Allot A, Wei CH, Phan L, Hefferon T, Landrum M, Rehm HL, Lu Z. Tracking genetic variants in the biomedical literature using LitVar 2.0. Nat Genet 2023; 55:901-903. [PMID: 37268776 PMCID: PMC11096795 DOI: 10.1038/s41588-023-01414-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Affiliation(s)
- Alexis Allot
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Chih-Hsuan Wei
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Lon Phan
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Timothy Hefferon
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Melissa Landrum
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Heidi L Rehm
- Program in Medical and Population Genetics, Broad institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA.
| |
Collapse
|
16
|
Andreis TF, de Souza KIW, Vieira IA, Alemar B, Sinigaglia M, de Araújo Rocha YM, Artigalás O, Bittar C, Oliveira Netto CB, Ashton-Prolla P, Rosset C. Challenges in periodic revision of genetic testing results: Comparison of the main classification guidelines and report of a retrospective analysis involving BRCA1/BRCA2 variants of uncertain significance. Gene 2023; 862:147281. [PMID: 36775216 DOI: 10.1016/j.gene.2023.147281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 01/27/2023] [Accepted: 02/07/2023] [Indexed: 02/12/2023]
Abstract
In the context of cancer predisposition syndromes, it is widely known that the correct interpretation of germline variants identified in multigene panel testing is essential for adequate genetic counseling and clinical decision making, in which variants of uncertain significance (VUS) are not considered actionable findings. Thus, their periodic re-evaluation using appropriate guidelines is notably important. In the present study, we compared the performance of the main variant classification guidelines (ACMG, Sherloc and ENIGMA) in variant reassessment, using as input a BRCA1/2 VUS case series (retrospective analysis) from Brazil, an ethnically diverse and admixed country with substantial challenges in VUS reclassification. As main findings, two of the 15 VUS analyzed were reclassified as likely pathogenic by the 3 guidelines, BRCA1 c.4987-3C > G (rs397509213) and BRCA2 c.7868A > G (rs80359012). Moreover, challenges in variant classification and reassessment are described and additional in silico data about structural impact of the variant BRCA2 c.7868A > G are provided. We hypothesize that the establishment of a framework to reassess VUS could improve this process in health centers that have not yet implemented this practice. Results of this study underscore that periodic monitoring of the functional, clinical, and bioinformatics data of a VUS by a multidisciplinary team are of utmost importance in clinical practice. When there is a specific guideline for a given gene, such as ENIGMA for BRCA1/2, it should be considered the first option for variant assessment. Finally, recruitment of VUS carriers and their relatives to participate in variant segregation studies and publication of VUS reclassification results in the international scientific literature should be encouraged.
Collapse
Affiliation(s)
- Tiago Finger Andreis
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil; Programa de Pós-Graduação em Genética e Biologia Molecular (PPGBM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Kayana Isabel Weber de Souza
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil; Programa de Pós-Graduação em Ciências Médicas: Medicina (PPGCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Igor Araujo Vieira
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil; Escola de Saúde, Universidade do Vale do Rio dos Sinos (UNISINOS), São Leopoldo, Rio Grande do Sul, Brazil
| | - Bárbara Alemar
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil
| | | | - Yasminne Marinho de Araújo Rocha
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil
| | - Osvaldo Artigalás
- Hospital Moinhos de Vento (HMV), Porto Alegre, Rio Grande do Sul, Brazil
| | - Camila Bittar
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil; Hospital Moinhos de Vento (HMV), Porto Alegre, Rio Grande do Sul, Brazil
| | | | - Patricia Ashton-Prolla
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil; Programa de Pós-Graduação em Genética e Biologia Molecular (PPGBM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil; Programa de Pós-Graduação em Ciências Médicas: Medicina (PPGCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil; Serviço de Genética Médica, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil
| | - Clévia Rosset
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil; Programa de Pós-Graduação em Ciências Médicas: Medicina (PPGCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil; Unidade de Pesquisa Laboratorial (UPL) - Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, Rio Grande do Sul, Brazil.
| |
Collapse
|
17
|
Rajalakshmi K, Thirunavukkarasu J, Vikraman MA, Maruthy S, Sylvester C, Kundapur R. Analysis of SLC26A4 Gene in Individuals with Non Syndromic Hearing Impairment in Relation with GJB2 Associated Mutations. Avicenna J Med Biotechnol 2023; 15:124-127. [PMID: 37034890 PMCID: PMC10073921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 01/07/2023] [Indexed: 04/11/2023] Open
Abstract
Background Hearing Loss (HL) is the most common sensory disorder. HL commonly ranges from mild to severe. Persons with HL face difficulty in hearing conversations or sounds through one ear or both ears, which impacts one's ability to interact with others. Hence it is a communicable disorder that makes people socially isolated, lonely, and frustrated. HL in children severely affects language development. The people who are referred to as 'Deaf' with very little or no hearing capabilities, are considered as having profound hearing loss. More than 124 genes are causative for Non-Syndromic HL (NSHL) with varying inheritance, among which the SLC26A4 mutations are the second commonest cause of hereditary HL across the globe. Methods Samples from 70 NSHL patients were analyzed through Next-Generation Sequencing (NGS) and generated five pathogenic variants [N246fs (rs918684449), K564fs (rs746427774), F122fs, V239D (rs111033256), T721M (rs121908363)] each with frequency of 1.42%. Three missense variants [S399P (rs747431002), L597S (rs55638457), and G6V (rs111033423)] were reported under the "uncertain" category. All the collected samples were further genotyped to look for the possibility of having GJB2 and HL-associated mutations. Results Out of five SLC26A4 pathogenic mutations N246fs (rs918684449) and K564fs (rs746427774) were observed in samples which were positive for GJB2-HL associated candidate mutations [W24X (rs104894396), Q124X (rs397516874) and W77X (rs80338944)]. Similarly, pathogenic variants F122fs, V239D (rs111033256) and T721M (rs121908363) were observed in patient samples which were negative for GJB2-HL associated mutations. Conclusion Our data will expand the list of variants underlying NSHL and encourage further genotype SLC26A4 gene concerning the south Indian population with a large sample size.
Collapse
Affiliation(s)
- Krishna Rajalakshmi
- Department of Audiology, All India Institute of Speech and Hearing, Naimisham Campus, Manasagangothri, Mysore, India 570006
- School of Rehabilitation and Behavioral Sciences, VMRF (DU) Aarupadai Veedu Medical College Pondicherry, India 607403
| | - Jayakumar Thirunavukkarasu
- Department of Speech-Language Sciences, All India Institute of Speech and Hearing, Naimisham Campus, Manasagangothri, Mysore, India 570006
| | - Meenu Ambika Vikraman
- Department of Audiology, All India Institute of Speech and Hearing, Naimisham Campus, Manasagangothri, Mysore, India 570006
- Department of Audiology Taluk Head Quarters Hospital, Kottarakara, Kerala, India 691506
| | - Santosh Maruthy
- Department of Speech-Language Sciences, All India Institute of Speech and Hearing, Naimisham Campus, Manasagangothri, Mysore, India 570006
| | - Charles Sylvester
- Unit for Human Genetics, All India Institute of Speech and Hearing, Naimisham Campus, Manasagangothri, Mysore, India 570006
| | - Rajesh Kundapur
- Unit for Human Genetics, All India Institute of Speech and Hearing, Naimisham Campus, Manasagangothri, Mysore, India 570006
| |
Collapse
|
18
|
Malekpour M, Salarikia SR, Kashkooli M, Asadi-Pooya AA. The genetic link between systemic autoimmune disorders and temporal lobe epilepsy: A bioinformatics study. Epilepsia Open 2023. [PMID: 36929812 DOI: 10.1002/epi4.12727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 03/11/2023] [Indexed: 03/18/2023] Open
Abstract
OBJECTIVE We aimed to explore the underlying pathomechanisms of the comorbidity between three common systemic autoimmune disorders (SADs) [i.e., insulin-dependent diabetes mellitus (IDDM), systemic lupus erythematosus (SLE), and rheumatoid arthritis (RA)] and temporal lobe epilepsy (TLE), using bioinformatics tools. We hypothesized that there are shared genetic variations among these four conditions. METHODS Different databases (DisGeNET, Harmonizome, and Enrichr) were searched to find TLE-associated genes with variants; their single nucleotide polymorphisms (SNPs) were gathered from the literature. We also did a separate literature search using PubMed with the following keywords for original articles: "TLE" or "Temporal lobe epilepsy" AND "genetic variation," "single nucleotide polymorphism," "SNP," or "genetic polymorphism." In the next step, the SNPs associated with TLE were searched in the LitVar database to find the shared gene variations with RA, SLE, and IDDM. RESULTS Ninety unique SNPs were identified to be associated with TLE. LitVar search identified two SNPs that were shared between TLE and all three SADs (i.e., IDDM, SLE, and RA). The first SNP was rs16944 on the Interleukin-1β (IL-1β) gene. The second genetic variation was ε4 variation of apolipoprotein E (APOE) gene. SIGNIFICANCE The shared genetic variations (i.e., rs16944 on the IL-1β gene and ε4 variation of the APOE gene) may explain the underlying pathomechanisms of the comorbidity between three common SADs (i.e., IDDM, SLE, and RA) and TLE. Exploring such shared genetic variations may help find targeted therapies for patients with TLE, especially those with drug-resistant seizures who also have comorbid SADs.
Collapse
Affiliation(s)
- Mahdi Malekpour
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | | | - Mohammad Kashkooli
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Ali A Asadi-Pooya
- Epilepsy Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Department of Neurology, Jefferson Comprehensive Epilepsy Center, Thomas Jefferson University, Philadelphia, Pennsylvania, USA
| |
Collapse
|
19
|
Vis JK, Santcroos MA, Kosters WA, Laros JFJ. A Boolean algebra for genetic variants. Bioinformatics 2023; 39:6967432. [PMID: 36594541 PMCID: PMC9879725 DOI: 10.1093/bioinformatics/btad001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 12/06/2022] [Accepted: 01/02/2023] [Indexed: 01/04/2023] Open
Abstract
MOTIVATION Beyond identifying genetic variants, we introduce a set of Boolean relations, which allows for a comprehensive classification of the relations of every pair of variants by taking all minimal alignments into account. We present an efficient algorithm to compute these relations, including a novel way of efficiently computing all minimal alignments within the best theoretical complexity bounds. RESULTS We show that these relations are common, and many non-trivial, for variants of the CFTR gene in dbSNP. Ultimately, we present an approach for the storing and indexing of variants in the context of a database that enables efficient querying for all these relations. AVAILABILITY AND IMPLEMENTATION A Python implementation is available at https://github.com/mutalyzer/algebra/tree/v0.2.0 as well as an interface at https://mutalyzer.nl/algebra.
Collapse
Affiliation(s)
| | - Mark A Santcroos
- Department of Human Genetics, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands,Department of Clinical Genetics, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
| | - Walter A Kosters
- Leiden Institute of Advanced Computer Science, Leiden University, 2333 CA Leiden, The Netherlands
| | - Jeroen F J Laros
- Department of Human Genetics, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands,National Institute for Public Health and the Environment (RIVM), 3721 MA Bilthoven, The Netherlands
| |
Collapse
|
20
|
Papageorgiou L, Kalospyrou E, Papakonstantinou E, Diakou I, Pierouli K, Dragoumani K, Bacopoulou F, Chrousos GP, Exarchos TP, Vlamos P, Eliopoulos E, Vlachakis D. DRDs and Brain-Derived Neurotrophic Factor Share a Common Therapeutic Ground: A Novel Bioinformatic Approach Sheds New Light Toward Pharmacological Treatment of Cognitive and Behavioral Disorders. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1424:97-115. [PMID: 37486484 DOI: 10.1007/978-3-031-31982-2_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/25/2023]
Abstract
Cognitive and behavioral disorders are subgroups of mental health disorders. Both cognitive and behavioral disorders can occur in people of different ages, genders, and social backgrounds, and they can cause serious physical, mental, or social problems. The risk factors for these diseases are numerous, with a range from genetic and epigenetic factors to physical factors. In most cases, the appearance of such a disorder in an individual is a combination of his genetic profile and environmental stimuli. To date, researchers have not been able to identify the specific causes of these disorders, and as such, there is urgent need for innovative study approaches. The aim of the present study was to identify the genetic factors which seem to be more directly responsible for the occurrence of a cognitive and/or behavioral disorder. More specifically, through bioinformatics tools and software as well as analytical methods such as systemic data and text mining, semantic analysis, and scoring functions, we extracted the most relevant single nucleotide polymorphisms (SNPs) and genes connected to these disorders. All the extracted SNPs were filtered, annotated, classified, and evaluated in order to create the "genomic grammar" of these diseases. The identified SNPs guided the search for top suspected genetic factors, dopamine receptors D and neurotrophic factor BDNF, for which regulatory networks were built. The identification of the "genomic grammar" and underlying factors connected to cognitive and behavioral disorders can aid in the successful disease profiling and the establishment of novel pharmacological targets and provide the basis for personalized medicine, which takes into account the patient's genetic background as well as epigenetic factors.
Collapse
Affiliation(s)
- Louis Papageorgiou
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Efstathia Kalospyrou
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Eleni Papakonstantinou
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Io Diakou
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Katerina Pierouli
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Konstantina Dragoumani
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Flora Bacopoulou
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece
| | - George P Chrousos
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece
| | - Themis P Exarchos
- Department of Informatics, Bioinformatics & Human Electrophysiology Laboratory, Ionian University, Corfu, Greece
| | - Panagiotis Vlamos
- Department of Informatics, Bioinformatics & Human Electrophysiology Laboratory, Ionian University, Corfu, Greece
| | - Elias Eliopoulos
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Dimitrios Vlachakis
- Department of Biotechnology, Laboratory of Genetics, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece.
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece.
- Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece.
| |
Collapse
|
21
|
Mitsis T, Papageorgiou L, Papakonstantinou E, Diakou I, Pierouli K, Dragoumani K, Bacopoulou F, Kino T, Chrousos GP, Eliopoulos E, Vlachakis D. A Genomic Study of the Japanese Population Focusing on the Glucocorticoid Receptor Interactome Highlights Distinct Genetic Characteristics Associated with Stress Response. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1423:101-113. [PMID: 37525035 DOI: 10.1007/978-3-031-31978-5_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/02/2023]
Abstract
All living organisms have been programmed to maintain a complex inner equilibrium called homeostasis, despite numerous adversities during their lifespan. Any threatening or perceived as such stimuli for homeostasis is termed a stressor, and a highly conserved response system called the stress response system has been developed to cope with these stimuli and maintain or reinstate homeostasis. The glucocorticoid receptor, a transcription factor belonging to the nuclear receptors protein superfamily, has a major role in the stress response system, and research on its interactome may provide novel information regarding the mechanisms underlying homeostasis maintenance. A list of 149 autosomal genes that have an essential role in GR function or are prime examples of GRE-containing genes was composed in order to gain a comprehensive view of the GR interactome. A search for SNPs on those particular genes was conducted on a dataset of 3554 Japanese individuals, with mentioned polymorphisms being annotated with relevant information from the ClinVar, LitVar, and dbSNP databases. Forty-two SNPs of interest and their genomic locations were identified. These SNPs have been associated with drug metabolism and neuropsychiatric, metabolic, and immune system disorders, while most of them were located in intronic regions. The frequencies of those SNPs were later compared with a dataset consisting of 1465 Korean individuals in order to find population-specific characteristics based on some of the identified SNPs of interest. The results highlighted.that rs1043618 frequencies were different in the two populations, with mentioned polymorphism having a potential role in chronic obstructive pulmonary disease in response to environmental stressors. This SNP is located in the HSPA1A gene, which codes for an essential GR co-chaperone, and such information showcases that similar gene may be novel genomic targets for managing or combatting stress-related pathologies.
Collapse
Affiliation(s)
- Thanasis Mitsis
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Louis Papageorgiou
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Eleni Papakonstantinou
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Io Diakou
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Katerina Pierouli
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Konstantina Dragoumani
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Flora Bacopoulou
- University Research Institute of Maternal and Child Health & Precision Medicine, Athens, Greece
- National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece
| | - Tomoshige Kino
- Department of Human Genetics, Division of Translational Medicine, Sidra Medical and Research Center, Doha, Qatar
| | - George P Chrousos
- University Research Institute of Maternal and Child Health & Precision Medicine, Athens, Greece
- National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece
| | - Elias Eliopoulos
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Dimitrios Vlachakis
- Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece.
- University Research Institute of Maternal and Child Health & Precision Medicine, Athens, Greece.
- National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece.
- Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece.
| |
Collapse
|
22
|
Papageorgiou L, Papakonstantinou E, Diakou I, Pierouli K, Dragoumani K, Bacopoulou F, Chrousos GP, Eliopoulos E, Vlachakis D. Semantic and Population Analysis of the Genetic Targets Related to COVID-19 and Its Association with Genes and Diseases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1423:59-78. [PMID: 37525033 DOI: 10.1007/978-3-031-31978-5_6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/02/2023]
Abstract
SARS-CoV-2 is a coronavirus responsible for one of the most serious, modern worldwide pandemics, with lasting and multifaceted effects. By late 2021, SARS-CoV-2 has infected more than 180 million people and has killed more than 3 million. The virus gains entrance to human cells through binding to ACE2 via its surface spike protein and causes a complex disease of the respiratory system, termed COVID-19. Vaccination efforts are being made to hinder the viral spread, and therapeutics are currently under development. Toward this goal, scientific attention is shifting toward variants and SNPs that affect factors of the disease such as susceptibility and severity. This genomic grammar, tightly related to the dark part of our genome, can be explored through the use of modern methods such as natural language processing. We present a semantic analysis of SARS-CoV-2-related publications, which yielded a repertoire of SNPs, genes, and disease ontologies. Population data from the 1000 Genomes Project were subsequently integrated into the pipeline. Data mining approaches of this scale have the potential to elucidate the complex interaction between COVID-19 pathogenesis and host genetic variation; the resulting knowledge can facilitate the management of high-risk groups and aid the efforts toward precision medicine.
Collapse
Affiliation(s)
- Louis Papageorgiou
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Eleni Papakonstantinou
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Io Diakou
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Katerina Pierouli
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Konstantina Dragoumani
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Flora Bacopoulou
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece
| | - George P Chrousos
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece
| | - Elias Eliopoulos
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Dimitrios Vlachakis
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece.
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, "Aghia Sophia" Children's Hospital, Athens, Greece.
- Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece.
| |
Collapse
|
23
|
TogoVar: A comprehensive Japanese genetic variation database. Hum Genome Var 2022; 9:44. [PMID: 36509753 PMCID: PMC9744889 DOI: 10.1038/s41439-022-00222-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 11/03/2022] [Accepted: 11/07/2022] [Indexed: 12/14/2022] Open
Abstract
TogoVar ( https://togovar.org ) is a database that integrates allele frequencies derived from Japanese populations and provides annotations for variant interpretation. First, a scheme to reanalyze individual-level genome sequence data deposited in the Japanese Genotype-phenotype Archive (JGA), a controlled-access database, was established to make allele frequencies publicly available. As more Japanese individual-level genome sequence data are deposited in JGA, the sample size employed in TogoVar is expected to increase, contributing to genetic study as reference data for Japanese populations. Second, public datasets of Japanese and non-Japanese populations were integrated into TogoVar to easily compare allele frequencies in Japanese and other populations. Each variant detected in Japanese populations was assigned a TogoVar ID as a permanent identifier. Third, these variants were annotated with molecular consequence, pathogenicity, and literature information for interpreting and prioritizing variants. Here, we introduce the newly developed TogoVar database that compares allele frequencies among Japanese and non-Japanese populations and describes the integrated annotations.
Collapse
|
24
|
Xu H, Chen Y, Zhang D. Semantic Interpretation for Convolutional Neural Networks: What Makes a Cat a Cat? ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2022; 9:e2204723. [PMID: 36216585 PMCID: PMC9762288 DOI: 10.1002/advs.202204723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Indexed: 06/16/2023]
Abstract
The interpretability of deep neural networks has attracted increasing attention in recent years, and several methods have been created to interpret the "black box" model. Fundamental limitations remain, however, that impede the pace of understanding the networks, especially the extraction of understandable semantic space. In this work, the framework of semantic explainable artificial intelligence (S-XAI) is introduced, which utilizes a sample compression method based on the distinctive row-centered principal component analysis (PCA) that is different from the conventional column-centered PCA to obtain common traits of samples from the convolutional neural network (CNN), and extracts understandable semantic spaces on the basis of discovered semantically sensitive neurons and visualization techniques. Statistical interpretation of the semantic space is also provided, and the concept of semantic probability is proposed. The experimental results demonstrate that S-XAI is effective in providing a semantic interpretation for the CNN, and offers broad usage, including trustworthiness assessment and semantic sample searching.
Collapse
Affiliation(s)
- Hao Xu
- BIC‐ESATEREand SKLTCSCollege of EngineeringPeking UniversityBeijing100871P. R. China
| | - Yuntian Chen
- Eastern Institute for Advanced StudyYongriver Institute of TechnologyNingboZhejiang315200P. R. China
| | - Dongxiao Zhang
- National Center for Applied Mathematics Shenzhen (NCAMS)Southern University of Science and TechnologyShenzhenGuangdong518055P. R. China
- Department of Mathematics and TheoriesPeng Cheng LaboratoryShenzhenGuangdong518000P. R. China
| |
Collapse
|
25
|
Rodríguez Ruiz N, Abd Own S, Ekström Smedby K, Eloranta S, Koch S, Wästerlid T, Krstic A, Boman M. Data-driven support to decision-making in molecular tumour boards for lymphoma: A design science approach. Front Oncol 2022; 12:984021. [PMID: 36457495 PMCID: PMC9705761 DOI: 10.3389/fonc.2022.984021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 10/03/2022] [Indexed: 09/10/2024] Open
Abstract
Background The increasing amount of molecular data and knowledge about genomic alterations from next-generation sequencing processes together allow for a greater understanding of individual patients, thereby advancing precision medicine. Molecular tumour boards feature multidisciplinary teams of clinical experts who meet to discuss complex individual cancer cases. Preparing the meetings is a manual and time-consuming process. Purpose To design a clinical decision support system to improve the multimodal data interpretation in molecular tumour board meetings for lymphoma patients at Karolinska University Hospital, Stockholm, Sweden. We investigated user needs and system requirements, explored the employment of artificial intelligence, and evaluated the proposed design with primary stakeholders. Methods Design science methodology was used to form and evaluate the proposed artefact. Requirements elicitation was done through a scoping review followed by five semi-structured interviews. We used UML Use Case diagrams to model user interaction and UML Activity diagrams to inform the proposed flow of control in the system. Additionally, we modelled the current and future workflow for MTB meetings and its proposed machine learning pipeline. Interactive sessions with end-users validated the initial requirements based on a fictive patient scenario which helped further refine the system. Results The analysis showed that an interactive secure Web-based information system supporting the preparation of the meeting, multidisciplinary discussions, and clinical decision-making could address the identified requirements. Integrating artificial intelligence via continual learning and multimodal data fusion were identified as crucial elements that could provide accurate diagnosis and treatment recommendations. Impact Our work is of methodological importance in that using artificial intelligence for molecular tumour boards is novel. We provide a consolidated proof-of-concept system that could support the end-to-end clinical decision-making process and positively and immediately impact patients. Conclusion Augmenting a digital decision support system for molecular tumour boards with retrospective patient material is promising. This generates realistic and constructive material for human learning, and also digital data for continual learning by data-driven artificial intelligence approaches. The latter makes the future system adaptable to human bias, improving adequacy and decision quality over time and over tasks, while building and maintaining a digital log.
Collapse
Affiliation(s)
- Núria Rodríguez Ruiz
- Department of Learning, Informatics, Management and Ethics (LIME), Health Informatics Centre, Karolinska Institutet, Stockholm, Sweden
| | - Sulaf Abd Own
- Department of Medicine Solna, Clinical Epidemiology Division, Karolinska Institutet, Stockholm, Sweden
- Department of Laboratory Medicine, Division of Pathology, Karolinska University Hospital Huddinge, Stockholm, Sweden
| | - Karin Ekström Smedby
- Department of Medicine Solna, Clinical Epidemiology Division, Karolinska Institutet, Stockholm, Sweden
- Department of Hematology, Karolinska University Hospital, Stockholm, Sweden
| | - Sandra Eloranta
- Department of Medicine Solna, Clinical Epidemiology Division, Karolinska Institutet, Stockholm, Sweden
| | - Sabine Koch
- Department of Learning, Informatics, Management and Ethics (LIME), Health Informatics Centre, Karolinska Institutet, Stockholm, Sweden
| | - Tove Wästerlid
- Department of Medicine Solna, Clinical Epidemiology Division, Karolinska Institutet, Stockholm, Sweden
- Department of Hematology, Karolinska University Hospital, Stockholm, Sweden
| | - Aleksandra Krstic
- Center for Hematology and Regenerative Medicine, Karolinska Institutet, Stockholm, Sweden
- Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Magnus Boman
- Department of Learning, Informatics, Management and Ethics (LIME), Health Informatics Centre, Karolinska Institutet, Stockholm, Sweden
- School of Electrical Engineering and Computer Science (EECS)/Software and Computer Systems, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
26
|
Tong Y, Tan F, Huang H, Zhang Z, Zong H, Xie Y, Huang D, Cheng S, Wei Z, Fang M, Crabbe MJC, Wang Y, Zhang X. ViMRT: a text-mining tool and search engine for automated virus mutation recognition. Bioinformatics 2022; 39:6808671. [PMID: 36342236 PMCID: PMC9805560 DOI: 10.1093/bioinformatics/btac721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 10/24/2022] [Accepted: 11/04/2022] [Indexed: 11/09/2022] Open
Abstract
MOTIVATION Virus mutation is one of the most important research issues which plays a critical role in disease progression and has prompted substantial scientific publications. Mutation extraction from published literature has become an increasingly important task, benefiting many downstream applications such as vaccine design and drug usage. However, most existing approaches have low performances in extracting virus mutation due to both lack of precise virus mutation information and their development based on human gene mutations. RESULTS We developed ViMRT, a text-mining tool and search engine for automated virus mutation recognition using natural language processing. ViMRT mainly developed 8 optimized rules and 12 regular expressions based on a development dataset comprising 830 papers of 5 human severe disease-related viruses. It achieved higher performance than other tools in a test dataset (1662 papers, 99.17% in F1-score) and has been applied well to two other viruses, influenza virus and severe acute respiratory syndrome coronavirus-2 (212 papers, 96.99% in F1-score). These results indicate that ViMRT is a high-performance method for the extraction of virus mutation from the biomedical literature. Besides, we present a search engine for researchers to quickly find and accurately search virus mutation-related information including virus genes and related diseases. AVAILABILITY AND IMPLEMENTATION ViMRT software is freely available at http://bmtongji.cn:1225/mutation/index.
Collapse
Affiliation(s)
- Yuantao Tong
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Fanglin Tan
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Honglian Huang
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Zeyu Zhang
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Hui Zong
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Yujia Xie
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Danqi Huang
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Shiyang Cheng
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Ziyi Wei
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Meng Fang
- Department of Laboratory Medicine, Shanghai Eastern Hepatobiliary Surgery Hospital, Shanghai 200438, China
| | - M James C Crabbe
- Wolfson College, Oxford University, Oxford OX2 6UD, UK
- Institute of Biomedical and Environmental Science & Technology, University of Bedfordshire, Luton LU1 3JU, UK
- School of Life Sciences, Shanxi University, Taiyuan 030006, China
| | - Ying Wang
- To whom correspondence should be addressed. or
| | | |
Collapse
|
27
|
Grosjean I, Roméo B, Domdom MA, Belaid A, D’Andréa G, Guillot N, Gherardi RK, Gal J, Milano G, Marquette CH, Hung RJ, Landi MT, Han Y, Brest P, Von Bergen M, Klionsky DJ, Amos CI, Hofman P, Mograbi B. Autophagopathies: from autophagy gene polymorphisms to precision medicine for human diseases. Autophagy 2022; 18:2519-2536. [PMID: 35383530 PMCID: PMC9629091 DOI: 10.1080/15548627.2022.2039994] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 01/20/2022] [Accepted: 02/06/2022] [Indexed: 12/15/2022] Open
Abstract
At a time when complex diseases affect globally 280 million people and claim 14 million lives every year, there is an urgent need to rapidly increase our knowledge into their underlying etiologies. Though critical in identifying the people at risk, the causal environmental factors (microbiome and/or pollutants) and the affected pathophysiological mechanisms are not well understood. Herein, we consider the variations of autophagy-related (ATG) genes at the heart of mechanisms of increased susceptibility to environmental stress. A comprehensive autophagy genomic resource is presented with 263 single nucleotide polymorphisms (SNPs) for 69 autophagy-related genes associated with 117 autoimmune, inflammatory, infectious, cardiovascular, neurological, respiratory, and endocrine diseases. We thus propose the term 'autophagopathies' to group together a class of complex human diseases the etiology of which lies in a genetic defect of the autophagy machinery, whether directly related or not to an abnormal flux in autophagy, LC3-associated phagocytosis, or any associated trafficking. The future of precision medicine for common diseases will lie in our ability to exploit these ATG SNP x environment relationships to develop new polygenetic risk scores, new management guidelines, and optimal therapies for afflicted patients.Abbreviations: ATG, autophagy-related; ALS-FTD, amyotrophic lateral sclerosis-frontotemporal dementia; ccRCC, clear cell renal cell carcinoma; CD, Crohn disease; COPD, chronic obstructive pulmonary disease; eQTL, expression quantitative trait loci; HCC, hepatocellular carcinoma; HNSCC, head and neck squamous cell carcinoma; GTEx, genotype-tissue expression; GWAS, genome-wide association studies; LAP, LC3-associated phagocytosis; LC3-II, phosphatidylethanolamine conjugated form of LC3; LD, linkage disequilibrium; LUAD, lung adenocarcinoma; MAF, minor allele frequency; MAP1LC3/LC3: microtubule associated protein 1 light chain 3; NSCLC, non-small cell lung cancer; OS, overall survival; PtdIns3K CIII, class III phosphatidylinositol 3 kinase; PtdIns3P, phosphatidylinositol-3-phosphate; SLE, systemic lupus erythematosus; SNPs, single-nucleotide polymorphisms; mQTL, methylation quantitative trait loci; ULK, unc-51 like autophagy activating kinase; UTRs, untranslated regions; WHO, World Health Organization.
Collapse
Affiliation(s)
- Iris Grosjean
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
| | - Barnabé Roméo
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
| | - Marie-Angela Domdom
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
| | - Amine Belaid
- Université Côte d’Azur (UCA), INSERM U1065, C3M, Team 5, F-06204, France
| | - Grégoire D’Andréa
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
- ENT and Head and Neck surgery department, Institut Universitaire de la Face et du Cou, CHU de Nice, University Hospital, Côte d’Azur University, Nice, France
| | - Nicolas Guillot
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
| | - Romain K Gherardi
- INSERM U955 Team Relais, Faculty of Health, Paris Est University, France
| | - Jocelyn Gal
- University Côte d’Azur, Centre Antoine Lacassagne, Epidemiology and Biostatistics Department, Nice, France
| | - Gérard Milano
- Université Côte d’Azur, Centre Antoine Lacassagne, UPR7497, Nice, France
| | - Charles Hugo Marquette
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
- University Côte d’Azur, FHU-OncoAge, Department of Pulmonary Medicine and Oncology, CHU de Nice, Nice, France
| | - Rayjean J. Hung
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Ontario, Canada; Division of Epidemiology, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | - Maria Teresa Landi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Younghun Han
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA
| | - Patrick Brest
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
| | - Martin Von Bergen
- Helmholtz Centre for Environmental Research GmbH - UFZ, Dep. of Molecular Systems Biology; University of Leipzig, Faculty of Life Sciences, Institute of Biochemistry, Leipzig, Germany
| | - Daniel J. Klionsky
- University of Michigan, Life Sciences Institute, Ann Arbor, MI, 48109, USA
| | - Christopher I. Amos
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA
| | - Paul Hofman
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
- University Côte d’Azur, FHU-OncoAge, CHU de Nice, Laboratory of Clinical and Experimental Pathology (LPCE) Biobank(BB-0033-00025), Nice, France
| | - Baharia Mograbi
- University Côte d’Azur, CNRS, INSERM, IRCAN, FHU-OncoAge, Centre Antoine Lacassagne, France
| |
Collapse
|
28
|
Luo L, Wei CH, Lai PT, Chen Q, Islamaj R, Lu Z. Assigning species information to corresponding genes by a sequence labeling framework. Database (Oxford) 2022; 2022:6760187. [PMID: 36227127 PMCID: PMC9558450 DOI: 10.1093/database/baac090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 08/26/2022] [Accepted: 10/11/2022] [Indexed: 01/24/2023]
Abstract
The automatic assignment of species information to the corresponding genes in a research article is a critically important step in the gene normalization task, whereby a gene mention is normalized and linked to a database record or an identifier by a text-mining algorithm. Existing methods typically rely on heuristic rules based on gene and species co-occurrence in the article, but their accuracy is suboptimal. We therefore developed a high-performance method, using a novel deep learning-based framework, to identify whether there is a relation between a gene and a species. Instead of the traditional binary classification framework in which all possible pairs of genes and species in the same article are evaluated, we treat the problem as a sequence labeling task such that only a fraction of the pairs needs to be considered. Our benchmarking results show that our approach obtains significantly higher performance compared to that of the rule-based baseline method for the species assignment task (from 65.8-81.3% in accuracy). The source code and data for species assignment are freely available. Database URL https://github.com/ncbi/SpeciesAssignment.
Collapse
Affiliation(s)
| | | | - Po-Ting Lai
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Qingyu Chen
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Rezarta Islamaj
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Zhiyong Lu
- *Corresponding author: Tel: +301 594 7089; Fax: +301 480 2288;
| |
Collapse
|
29
|
Setlere S, Jurcenko M, Gailite L, Rots D, Kenina V. Alanyl-tRNA Synthetase 1 Gene Variants in Hereditary Neuropathy. Neurol Genet 2022; 8:e200019. [PMID: 36092982 PMCID: PMC9450682 DOI: 10.1212/nxg.0000000000200019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 07/01/2022] [Indexed: 11/15/2022]
Abstract
Background and ObjectivesOur objective was to report 2 novel variants and to reclassify previously reported alanyl-tRNA synthetase 1 (AARS1) variants associated with hereditary neuropathy and to summarize the clinical features of a previously published cohort of patients.MethodsWe performed detailed neurologic and electrophysiologic assessments and segregation analysis of 2 unrelated families with Charcot-Marie-Tooth (CMT) disease with novel variants in the AARS1 gene. Via literature search, we found studies that included neuropathy cases with AARS1 variants; we then reviewed and reclassified these variants.ResultsWe identified 2 CMT families harboring previously unreported likely pathogenic AARS1 variants: c.1823C>A p.(Thr608Lys) and c.1815C>G p.(His605Gln). In addition, we reinterpreted a total of 35 different AARS1 variants reported in cases with neuropathy from the literature: 9 variants fulfilled the current criteria for being (likely) pathogenic. We compiled and summarized standardized clinical and genotypic information for 90 affected individuals from 32 families with (likely) pathogenic AARS1 variants. Most experienced motor weakness and sensory loss in the lower limbs.DiscussionIn total, 11 AARS1 variants can currently be classified as pathogenic or likely pathogenic and are associated with sensorimotor axonal or intermediate, slowly progressive polyneuropathy with common asymmetry and variable age of symptom onset with no apparent involvement of other organ systems.
Collapse
|
30
|
Nayara Góes de Araújo J, Fernandes de Oliveira V, Bassani Borges J, Dagli-Hernandez C, da Silva Rodrigues Marçal E, Caroline Costa de Freitas R, Medeiros Bastos G, Marques Gonçalves R, Arpad Faludi A, Elim Jannes C, da Costa Pereira A, Dominguez Crespo Hirata R, Hiroyuki Hirata M, Ducati Luchessi A, Nogueira Silbiger V. In silico analysis of upstream variants in Brazilian patients with Familial Hypercholesterolemia. Gene X 2022; 849:146908. [PMID: 36167182 DOI: 10.1016/j.gene.2022.146908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 08/16/2022] [Accepted: 09/19/2022] [Indexed: 10/14/2022] Open
Abstract
Familial hypercholesterolemia (FH) is a prevalent autosomal genetic disease associated with increased risk of early cardiovascular events and death due to chronic exposure to very high levels of low-density lipoprotein cholesterol (LDL-c). Pathogenic variants in the coding regions of LDLR, APOB and PCSK9 account for most FH cases, and variants in non-coding regions maybe involved in FH as well. Variants in the upstream region of LDLR, APOB and PCSK9 were screened by targeted next-generation sequencing and their effects were explored using in silico tools. Twenty-five patients without pathogenic variants in FH-related genes were selected. 3 kb upstream regions of LDLR, APOB and PCSK9 were sequenced using the AmpliSeq (Illumina) and Miseq Reagent Nano Kit v2 (Illumina). Sequencing data were analyzed using variant discovery and functional annotation tools. Potentially regulatory variants were selected by integrating data from public databases, published data and context-dependent regulatory prediction score. Thirty-four single nucleotide variants (SNVs) in upstream regions were identified (6 in LDLR, 15 in APOB, and 13 in PCSK9). Five SNVs were prioritized as potentially regulatory variants (rs934197, rs9282606, rs36218923, rs538300761, g.55038486A>G). APOB rs934197 was previously associated with increased rate of transcription, which in silico analysis suggests that could be due to reducing binding affinity of a transcriptional repressor. Our findings highlight the importance of variant screening outside of coding regions of all relevant genes. Further functional studies are necessary to confirm that prioritized variants could impact gene regulation and contribute to the FH phenotype.
Collapse
Affiliation(s)
- Jéssica Nayara Góes de Araújo
- Northeast Biotechnology Network (RENORBIO), Graduate Program in Biotechnology, Federal University of Rio Grande do Norte, Natal 59078-900, Brazil
| | - Victor Fernandes de Oliveira
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of Sao Paulo, Sao Paulo 05508-000, Brazil
| | - Jéssica Bassani Borges
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of Sao Paulo, Sao Paulo 05508-000, Brazil; Laboratory of Molecular Research in Cardiology, Institute Dante Pazzanese of Cardiology, Sao Paulo, 04012-909, Brazil
| | - Carolina Dagli-Hernandez
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of Sao Paulo, Sao Paulo 05508-000, Brazil
| | | | - Renata Caroline Costa de Freitas
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of Sao Paulo, Sao Paulo 05508-000, Brazil
| | - Gisele Medeiros Bastos
- Laboratory of Molecular Research in Cardiology, Institute Dante Pazzanese of Cardiology, Sao Paulo, 04012-909, Brazil; Medical Clinic Division, Institute Dante Pazzanese of Cardiology, Sao Paulo 04012-909, Brazil
| | | | - André Arpad Faludi
- Medical Clinic Division, Institute Dante Pazzanese of Cardiology, Sao Paulo 04012-909, Brazil
| | - Cinthia Elim Jannes
- Laboratory of Genetics and Molecular Cardiology, Heart Institute, University of Sao Paulo 05403-900, Brazil
| | - Alexandre da Costa Pereira
- Laboratory of Genetics and Molecular Cardiology, Heart Institute, University of Sao Paulo 05403-900, Brazil
| | - Rosario Dominguez Crespo Hirata
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of Sao Paulo, Sao Paulo 05508-000, Brazil
| | - Mario Hiroyuki Hirata
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of Sao Paulo, Sao Paulo 05508-000, Brazil
| | - André Ducati Luchessi
- Northeast Biotechnology Network (RENORBIO), Graduate Program in Biotechnology, Federal University of Rio Grande do Norte, Natal 59078-900, Brazil; Department of Clinical and Toxicological Analyses, Federal University of Rio Grande do Norte, Natal 59012-570, Brazil
| | - Vivian Nogueira Silbiger
- Northeast Biotechnology Network (RENORBIO), Graduate Program in Biotechnology, Federal University of Rio Grande do Norte, Natal 59078-900, Brazil; Department of Clinical and Toxicological Analyses, Federal University of Rio Grande do Norte, Natal 59012-570, Brazil.
| |
Collapse
|
31
|
Luo L, Lai PT, Wei CH, Arighi CN, Lu Z. BioRED: a rich biomedical relation extraction dataset. Brief Bioinform 2022; 23:6645993. [PMID: 35849818 PMCID: PMC9487702 DOI: 10.1093/bib/bbac282] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 06/02/2022] [Accepted: 06/19/2022] [Indexed: 11/13/2022] Open
Abstract
Automated relation extraction (RE) from biomedical literature is critical for many downstream text mining applications in both research and real-world settings. However, most existing benchmarking datasets for biomedical RE only focus on relations of a single type (e.g. protein-protein interactions) at the sentence level, greatly limiting the development of RE systems in biomedicine. In this work, we first review commonly used named entity recognition (NER) and RE datasets. Then, we present a first-of-its-kind biomedical relation extraction dataset (BioRED) with multiple entity types (e.g. gene/protein, disease, chemical) and relation pairs (e.g. gene-disease; chemical-chemical) at the document level, on a set of 600 PubMed abstracts. Furthermore, we label each relation as describing either a novel finding or previously known background knowledge, enabling automated algorithms to differentiate between novel and background information. We assess the utility of BioRED by benchmarking several existing state-of-the-art methods, including Bidirectional Encoder Representations from Transformers (BERT)-based models, on the NER and RE tasks. Our results show that while existing approaches can reach high performance on the NER task (F-score of 89.3%), there is much room for improvement for the RE task, especially when extracting novel relations (F-score of 47.7%). Our experiments also demonstrate that such a rich dataset can successfully facilitate the development of more accurate, efficient and robust RE systems for biomedicine. Availability: The BioRED dataset and annotation guidelines are freely available at https://ftp.ncbi.nlm.nih.gov/pub/lu/BioRED/.
Collapse
Affiliation(s)
- Ling Luo
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Po-Ting Lai
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Chih-Hsuan Wei
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | | | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| |
Collapse
|
32
|
Wei CH, Allot A, Riehle K, Milosavljevic A, Lu Z. tmVar 3.0: an improved variant concept recognition and normalization tool. Bioinformatics 2022; 38:4449-4451. [PMID: 35904569 PMCID: PMC9477515 DOI: 10.1093/bioinformatics/btac537] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 07/07/2022] [Accepted: 07/27/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Previous studies have shown that automated text-mining tools are becoming increasingly important for successfully unlocking variant information in scientific literature at large scale. Despite multiple attempts in the past, existing tools are still of limited recognition scope and precision. RESULT We propose tmVar 3.0: an improved variant recognition and normalization system. Compared to its predecessors, tmVar 3.0 recognizes a wider spectrum of variant-related entities (e.g. allele and copy number variants), and groups together different variant mentions belonging to the same genomic sequence position in an article for improved accuracy. Moreover, tmVar 3.0 provides advanced variant normalization options such as allele-specific identifiers from the ClinGen Allele Registry. tmVar 3.0 exhibits state-of-the-art performance with over 90% in F-measure for variant recognition and normalization, when evaluated on three independent benchmarking datasets. tmVar 3.0 as well as annotations for the entire PubMed and PMC datasets are freely available for download. AVAILABILITY AND IMPLEMENTATION https://github.com/ncbi/tmVar3.
Collapse
Affiliation(s)
- Chih-Hsuan Wei
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Alexis Allot
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| |
Collapse
|
33
|
Xu Q, Liu Y, Hu J, Duan X, Song N, Zhou J, Zhai J, Su J, Liu S, Chen F, Zheng W, Guo Z, Li H, Zhou Q, Niu B. OncoPubMiner: a platform for mining oncology publications. Brief Bioinform 2022; 23:6691792. [PMID: 36058206 DOI: 10.1093/bib/bbac383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 11/12/2022] Open
Abstract
Updated and expert-quality knowledge bases are fundamental to biomedical research. A knowledge base established with human participation and subject to multiple inspections is needed to support clinical decision making, especially in the growing field of precision oncology. The number of original publications in this field has risen dramatically with the advances in technology and the evolution of in-depth research. Consequently, the issue of how to gather and mine these articles accurately and efficiently now requires close consideration. In this study, we present OncoPubMiner (https://oncopubminer.chosenmedinfo.com), a free and powerful system that combines text mining, data structure customisation, publication search with online reading and project-centred and team-based data collection to form a one-stop 'keyword in-knowledge out' oncology publication mining platform. The platform was constructed by integrating all open-access abstracts from PubMed and full-text articles from PubMed Central, and it is updated daily. OncoPubMiner makes obtaining precision oncology knowledge from scientific articles straightforward and will assist researchers in efficiently developing structured knowledge base systems and bring us closer to achieving precision oncology goals.
Collapse
Affiliation(s)
- Quan Xu
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Yueyue Liu
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China.,ChosenMed Gene Technology Co. Ltd., Nanjing, China
| | - Jifang Hu
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China.,Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100190, China
| | - Xiaohong Duan
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China.,ChosenMed Gene Technology Co. Ltd., Nanjing, China
| | - Niuben Song
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Jiale Zhou
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Jincheng Zhai
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Junyan Su
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Siyao Liu
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Fan Chen
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China.,ChosenMed Gene Technology Co. Ltd., Nanjing, China
| | - Wei Zheng
- The Department of Nephrology and Hypertension Medicine, Beijing Electric Power Hospital, Beijing 100073, China
| | - Zhongjia Guo
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Hexiang Li
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China
| | - Qiming Zhou
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China.,ChosenMed Gene Technology Co. Ltd., Nanjing, China
| | - Beifang Niu
- ChosenMed Technology (Beijing) Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing 100176, China.,Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100190, China
| |
Collapse
|
34
|
Liu M, Yang J, Duan H, Yu L, Wu D, Li H. SNPMap—An integrated visual SNP interpretation tool. Front Genet 2022; 13:985500. [PMID: 36061173 PMCID: PMC9437274 DOI: 10.3389/fgene.2022.985500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 07/25/2022] [Indexed: 12/03/2022] Open
Abstract
New technologies, such as next-generation sequencing, have advanced the ability to diagnose diseases and improve prognosis but require the identification of thousands of variants in each report based on several databases scattered across places. Curating an integrated interpretation database is time-consuming, costly, and needs regular update. On the other hand, the automatic curation of knowledge sources always results in overloaded information. In this study, an automated pipeline was proposed to create an integrated visual single-nucleotide polymorphism (SNP) interpretation tool called SNPMap. SNPMap pipelines periodically obtained SNP-related information from LitVar, PubTator, and GWAS Catalog API tools and presented it to the user after extraction, integration, and visualization. Keywords and their semantic relations to each SNP are rendered into two graphs, with their significance represented by the size/width of circles/lines. Moreover, the most related SNPs for each keyword that appeared in SNPMap were calculated and sorted. SNPMap retains the advantage of an automatic process while assisting users in accessing more lucid and detailed information through visualization and integration with other materials.
Collapse
Affiliation(s)
- Miaosen Liu
- School of Medicine, Zhejiang University, Hangzhou, China
| | - Jian Yang
- The College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
| | - Huilong Duan
- The College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
| | - Lan Yu
- The Children’s Hospital, Zhejiang University School of Medicine, National Clinical Research Center, Hangzhou, China
| | - Dingwen Wu
- The Children’s Hospital, Zhejiang University School of Medicine, National Clinical Research Center, Hangzhou, China
| | - Haomin Li
- The Children’s Hospital, Zhejiang University School of Medicine, National Clinical Research Center, Hangzhou, China
- *Correspondence: Haomin Li,
| |
Collapse
|
35
|
Farrokhian N, Maltas J, Dinh M, Durmaz A, Ellsworth P, Hitomi M, McClure E, Marusyk A, Kaznatcheev A, Scott JG. Measuring competitive exclusion in non-small cell lung cancer. SCIENCE ADVANCES 2022; 8:eabm7212. [PMID: 35776787 PMCID: PMC10883359 DOI: 10.1126/sciadv.abm7212] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In this study, we experimentally measure the frequency-dependent interactions between a gefitinib-resistant non-small cell lung cancer population and its sensitive ancestor via the evolutionary game assay. We show that cost of resistance is insufficient to accurately predict competitive exclusion and that frequency-dependent growth rate measurements are required. Using frequency-dependent growth rate data, we then show that gefitinib treatment results in competitive exclusion of the ancestor, while the absence of treatment results in a likely, but not guaranteed, exclusion of the resistant strain. Then, using simulations, we demonstrate that incorporating ecological growth effects can influence the predicted extinction time. In addition, we show that higher drug concentrations may not lead to the optimal reduction in tumor burden. Together, these results highlight the potential importance of frequency-dependent growth rate data for understanding competing populations, both in the laboratory and as we translate adaptive therapy regimens to the clinic.
Collapse
Affiliation(s)
| | - Jeff Maltas
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
| | - Mina Dinh
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
| | | | | | - Masahiro Hitomi
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
| | - Erin McClure
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
| | - Andriy Marusyk
- Department of Cancer Physiology, Moffitt Cancer Center, Tampa, FL, USA
| | - Artem Kaznatcheev
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Jacob G Scott
- CWRU School of Medicine, Cleveland, OH, USA
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
- Department of Radiation Oncology, Cleveland Clinic, Cleveland, OH, USA
| |
Collapse
|
36
|
Li PH, Chen TF, Yu JY, Shih SH, Su CH, Lin YH, Tsai HK, Juan HF, Chen CY, Huang JH. pubmedKB: an interactive web server for exploring biomedical entity relations in the biomedical literature. Nucleic Acids Res 2022; 50:W616-W622. [PMID: 35536289 PMCID: PMC9252824 DOI: 10.1093/nar/gkac310] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 04/06/2022] [Accepted: 04/18/2022] [Indexed: 11/15/2022] Open
Abstract
With the proliferation of genomic sequence data for biomedical research, the exploration of human genetic information by domain experts requires a comprehensive interrogation of large numbers of scientific publications in PubMed. However, a query in PubMed essentially provides search results sorted only by the date of publication. A search engine for retrieving and interpreting complex relations between biomedical concepts in scientific publications remains lacking. Here, we present pubmedKB, a web server designed to extract and visualize semantic relationships between four biomedical entity types: variants, genes, diseases, and chemicals. pubmedKB uses state-of-the-art natural language processing techniques to extract semantic relations from the large number of PubMed abstracts. Currently, over 2 million semantic relations between biomedical entity pairs are extracted from over 33 million PubMed abstracts in pubmedKB. pubmedKB has a user-friendly interface with an interactive semantic graph, enabling the user to easily query entities and explore entity relations. Supporting sentences with the highlighted snippets allow to easily navigate the publications. Combined with a new explorative approach to literature mining and an interactive interface for researchers, pubmedKB thus enables rapid, intelligent searching of the large biomedical literature to provide useful knowledge and insights. pubmedKB is available at https://www.pubmedkb.cc/.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Huai-Kuang Tsai
- Taiwan AI Labs, Taipei 10351, Taiwan.,Institute of Information Science, Academia Sinica, Taipei, 11529, Taiwan
| | - Hsueh-Fen Juan
- Taiwan AI Labs, Taipei 10351, Taiwan.,Department of Life Science, National Taiwan University, Taipei 10617, Taiwan.,Center for Computational and Systems Biology, National Taiwan University, Taipei 10617, Taiwan
| | - Chien-Yu Chen
- Taiwan AI Labs, Taipei 10351, Taiwan.,Center for Computational and Systems Biology, National Taiwan University, Taipei 10617, Taiwan.,Department of Biomechatronics Engineering, National Taiwan University, Taipei, 10617, Taiwan
| | | |
Collapse
|
37
|
Tomar S, Klinzing DC, Chen CK, Gan LH, Moscarello T, Reuter C, Ashley EA, Foo R. Causative Variants for Inherited Cardiac Conditions in a Southeast Asian Population Cohort. Circ Genom Precis Med 2022; 15:e003536. [DOI: 10.1161/circgen.121.003536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Background:
Variable penetrance and late-onset phenotypes are key challenges for classifying causal as well as incidental findings in inherited cardiac conditions. Allele frequencies of variants in ancestry-specific populations, along with clinical variant analysis and interpretation, are critical to determine their true significance.
Methods:
Here, we carefully reviewed and classified variants in genes associated with inherited cardiac conditions based on a population whole-genome sequencing cohort of 4810 Singaporeans representing Southeast Asian ancestries.
Results:
Eighty-nine (1.85%) individuals carried either pathogenic or likely pathogenic variants across 25 genes. Forty-six (51.7%) had variants in causal genes for familial hyperlipidemia, but there were also recurrent variants in
SCN5A
and
MYBPC3
, causal genes for inherited arrhythmia and cardiomyopathy, which, despite previous reports, we determined to lack criteria for pathogenicity.
Conclusions:
Our findings highlight the incidence of disease-related variants in inherited cardiac conditions and emphasize the value of large-scale sequencing in specific ancestries. Follow-up detailed phenotyping and analysis of pedigrees are crucial because assigning pathogenicity will significantly affect clinical management for individuals and their family members.
Collapse
Affiliation(s)
- Swati Tomar
- Cardiovascular Disease Translational Research Programme, Yong Loo Lin School of Medicine, National University Singapore (S.T., D.C.K., C.K.C., L.H.G., R.F.)
- Cardiovascular Research Institute, National University Heart Centre (S.T., D.C.K., C.K.C., L.H.G., R.F.), National University Health System, Singapore
| | - David C. Klinzing
- Cardiovascular Disease Translational Research Programme, Yong Loo Lin School of Medicine, National University Singapore (S.T., D.C.K., C.K.C., L.H.G., R.F.)
- Cardiovascular Research Institute, National University Heart Centre (S.T., D.C.K., C.K.C., L.H.G., R.F.), National University Health System, Singapore
- Khoo Teck Puat National University Children’s Medical Institute (C.K.C.), National University Health System, Singapore
- Department of Pediatrics, Yong Loo Lin School of Medicine, National University Singapore, Singapore (C.K.C.)
| | - Ching Kit Chen
- Cardiovascular Disease Translational Research Programme, Yong Loo Lin School of Medicine, National University Singapore (S.T., D.C.K., C.K.C., L.H.G., R.F.)
- Cardiovascular Research Institute, National University Heart Centre (S.T., D.C.K., C.K.C., L.H.G., R.F.), National University Health System, Singapore
| | - Louis Hanqiang Gan
- Cardiovascular Disease Translational Research Programme, Yong Loo Lin School of Medicine, National University Singapore (S.T., D.C.K., C.K.C., L.H.G., R.F.)
- Cardiovascular Research Institute, National University Heart Centre (S.T., D.C.K., C.K.C., L.H.G., R.F.), National University Health System, Singapore
| | - Tia Moscarello
- Centre for Inherited Cardiovascular Disease, Stanford University Medical Center, CA (T.M., C.R., E.A.A.)
| | - Chloe Reuter
- Centre for Inherited Cardiovascular Disease, Stanford University Medical Center, CA (T.M., C.R., E.A.A.)
| | - Euan A. Ashley
- Centre for Inherited Cardiovascular Disease, Stanford University Medical Center, CA (T.M., C.R., E.A.A.)
| | - Roger Foo
- Cardiovascular Disease Translational Research Programme, Yong Loo Lin School of Medicine, National University Singapore (S.T., D.C.K., C.K.C., L.H.G., R.F.)
- Cardiovascular Research Institute, National University Heart Centre (S.T., D.C.K., C.K.C., L.H.G., R.F.), National University Health System, Singapore
- Genome Institute of Singapore (R.F.)
| |
Collapse
|
38
|
Sadler KV, Rowlands CF, Smith PT, Hartley CL, Bowers NL, Roberts NY, Harris JL, Wallace AJ, Gareth Evans D, Messiaen LM, Smith MJ. Re-evaluation of Missense Variant Classifications in NF2. Hum Mutat 2022; 43:643-654. [PMID: 35332608 PMCID: PMC9323416 DOI: 10.1002/humu.24370] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 02/18/2022] [Accepted: 03/21/2022] [Indexed: 11/20/2022]
Abstract
Missense variants in the NF2 gene result in variable NF2 disease presentation. Clinical classification of missense variants often represents a challenge, due to lack of evidence for pathogenicity and function. This study provides a summary of NF2 missense variants, with variant classifications based on currently available evidence. NF2 missense variants were collated from pathology‐associated databases and existing literature. Association for Clinical Genomic Sciences Best Practice Guidelines (2020) were followed in the application of evidence for variant interpretation and classification. The majority of NF2 missense variants remain classified as variants of uncertain significance. However, NF2 missense variants identified in gnomAD occurred at a consistent rate across the gene, while variants compiled from pathology‐associated databases displayed differing rates of variation by exon of NF2. The highest rate of NF2 disease‐associated variants was observed in exon 7, while lower rates were observed toward the C‐terminus of the NF2 protein, merlin. Further phenotypic information associated with variants, alongside variant‐specific functional analysis, is necessary for more definitive variant interpretation. Our data identified differences in frequency of NF2 missense variants by exon between gnomAD population data and NF2 disease‐associated variants, suggesting a potential genotype‐phenotype correlation; further work is necessary to substantiate this.
Collapse
Affiliation(s)
- Katherine V Sadler
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK.,Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Charlie F Rowlands
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK.,Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Philip T Smith
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK
| | - Claire L Hartley
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK
| | - Naomi L Bowers
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK
| | - Nicola Y Roberts
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK
| | - Jade L Harris
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK
| | - Andrew J Wallace
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK
| | - D Gareth Evans
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK.,Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Ludwine M Messiaen
- Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Miriam J Smith
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester Academic Health Sciences Centre (MAHSC), Manchester, UK.,Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| |
Collapse
|
39
|
Pasche E, Mottaz A, Caucheteur D, Gobeill J, Michel PA, Ruch P. Variomes: a high recall search engine to support the curation of genomic variants. Bioinformatics 2022; 38:2595-2601. [PMID: 35274687 PMCID: PMC9048643 DOI: 10.1093/bioinformatics/btac146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 02/07/2022] [Accepted: 03/10/2022] [Indexed: 12/02/2022] Open
Abstract
Motivation Identification and interpretation of clinically actionable variants is a critical bottleneck. Searching for evidence in the literature is mandatory according to ASCO/AMP/CAP practice guidelines; however, it is both labor-intensive and error-prone. We developed a system to perform triage of publications relevant to support an evidence-based decision. The system is also able to prioritize variants. Our system searches within pre-annotated collections such as MEDLINE and PubMed Central. Results We assess the search effectiveness of the system using three different experimental settings: literature triage; variant prioritization and comparison of Variomes with LitVar. Almost two-thirds of the publications returned in the top-5 are relevant for clinical decision-support. Our approach enabled identifying 81.8% of clinically actionable variants in the top-3. Variomes retrieves on average +21.3% more articles than LitVar and returns the same number of results or more results than LitVar for 90% of the queries when tested on a set of 803 queries; thus, establishing a new baseline for searching the literature about variants. Availability and implementation Variomes is publicly available at https://candy.hesge.ch/Variomes. Source code is freely available at https://github.com/variomes/sibtm-variomes. SynVar is publicly available at https://goldorak.hesge.ch/synvar. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Emilie Pasche
- SIB Text Mining Group, Swiss Institute of Bioinformatics, 1206 Geneva, Switzerland.,BiTeM Group, Information Sciences, 1227 Carouge, Switzerland HES-SO/HEG
| | - Anaïs Mottaz
- SIB Text Mining Group, Swiss Institute of Bioinformatics, 1206 Geneva, Switzerland.,BiTeM Group, Information Sciences, 1227 Carouge, Switzerland HES-SO/HEG
| | - Déborah Caucheteur
- SIB Text Mining Group, Swiss Institute of Bioinformatics, 1206 Geneva, Switzerland.,BiTeM Group, Information Sciences, 1227 Carouge, Switzerland HES-SO/HEG
| | - Julien Gobeill
- SIB Text Mining Group, Swiss Institute of Bioinformatics, 1206 Geneva, Switzerland.,BiTeM Group, Information Sciences, 1227 Carouge, Switzerland HES-SO/HEG
| | - Pierre-André Michel
- SIB Text Mining Group, Swiss Institute of Bioinformatics, 1206 Geneva, Switzerland.,BiTeM Group, Information Sciences, 1227 Carouge, Switzerland HES-SO/HEG
| | - Patrick Ruch
- SIB Text Mining Group, Swiss Institute of Bioinformatics, 1206 Geneva, Switzerland.,BiTeM Group, Information Sciences, 1227 Carouge, Switzerland HES-SO/HEG
| |
Collapse
|
40
|
Acosta-Uribe J, Aguillón D, Cochran JN, Giraldo M, Madrigal L, Killingsworth BW, Singhal R, Labib S, Alzate D, Velilla L, Moreno S, García GP, Saldarriaga A, Piedrahita F, Hincapié L, López HE, Perumal N, Morelo L, Vallejo D, Solano JM, Reiman EM, Surace EI, Itzcovich T, Allegri R, Sánchez-Valle R, Villegas-Lanau A, White CL, Matallana D, Myers RM, Browning SR, Lopera F, Kosik KS. A neurodegenerative disease landscape of rare mutations in Colombia due to founder effects. Genome Med 2022; 14:27. [PMID: 35260199 PMCID: PMC8902761 DOI: 10.1186/s13073-022-01035-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 02/26/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Colombian population, as well as those in other Latin American regions, arose from a recent tri-continental admixture among Native Americans, Spanish invaders, and enslaved Africans, all of whom passed through a population bottleneck due to widespread infectious diseases that left small isolated local settlements. As a result, the current population reflects multiple founder effects derived from diverse ancestries. METHODS We characterized the role of admixture and founder effects on the origination of the mutational landscape that led to neurodegenerative disorders under these historical circumstances. Genomes from 900 Colombian individuals with Alzheimer's disease (AD) [n = 376], frontotemporal lobar degeneration-motor neuron disease continuum (FTLD-MND) [n = 197], early-onset dementia not otherwise specified (EOD) [n = 73], and healthy participants [n = 254] were analyzed. We examined their global and local ancestry proportions and screened this cohort for deleterious variants in disease-causing and risk-conferring genes. RESULTS We identified 21 pathogenic variants in AD-FTLD related genes, and PSEN1 harbored the majority (11 pathogenic variants). Variants were identified from all three continental ancestries. TREM2 heterozygous and homozygous variants were the most common among AD risk genes (102 carriers), a point of interest because the disease risk conferred by these variants differed according to ancestry. Several gene variants that have a known association with MND in European populations had FTLD phenotypes on a Native American haplotype. Consistent with founder effects, identity by descent among carriers of the same variant was frequent. CONCLUSIONS Colombian demography with multiple mini-bottlenecks probably enhanced the detection of founder events and left a proportionally higher frequency of rare variants derived from the ancestral populations. These findings demonstrate the role of genomically defined ancestry in phenotypic disease expression, a phenotypic range of different rare mutations in the same gene, and further emphasize the importance of inclusiveness in genetic studies.
Collapse
Affiliation(s)
- Juliana Acosta-Uribe
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - David Aguillón
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | | | - Margarita Giraldo
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
- Instituto Neurológico de Colombia (INDEC), Medellín, Colombia
| | - Lucía Madrigal
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Bradley W Killingsworth
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Rijul Singhal
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Sarah Labib
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Diana Alzate
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Lina Velilla
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Sonia Moreno
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Gloria P García
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Amanda Saldarriaga
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Francisco Piedrahita
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Liliana Hincapié
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Hugo E López
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Nithesh Perumal
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Leonilde Morelo
- Department of Internal Medicine, School of Medicine, Universidad del Sinú, Montería, Colombia
| | - Dionis Vallejo
- Department of Neurology, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Juan Marcos Solano
- Department of Neurology, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | | | - Ezequiel I Surace
- Laboratorio de Enfermedades Neurodegenerativas (Fleni-CONICET), Buenos Aires, Argentina
| | - Tatiana Itzcovich
- Laboratorio de Enfermedades Neurodegenerativas (Fleni-CONICET), Buenos Aires, Argentina
| | - Ricardo Allegri
- Centro de Memoria y Envejecimiento (Fleni-CONICET), Buenos Aires, Argentina
| | - Raquel Sánchez-Valle
- Alzheimer's Disease and Other Cognitive Disorders Unit, Hospital Clínic de Barcelona, IDIBAPS and University of Barcelona, Barcelona, Spain
| | - Andrés Villegas-Lanau
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Charles L White
- Neuropathology Section, Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Diana Matallana
- Instituto de Envejecimiento, Department of Psychiatry, School of Medicine, Pontifical Xaverian University, Bogotá, Colombia
- Department of Mental Health, Hospital Universitario Santa Fe de Bogotá, Bogotá, Colombia
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Sharon R Browning
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Francisco Lopera
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia.
| | - Kenneth S Kosik
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA.
| |
Collapse
|
41
|
Borchert F, Meister L, Langer T, Follmann M, Arnrich B, Schapranow MP. Controversial Trials First: Identifying Disagreement Between Clinical Guidelines and New Evidence. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022; 2021:237-246. [PMID: 35308948 PMCID: PMC8861732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Clinical guidelines integrate latest evidence to support clinical decision-making. As new research findings are published at an increasing rate, it would be helpful to detect when such results disagree with current guideline recommendations. In this work, we describe a software system for the automatic identification of disagreement between clinical guidelines and published research. A critical feature of the system is the extraction and cross-lingual normalization of information through natural language processing. The initial version focuses on the detection of cancer treatments in clinical trial reports that are not addressed in oncology guidelines. We evaluate the relevance of trials retrieved by our system retrospectively by comparison with historic guideline updates and also prospectively through manual evaluation by guideline experts. The system improves precision over state-of-the-art literature research strategies while maintaining near-total recall. Detailed error analysis highlights challenges for fine-grained clinical information extraction, in particular when extracting population definitions for tumor-agnostic therapies.
Collapse
Affiliation(s)
- Florian Borchert
- Digital Health Center, Hasso Plattner Institute, University of Potsdam, Germany
| | - Laura Meister
- Digital Health Center, Hasso Plattner Institute, University of Potsdam, Germany
| | - Thomas Langer
- German Guideline Program in Oncology, German Cancer Society, Berlin, Germany
| | - Markus Follmann
- German Guideline Program in Oncology, German Cancer Society, Berlin, Germany
| | - Bert Arnrich
- Digital Health Center, Hasso Plattner Institute, University of Potsdam, Germany
| | | |
Collapse
|
42
|
Nonnecke EB, Castillo PA, Johansson MEV, Hollox EJ, Shen B, Lönnerdal B, Bevins CL. Human intelectin-2 (ITLN2) is selectively expressed by secretory Paneth cells. FASEB J 2022; 36:e22200. [PMID: 35182405 PMCID: PMC9262044 DOI: 10.1096/fj.202101870r] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 01/20/2022] [Accepted: 01/26/2022] [Indexed: 01/04/2023]
Abstract
Intelectins (intestinal lectins) are highly conserved across chordate evolution and have been implicated in various human diseases, including Crohn's disease (CD). The human genome encodes two intelectin genes, intelectin-1 (ITLN1) and intelectin-2 (ITLN2). Other than its high sequence similarity with ITLN1, little is known about ITLN2. To address this void in knowledge, we report that ITLN2 exhibits discrete, yet notable differences from ITLN1 in primary structure, including a unique amino terminus, as well as changes in amino acid residues associated with the glycan-binding activity of ITLN1. We identified that ITLN2 is a highly abundant Paneth cell-specific product, which localizes to secretory granules, and is expressed as a multimeric protein in the small intestine. In surgical specimens of ileal CD, ITLN2 mRNA levels were reduced approximately five-fold compared to control specimens. The ileal expression of ITLN2 was unaffected by previously reported disease-associated variants in ITLN2 and CD-associated variants in neighboring ITLN1 as well as NOD2 and ATG16L1. ITLN2 mRNA expression was undetectable in control colon tissue; however, in both ulcerative colitis (UC) and colonic CD, metaplastic Paneth cells were found to express ITLN2. Together, the data reported establish the groundwork for understanding ITLN2 function(s) in the intestine, including its possible role in CD.
Collapse
Affiliation(s)
- Eric B Nonnecke
- Department of Microbiology and Immunology, School of Medicine, University of California, Davis, Davis, California, USA
| | - Patricia A Castillo
- Department of Microbiology and Immunology, School of Medicine, University of California, Davis, Davis, California, USA
| | - Malin E V Johansson
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg, Gothenburg, Sweden
| | - Edward J Hollox
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
| | - Bo Shen
- Department of Gastroenterology, Hepatology and Nutrition, Digestive Diseases and Surgery Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Bo Lönnerdal
- Department of Nutrition, University of California, Davis, Davis, California, USA
| | - Charles L Bevins
- Department of Microbiology and Immunology, School of Medicine, University of California, Davis, Davis, California, USA
| |
Collapse
|
43
|
Bakutenko IY, Haurylchyk ID, Sechko EV, Tchitchko AM, Batyan GM, Sukalo AV, Ryabokon NI. AGER gene variant as a risk factor for juvenile idiopathic arthritis. J Gene Med 2021; 24:e3399. [PMID: 34806241 DOI: 10.1002/jgm.3399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 10/20/2021] [Accepted: 11/04/2021] [Indexed: 11/08/2022] Open
Abstract
BACKGROUND The AGER gene encodes a cell surface multiligand receptor of advanced glycation end-products that is also capable of binding other molecules and is involved in numerous pathways related to inflammation, apoptosis, immunity and so on. In the present study, we aimed to investigate whether the AGER rs1035798 (G>A) intronic polymorphism, showing an association with multiple sclerosis and rheumatoid arthritis in adults, is related to juvenile idiopathic arthritis (JIA). METHODS Caucasian children from the Belarusian population were enrolled in the study. In total, there were 201 cases with JIA, 37 with juvenile systemic lupus erythematosus, 222 children with the articular syndrome of non-autoimmune etiology (positive control for JIA) and 365 negative controls (children without any autoimmune or inflammatory diseases). Genomic DNA samples from the patients and controls were genotyped by a real-time polymerase chain reaction. RESULTS A marked association of the homozygous AA rs1035798 genotype with JIA (p = 5 × 10-4 ) was found. Allele A was also associated with JIA (p = 0.0058), as well as with the articular syndrome of non-autoimmune etiology (p = 0.0264). The highest frequencies of the AA genotype were found in the subgroups of JIA patients with polyarthritis or severe oligoarthritis. The AA genotype patients also had the smallest mean age of the JIA onset. CONCLUSIONS Our results demonstrate that the AGER rs1035798 AA genotype is a risk factor for JIA in Belarusian children. They also suggest a link between the AGER AA genotype and the risk of JIA early onset and severity. However, the functional relevance of the rs1035798 polymorphism is still unclear.
Collapse
Affiliation(s)
- Ivan Yurievich Bakutenko
- Laboratory of Molecular Basis of Genome Stability, Institute of Genetics and Cytology, National Academy of Sciences of Belarus, Minsk, Republic of Belarus
| | - Irena Dmitrievna Haurylchyk
- Laboratory of Molecular Basis of Genome Stability, Institute of Genetics and Cytology, National Academy of Sciences of Belarus, Minsk, Republic of Belarus
| | - Elena Vladimirovna Sechko
- 1st Department of Childhood Diseases, Belarusian State Medical University, Minsk, Republic of Belarus
| | | | - Galina Mihajlovna Batyan
- 1st Department of Childhood Diseases, Belarusian State Medical University, Minsk, Republic of Belarus
| | | | - Nadezhda Ivanovna Ryabokon
- Laboratory of Molecular Basis of Genome Stability, Institute of Genetics and Cytology, National Academy of Sciences of Belarus, Minsk, Republic of Belarus
| |
Collapse
|
44
|
Papageorgiou L, Alkenaris H, Zervou MI, Vlachakis D, Matalliotakis I, Spandidos DA, Bertsias G, Goulielmos GN, Eliopoulos E. Epione application: An integrated web‑toolkit of clinical genomics and personalized medicine in systemic lupus erythematosus. Int J Mol Med 2021; 49:8. [PMID: 34791504 PMCID: PMC8612305 DOI: 10.3892/ijmm.2021.5063] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 11/02/2021] [Indexed: 12/16/2022] Open
Abstract
Genome wide association studies (GWAS) have identified autoimmune disease-associated loci, a number of which are involved in numerous disease-associated pathways. However, much of the underlying genetic and pathophysiological mechanisms remain to be elucidated. Systemic lupus erythematosus (SLE) is a chronic, highly heterogeneous auto-immune disease, characterized by differences in autoantibody profile, serum cytokines and a multi-system involvement. This study presents the Epione application, an integrated bioinformatics web-toolkit, designed to assist medical experts and researchers in more accurately diagnosing SLE. The application aims to identify the most credible gene variants and single nucleotide polymorphisms (SNPs) associated with SLE susceptibility, by using patient's genomic data to aid the medical expert in SLE diagnosis. The application contains useful knowledge of >70,000 SLE-related publications that have been analyzed, using data mining and semantic techniques, towards extracting the SLE-related genes and the corresponding SNPs. Probable genes associated with the patient's genomic profile are visualized with several graphs, including chromosome ideograms, statistic bars and regulatory networks through data mining studies with relative publications, to obtain a representative number of the most credible candidate genes and biological pathways associated with the SLE. Furthermore, an evaluation study was performed on a patient diagnosed with SLE and is presented herein. Epione has also been expanded in family-related candidate patients to evaluate its predictive power. All the recognized gene variants that were previously considered to be associated with SLE were accurately identified in the output profile of the patient, and by comparing the results, novel findings have emerged. The Epione application may assist and facilitate in early stage diagnosis by using the patients' genomic profile to compare against the list of the most predictable candidate gene variants related to SLE. Its diagnosis-oriented output presents the user with a structured set of results on variant association, position in genome and links to specific bibliography and gene network associations. The overall aim of the present study was to provide a reliable tool for the most effective study of SLE. This novel and accessible webserver tool of SLE is available at http://geneticslab.aua.gr/epione/.
Collapse
Affiliation(s)
- Louis Papageorgiou
- Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
| | - Haris Alkenaris
- Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
| | - Maria I Zervou
- Section of Molecular Pathology and Human Genetics, Department of Internal Medicine, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Dimitriοs Vlachakis
- Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
| | - Ioannis Matalliotakis
- Department of Obstetrics and Gynecology, Venizeleio and Pananio General Hospital of Heraklion, 71409 Heraklion, Greece
| | - Demetrios A Spandidos
- Laboratory of Clinical Virology, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - George Bertsias
- Department of Rheumatology and Clinical Immunology, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - George N Goulielmos
- Section of Molecular Pathology and Human Genetics, Department of Internal Medicine, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Elias Eliopoulos
- Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
| |
Collapse
|
45
|
Borchert F, Mock A, Tomczak A, Hügel J, Alkarkoukly S, Knurr A, Volckmar AL, Stenzinger A, Schirmacher P, Debus J, Jäger D, Longerich T, Fröhling S, Eils R, Bougatf N, Sax U, Schapranow MP. Knowledge bases and software support for variant interpretation in precision oncology. Brief Bioinform 2021; 22:bbab134. [PMID: 33971666 PMCID: PMC8574624 DOI: 10.1093/bib/bbab134] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 03/10/2021] [Accepted: 03/30/2021] [Indexed: 12/12/2022] Open
Abstract
Precision oncology is a rapidly evolving interdisciplinary medical specialty. Comprehensive cancer panels are becoming increasingly available at pathology departments worldwide, creating the urgent need for scalable cancer variant annotation and molecularly informed treatment recommendations. A wealth of mainly academia-driven knowledge bases calls for software tools supporting the multi-step diagnostic process. We derive a comprehensive list of knowledge bases relevant for variant interpretation by a review of existing literature followed by a survey among medical experts from university hospitals in Germany. In addition, we review cancer variant interpretation tools, which integrate multiple knowledge bases. We categorize the knowledge bases along the diagnostic process in precision oncology and analyze programmatic access options as well as the integration of knowledge bases into software tools. The most commonly used knowledge bases provide good programmatic access options and have been integrated into a range of software tools. For the wider set of knowledge bases, access options vary across different parts of the diagnostic process. Programmatic access is limited for information regarding clinical classifications of variants and for therapy recommendations. The main issue for databases used for biological classification of pathogenic variants and pathway context information is the lack of standardized interfaces. There is no single cancer variant interpretation tool that integrates all identified knowledge bases. Specialized tools are available and need to be further developed for different steps in the diagnostic process.
Collapse
Affiliation(s)
- Florian Borchert
- Digital Health Center, Hasso Plattner Institute (HPI), University of Potsdam, Prof.-Dr.-Helmert-Str. 2-3, 14482 Potsdam, Germany
| | - Andreas Mock
- Department of Translational Medical Oncology (TMO), National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ) Heidelberg, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
- Department of Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
| | - Aurelie Tomczak
- Institute of Pathology Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 224, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
| | - Jonas Hügel
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Str. 3, 37099 Göttingen, Germany
- Campus Institute Data Science, Göttingen, Germany
| | - Samer Alkarkoukly
- CECAD, Faculty of Medicine and University Hospital Cologne, University of Cologne, Joseph-Stelzmann-Straße 26, 50931 Cologne
| | - Alexander Knurr
- Division of Medical Informatics for Translational Oncology, German Cancer Research Center (DKFZ) Heidelberg, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| | - Anna-Lena Volckmar
- Institute of Pathology Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 224, 69120 Heidelberg, Germany
| | - Albrecht Stenzinger
- Institute of Pathology Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 224, 69120 Heidelberg, Germany
| | - Peter Schirmacher
- Institute of Pathology Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 224, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
| | - Jürgen Debus
- Department of Radiation Oncology, Heidelberg University Hospital, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT), Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
- Clinical Cooperation Unit Radiation Oncology, German Cancer Research Center (DKFZ) Heidelberg, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
- Heidelberg Ion-Beam Therapy Center (HIT), Department of Radiation Oncology, Heidelberg University Hospital, Im Neuenheimer Feld 450, 69120 Heidelberg, Germany
- Heidelberg Institute of Radiation Oncology (HIRO), Heidelberg University Hospital, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany
| | - Dirk Jäger
- Department of Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
- Clinical Coorporation Unit Applied Tumor-Immunity, German Cancer Research Center (DKFZ) Heidelberg, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| | - Thomas Longerich
- Institute of Pathology Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 224, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg, Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
| | - Stefan Fröhling
- Department of Translational Medical Oncology (TMO), National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ) Heidelberg, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
- German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
| | - Roland Eils
- Health Data Science Unit, Heidelberg University Hospital, Im Neuenheimer Feld 267, 69120 Heidelberg, Germany
- Center for Digital Health, Berlin Institute of Health and Charité Universitötsmedizin Berlin, Kapelle-Ufer 2, 10117 Berlin, Germany
| | - Nina Bougatf
- Department of Radiation Oncology, Heidelberg University Hospital, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT), Heidelberg University Hospital, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
- Clinical Cooperation Unit Radiation Oncology, German Cancer Research Center (DKFZ) Heidelberg, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
- Heidelberg Ion-Beam Therapy Center (HIT), Department of Radiation Oncology, Heidelberg University Hospital, Im Neuenheimer Feld 450, 69120 Heidelberg, Germany
- Heidelberg Institute of Radiation Oncology (HIRO), Heidelberg University Hospital, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany
| | - Ulrich Sax
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Str. 3, 37099 Göttingen, Germany
- Campus Institute Data Science, Göttingen, Germany
| | - Matthieu-P Schapranow
- Digital Health Center, Hasso Plattner Institute (HPI), University of Potsdam, Prof.-Dr.-Helmert-Str. 2-3, 14482 Potsdam, Germany
| |
Collapse
|
46
|
Patel MJ, DiStefano MT, Oza AM, Hughes MY, Wilcox EH, Hemphill SE, Cushman BJ, Grant AR, Siegert RK, Shen J, Chapin A, Boczek NJ, Schimmenti LA, Nara K, Kenna M, Azaiez H, Booth KT, Avraham KB, Kremer H, Griffith AJ, Rehm HL, Amr SS, Tayoun ANA. Disease-specific ACMG/AMP guidelines improve sequence variant interpretation for hearing loss. Genet Med 2021; 23:2208-2212. [PMID: 34230634 PMCID: PMC8556313 DOI: 10.1038/s41436-021-01254-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 06/09/2021] [Accepted: 06/10/2021] [Indexed: 12/29/2022] Open
Abstract
PURPOSE The ClinGen Variant Curation Expert Panels (VCEPs) provide disease-specific rules for accurate variant interpretation. Using the hearing loss-specific American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines, the Hearing Loss VCEP (HL VCEP) illustrates the utility of expert specifications in variant interpretation. METHODS A total of 157 variants across nine HL genes, previously submitted to ClinVar, were curated by the HL VCEP. The curation process involved collecting published and unpublished data for each variant by biocurators, followed by bimonthly meetings of an expert curation subgroup that reviewed all evidence and applied the HL-specific ACMG/AMP guidelines to reach a final classification. RESULTS Before expert curation, 75% (117/157) of variants had single or multiple variants of uncertain significance (VUS) submissions (17/157) or had conflicting interpretations in ClinVar (100/157). After applying the HL-specific ACMG/AMP guidelines, 24% (4/17) of VUS and 69% (69/100) of discordant variants were resolved into benign (B), likely benign (LB), likely pathogenic (LP), or pathogenic (P). Overall, 70% (109/157) variants had unambiguous classifications (B, LB, LP, P). We quantify the contribution of the HL-specified ACMG/AMP codes to variant classification. CONCLUSION Expert specification and application of the HL-specific ACMG/AMP guidelines effectively resolved discordant interpretations in ClinVar. This study highlights the utility of ClinGen VCEPs in supporting more consistent clinical variant interpretation.
Collapse
Affiliation(s)
- Mayher J Patel
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Marina T DiStefano
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA,Precision Health Program, Geisinger, Danville, PA, USA
| | - Andrea M Oza
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA,Dept. of Otolaryngology and Communication Enhancement, Boston Children’s Hospital, Boston, MA, USA
| | | | - Emma H Wilcox
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sarah E Hemphill
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA
| | - Brandon J Cushman
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA
| | - Andrew R Grant
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Jun Shen
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA,Harvard Medical School, Boston, MA, USA
| | | | - Nicole J Boczek
- Dept of Laboratory Medicine & Pathology, Mayo Clinic, Rochester, MN, USA
| | - Lisa A Schimmenti
- Department of Otorhinolaryngology, Clinical Genomics and Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, USA
| | - Kiyomitsu Nara
- Division of Hearing and Balance Research, National Institute of Sensory Organs, National Hospital Organization Tokyo Medical Center, Tokyo, Japan
| | - Margaret Kenna
- Dept. of Otolaryngology and Communication Enhancement, Boston Children’s Hospital, Boston, MA, USA,Harvard Medical School, Boston, MA, USA
| | - Hela Azaiez
- Molecular Otolaryngology and Renal Research Laboratories, Department of Otolaryngology, University of Iowa Hospital and Clinics, Iowa City, IA, USA
| | - Kevin T Booth
- Molecular Otolaryngology and Renal Research Laboratories, Department of Otolaryngology, University of Iowa Hospital and Clinics, Iowa City, IA, USA,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Karen B Avraham
- Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine and Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
| | - Hannie Kremer
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
| | - Andrew J. Griffith
- Department of Otolaryngology Head-Neck Surgery, College of Medicine, The University of Tennessee Health Science Center, Memphis, TN, USA
| | - Heidi L Rehm
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA,Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA
| | - Sami S Amr
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA,Harvard Medical School, Boston, MA, USA
| | - Ahmad N Abou Tayoun
- Al Genomics Center, Al Jalila Children’s Specialty Hospital, Dubai, United Arab Emirates,Center for Genomic Discovery, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates
| | | |
Collapse
|
47
|
Kaushik V, Plazzer JP, Winship I, Macrae F. Genetic variant interpretation: a primer for clinicians. Intern Med J 2021; 51:1401-1406. [PMID: 34541770 DOI: 10.1111/imj.15485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Revised: 02/08/2021] [Accepted: 05/31/2021] [Indexed: 11/28/2022]
Abstract
Clinicians beyond specialist genetic services are now able to order tests to interrogate the genetic basis of disease. Behind every genetic report lies a significant body of work that draws on decades of collaboration between clinicians, researchers and database curators. Understanding these advances in genetic variant interpretation may allow practising clinicians to develop a more nuanced appreciation of the role genetic variant interpretation can play in the diagnosis and management of heritable disorders. In this article, we consider genetic variant interpretation with reference to efforts to better understand variation in the mismatch repair genes and their relation to Lynch syndrome - the most common cause of hereditary colon cancer.
Collapse
Affiliation(s)
- Varun Kaushik
- The Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne, Victoria, Australia.,Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Melbourne, Victoria, Australia
| | - John-Paul Plazzer
- Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Melbourne, Victoria, Australia
| | - Ingrid Winship
- Department of Genomic Medicine, The Royal Melbourne Hospital, Melbourne, Victoria, Australia.,Department of Medicine, The University of Melbourne, The Royal Melbourne Hospital, Melbourne, Victoria, Australia
| | - Finlay Macrae
- Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Melbourne, Victoria, Australia.,Department of Medicine, The University of Melbourne, The Royal Melbourne Hospital, Melbourne, Victoria, Australia
| |
Collapse
|
48
|
The effects of genotype on inflammatory response in hippocampal progenitor cells: A computational approach. Brain Behav Immun Health 2021; 15:100286. [PMID: 34345870 PMCID: PMC8261829 DOI: 10.1016/j.bbih.2021.100286] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Accepted: 06/09/2021] [Indexed: 02/08/2023] Open
Abstract
Cell culture models are valuable tools to study biological mechanisms underlying health and disease in a controlled environment. Although their genotype influences their phenotype, subtle genetic variations in cell lines are rarely characterised and taken into account for in vitro studies. To investigate how the genetic makeup of a cell line might affect the cellular response to inflammation, we characterised the single nucleotide variants (SNPs) relevant to inflammation-related genes in an established hippocampal progenitor cell line (HPC0A07/03C) that is frequently used as an in vitro model for hippocampal neurogenesis (HN). SNPs were identified using a genotyping array, and genes associated with chronic inflammatory and neuroinflammatory response gene ontology terms were retrieved using the AmiGO application. SNPs associated with these genes were then extracted from the genotyping dataset, for which a literature search was conducted, yielding relevant research articles for a total of 17 SNPs. Of these variants, 10 were found to potentially affect hippocampal neurogenesis whereby a majority (n=7) is likely to reduce neurogenesis under inflammatory conditions. Taken together, the existing literature seems to suggest that all stages of hippocampal neurogenesis could be negatively affected due to the genetic makeup in HPC0A07/03C cells under inflammation. Additional experiments will be needed to validate these specific findings in a laboratory setting. However, this computational approach already confirms that in vitro studies in general should control for cell lines subtle genetic variations which could mask or exacerbate findings.
Collapse
|
49
|
DeVoe E, Oliver GR, Zenka R, Blackburn PR, Cousin MA, Boczek NJ, Kocher JPA, Urrutia R, Klee EW, Zimmermann MT. P 2T 2: Protein Panoramic annoTation Tool for the interpretation of protein coding genetic variants. JAMIA Open 2021; 4:ooab065. [PMID: 34377961 PMCID: PMC8346652 DOI: 10.1093/jamiaopen/ooab065] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 07/06/2021] [Accepted: 07/17/2021] [Indexed: 11/29/2022] Open
Abstract
MOTIVATION Genomic data are prevalent, leading to frequent encounters with uninterpreted variants or mutations with unknown mechanisms of effect. Researchers must manually aggregate data from multiple sources and across related proteins, mentally translating effects between the genome and proteome, to attempt to understand mechanisms. MATERIALS AND METHODS P2T2 presents diverse data and annotation types in a unified protein-centric view, facilitating the interpretation of coding variants and hypothesis generation. Information from primary sequence, domain, motif, and structural levels are presented and also organized into the first Paralog Annotation Analysis across the human proteome. RESULTS Our tool assists research efforts to interpret genomic variation by aggregating diverse, relevant, and proteome-wide information into a unified interactive web-based interface. Additionally, we provide a REST API enabling automated data queries, or repurposing data for other studies. CONCLUSION The unified protein-centric interface presented in P2T2 will help researchers interpret novel variants identified through next-generation sequencing. Code and server link available at github.com/GenomicInterpretation/p2t2.
Collapse
Affiliation(s)
- Elias DeVoe
- Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, USA
| | - Gavin R Oliver
- Department of Health Science Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Roman Zenka
- Department of Health Science Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
| | - Patrick R Blackburn
- Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, USA
- Center for Individualized Medicine, Mayo Clinic, Jacksonville, Florida, USA
| | - Margot A Cousin
- Department of Health Science Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Nicole J Boczek
- Department of Health Science Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Jean-Pierre A Kocher
- Department of Health Science Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Raul Urrutia
- Genomic Sciences and Precision Medicine Center, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, USA
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin, 53226, USA
| | - Eric W Klee
- Department of Health Science Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Michael T Zimmermann
- Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, USA
- Genomic Sciences and Precision Medicine Center, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, USA
| |
Collapse
|
50
|
Saha SK, Saba AA, Hasib M, Rimon RA, Hasan I, Alam MS, Mahmud I, Nabi AN. Evaluation of D-loop hypervariable region I variations, haplogroups and copy number of mitochondrial DNA in Bangladeshi population with type 2 diabetes. Heliyon 2021; 7:e07573. [PMID: 34377852 PMCID: PMC8327661 DOI: 10.1016/j.heliyon.2021.e07573] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/01/2021] [Accepted: 07/12/2021] [Indexed: 10/24/2022] Open
Abstract
The profound impact of mitochondrion in cellular metabolism has been well documented. Since type 2 diabetes (T2D) is a metabolic disorder, mitochondrial dysfunction is intricately linked with the disease pathogenesis. Mitochondrial DNA (mtDNA) variants are involved with functional dysfunction of mitochondrion and play a pivotal role in the susceptibility to T2D. In this study, we opted to find the association of mtDNA variants within the D-loop hypervariable region I (HVI), haplogroups and mtDNA copy number with T2D in Bangladeshi population. A total of 300 unrelated Bangladeshi individuals (150 healthy and 150 patients with T2D) were recruited in the present study, their HVI regions were amplified and sequenced using Sanger chemistry. Haplogrep2 and Phylotree17 tools were employed to determine the haplogroups. MtDNA copy number was measured using primers of mitochondrial tRNALeu (UUR) gene and nuclear β2-microglobulin gene. Variants G16048A (OR:0.12, p = 0.04) and G16129A (OR: 0.42, p = 0.007) were found to confer protective role against T2D according to logistic regression analysis. However along with G16129A, two new variants C16294T and T16325C demonstrated protective role against T2D when age and gender were adjusted. Haplogroups A and H showed significant association with the risk of T2D after adjustments out of total 19 major haplogroups identified. The mtDNA copy numbers were stratified into 4 groups according to the quartiles (groups with lower, medium, upper and higher mtDNA copy numbers were respectively designated as LCN, MCN, UCN and HCN). Patients with T2D had significantly lower mtDNA copy number compared to their healthy counterparts in HCN group. Moreover, six mtDNA variants were significantly associated with mtDNA copy number in the participants. Thus, our study confers that certain haplogroups and novel variants of mtDNA are significantly associated with T2D while decreased mtDNA copy number (though not significant) has been observed in patients with T2D. However, largescale studies are warranted to establish association of novel variants and haplogroup with type 2 diabetes.
Collapse
Affiliation(s)
- Sajoy Kanti Saha
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| | - Abdullah Al Saba
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| | - Md. Hasib
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| | - Razoan Al Rimon
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| | - Imrul Hasan
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| | - Md. Sohrab Alam
- Department of Immunology, Institute of Research and Rehabilitation in Diabetes, Endocrine and Metabolic Disorders, Shahbagh, Dhaka, Bangladesh
| | - Ishtiaq Mahmud
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| | - A.H.M. Nurun Nabi
- Laboratory of Population Genetics, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh
| |
Collapse
|