Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

45
(from Reference Citation Analysis)

Article PDFs (13)

Cited by > 0 (33)

Searched Name

phenotype prediction

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Hu K, Meyer F, Deng ZL, Asgari E, Kuo TH, Münch PC, McHardy AC. Assessing computational predictions of antimicrobial resistance phenotypes from microbial genomes. Brief Bioinform 2024;25:bbae206. [PMID: 38706320 PMCID: PMC11070729 DOI: 10.1093/bib/bbae206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 04/08/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open

Abstract

The advent of rapid whole-genome sequencing has created new opportunities for computational prediction of antimicrobial resistance (AMR) phenotypes from genomic data. Both rule-based and machine learning (ML) approaches have been explored for this task, but systematic benchmarking is still needed. Here, we evaluated four state-of-the-art ML methods (Kover, PhenotypeSeeker, Seq2Geno2Pheno and Aytan-Aktug), an ML baseline and the rule-based ResFinder by training and testing each of them across 78 species-antibiotic datasets, using a rigorous benchmarking workflow that integrates three evaluation approaches, each paired with three distinct sample splitting methods. Our analysis revealed considerable variation in the performance across techniques and datasets. Whereas ML methods generally excelled for closely related strains, ResFinder excelled for handling divergent genomes. Overall, Kover most frequently ranked top among the ML approaches, followed by PhenotypeSeeker and Seq2Geno2Pheno. AMR phenotypes for antibiotic classes such as macrolides and sulfonamides were predicted with the highest accuracies. The quality of predictions varied substantially across species-antibiotic combinations, particularly for beta-lactams; across species, resistance phenotyping of the beta-lactams compound, aztreonam, amoxicillin/clavulanic acid, cefoxitin, ceftazidime and piperacillin/tazobactam, alongside tetracyclines demonstrated more variable performance than the other benchmarked antibiotics. By organism, Campylobacter jejuni and Enterococcus faecium phenotypes were more robustly predicted than those of Escherichia coli, Staphylococcus aureus, Salmonella enterica, Neisseria gonorrhoeae, Klebsiella pneumoniae, Pseudomonas aeruginosa, Acinetobacter baumannii, Streptococcus pneumoniae and Mycobacterium tuberculosis. In addition, our study provides software recommendations for each species-antibiotic combination. It furthermore highlights the need for optimization for robust clinical applications, particularly for strains that diverge substantially from those used for training.

Collapse

Affiliation(s)

Kaixin Hu Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
Fernando Meyer Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
Zhi-Luo Deng Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
Ehsaneddin Asgari Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Molecular Cell Biomechanics Laboratory, Department of Bioengineering and Mechanical Engineering, University of California, Berkeley, USA
Tzu-Hao Kuo Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
Philipp C Münch Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany German Center for Infection Research (DZIF), partner site Hannover Braunschweig, Braunschweig, Germany Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
Alice C McHardy Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany

Collapse

Kolobkov D, Mishra Sharma S, Medvedev A, Lebedev M, Kosaretskiy E, Vakhitov R. Efficacy of federated learning on genomic data: a study on the UK Biobank and the 1000 Genomes Project. Front Big Data 2024;7:1266031. [PMID: 38487517 PMCID: PMC10937521 DOI: 10.3389/fdata.2024.1266031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 01/31/2024] [Indexed: 03/17/2024] Open

Brouard C, Mourad R, Vialaneix N. Should we really use graph neural networks for transcriptomic prediction? Brief Bioinform 2024;25:bbae027. [PMID: 38349060 PMCID: PMC10939369 DOI: 10.1093/bib/bbae027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 12/20/2023] [Accepted: 01/17/2024] [Indexed: 02/15/2024] Open

Bonet D, Levin M, Montserrat DM, Ioannidis AG. Machine Learning Strategies for Improved Phenotype Prediction in Underrepresented Populations. Pac Symp Biocomput 2024;29:404-418. [PMID: 38160295 PMCID: PMC10799683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]

Comajoan Cara M, Mas Montserrat D, Ioannidis AG. PopGenAdapt: Semi-Supervised Domain Adaptation for Genotype-to-Phenotype Prediction in Underrepresented Populations. Pac Symp Biocomput 2024;29:327-340. [PMID: 38160290 PMCID: PMC10906137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]

Cara MC, Montserrat DM, Ioannidis AG. PopGenAdapt: Semi-Supervised Domain Adaptation for Genotype-to-Phenotype Prediction in Underrepresented Populations. bioRxiv 2023:2023.10.10.561715. [PMID: 37873492 PMCID: PMC10592760 DOI: 10.1101/2023.10.10.561715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]

Ruan M, Hu Z, Zhu Q, Li Y, Nie X. 16S rDNA Sequencing-Based Insights into the Bacterial Community Structure and Function in Co-Existing Soil and Coal Gangue. Microorganisms 2023;11:2151. [PMID: 37763995 PMCID: PMC10536285 DOI: 10.3390/microorganisms11092151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 08/16/2023] [Accepted: 08/21/2023] [Indexed: 09/29/2023] Open

Aspromonte MC, Conte AD, Zhu S, Tan W, Shen Y, Zhang Y, Li Q, Wang MH, Babbi G, Bovo S, Martelli PL, Casadio R, Althagafi A, Toonsi S, Kulmanov M, Hoehndorf R, Katsonis P, Williams A, Lichtarge O, Xian S, Surento W, Pejaver V, Mooney SD, Sunderam U, Srinivasan R, Murgia A, Piovesan D, Tosatto SCE, Leonardi E. CAGI6 ID-Challenge: Assessment of phenotype and variant predictions in 415 children with Neurodevelopmental Disorders (NDDs). Res Sq 2023:rs.3.rs-3209168. [PMID: 37577579 PMCID: PMC10418555 DOI: 10.21203/rs.3.rs-3209168/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]

Affiliation(s)

Maria Cristina Aspromonte Department of Biomedical Sciences, University of Padova
Alessio Del Conte Department of Biomedical Sciences, University of Padova
Shaowen Zhu Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
Wuwei Tan Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
Yang Shen Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
Yexian Zhang CUHK Shenzhen Research Institute, Shenzhen
Qi Li CUHK Shenzhen Research Institute, Shenzhen
Maggie Haitian Wang CUHK Shenzhen Research Institute, Shenzhen
Giulia Babbi Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
Samuele Bovo Department of Agricultural and Food Sciences, University of Bologna
Pier Luigi Martelli Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
Rita Casadio Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
Azza Althagafi Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Sumyyah Toonsi Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Maxat Kulmanov Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Robert Hoehndorf Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Panagiotis Katsonis Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
Amanda Williams Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
Olivier Lichtarge Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
Su Xian Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
Wesley Surento Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
Vikas Pejaver Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY 10029
Sean D Mooney Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
Uma Sunderam Innovation Labs, Tata Consultancy Services, Hyderabad
Rajgopal Srinivasan Innovation Labs, Tata Consultancy Services, Hyderabad
Alessandra Murgia Department of Women's and Children's Health, University of Padova
Damiano Piovesan Department of Biomedical Sciences, University of Padova
Silvio C E Tosatto Department of Biomedical Sciences, University of Padova
Emanuela Leonardi Department of Biomedical Sciences, University of Padova

Collapse

Mowlaei ME, Shi X. FSF-GA: A Feature Selection Framework for Phenotype Prediction Using Genetic Algorithms. Genes (Basel) 2023;14:genes14051059. [PMID: 37239419 DOI: 10.3390/genes14051059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 05/04/2023] [Accepted: 05/06/2023] [Indexed: 05/28/2023] Open

Chen Y, Guo Y, Guan P, Wang Y, Wang X, Wang Z, Qin Z, Ma S, Xin M, Hu Z, Yao Y, Ni Z, Sun Q, Guo W, Peng H. A wheat integrative regulatory network from large-scale complementary functional datasets enables trait-associated gene discovery for crop improvement. Mol Plant 2023;16:393-414. [PMID: 36575796 DOI: 10.1016/j.molp.2022.12.019] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 11/28/2022] [Accepted: 12/18/2022] [Indexed: 06/17/2023]

Affiliation(s)

Yongming Chen Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Yiwen Guo Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Panfeng Guan Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Yongfa Wang Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Xiaobo Wang Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Zihao Wang Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Zhen Qin Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Shengwei Ma Hainan Yazhou Bay Seed Laboratory, Sanya, Hainan, China; State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
Mingming Xin Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Zhaorong Hu Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Yingyin Yao Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Zhongfu Ni Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Qixin Sun Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Weilong Guo Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
Huiru Peng Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.

Collapse

Forutan M, Lynn A, Aliloo H, Clark SA, McGilchrist P, Polkinghorne R, Hayes BJ. Predicting phenotypes of beef eating quality traits. Front Genet 2023;14:1089490. [PMID: 36816029 PMCID: PMC9936823 DOI: 10.3389/fgene.2023.1089490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 01/19/2023] [Indexed: 02/04/2023] Open

Abstract

Introduction: Phenotype predictions of beef eating quality for individual animals could be used to allocate animals to longer and more expensive feeding regimes as they enter the feedlot if they are predicted to have higher eating quality, and to sort carcasses into consumer or market value categories. Phenotype predictions can include genetic effects (breed effects, heterosis and breeding value), predicted from genetic markers, as well as fixed effects such as days aged and carcass weight, hump height, ossification, and hormone growth promotant (HGP) status. Methods: Here we assessed accuracy of phenotype predictions for five eating quality traits (tenderness, juiciness, flavour, overall liking and MQ4) in striploins from 1701 animals from a wide variety of backgrounds, including Bos indicus and Bos taurus breeds, using genotypes and simple fixed effects including days aged and carcass weight. The genetic components were predicted based on 709k single nucleotide polymorphism (SNP) using BayesR model, which assumes some markers may have a moderate to large effect. Fixed effects in the prediction included principal components of the genomic relationship matrix, to account for breed effects, heterosis, days aged and carcass weight. Results and Discussion: A model which allowed breed effects to be captured in the SNP effects (e.g., not explicitly fitting these effects) tended to have slightly higher accuracies (0.43-0.50) compared to when these effects were explicitly fitted as fixed effects (0.42-0.49), perhaps because breed effects when explicitly fitted were estimated with more error than when incorporated into the (random) SNP effects. Adding estimates of effects of days aged and carcass weight did not increase the accuracy of phenotype predictions in this particular analysis. The accuracy of phenotype prediction for beef eating quality traits was sufficiently high that such predictions could be useful in predicting eating quality from DNA samples taken from an animal/carcass as it enters the processing plant, to enable optimal supply chain value extraction by sorting product into markets with different quality. The BayesR predictions identified several novel genes potentially associated with beef eating quality.

Collapse

John M, Haselbeck F, Dass R, Malisi C, Ricca P, Dreischer C, Schultheiss SJ, Grimm DG. A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species. Front Plant Sci 2022;13:932512. [PMID: 36407627 PMCID: PMC9673477 DOI: 10.3389/fpls.2022.932512] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 07/25/2022] [Indexed: 06/16/2023]

Jiang Z, Lu Y, Liu Z, Wu W, Xu X, Dinnyés A, Yu Z, Chen L, Sun Q. Drug resistance prediction and resistance genes identification in Mycobacterium tuberculosis based on a hierarchical attentive neural network utilizing genome-wide variants. Brief Bioinform 2022;23:6553603. [PMID: 35325021 DOI: 10.1093/bib/bbac041] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 01/18/2022] [Accepted: 01/27/2022] [Indexed: 01/25/2023] Open

Dinh JC, Boone EC, Staggs VS, Pearce RE, Wang WY, Gaedigk R, Leeder JS, Gaedigk A. The Impact of the CYP2D6 "Enhancer" Single Nucleotide Polymorphism on CYP2D6 Activity. Clin Pharmacol Ther 2022;111:646-654. [PMID: 34716917 PMCID: PMC8825689 DOI: 10.1002/cpt.2469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 10/21/2021] [Indexed: 11/10/2022]

Diepenbroek M, Bayer B, Anslinger K. Pushing the Boundaries: Forensic DNA Phenotyping Challenged by Single-Cell Sequencing. Genes (Basel) 2021;12:genes12091362. [PMID: 34573344 PMCID: PMC8466929 DOI: 10.3390/genes12091362] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 08/24/2021] [Accepted: 08/27/2021] [Indexed: 12/26/2022] Open

Raghu VK, Ge X, Balajiee A, Shirer DJ, Das I, Benos PV, Chrysanthis PK. A Pipeline for Integrated Theory and Data-Driven Modeling of Biomedical Data. IEEE/ACM Trans Comput Biol Bioinform 2021;18:811-822. [PMID: 32841121 PMCID: PMC8237279 DOI: 10.1109/tcbb.2020.3019237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Song K, Wright FA, Zhou YH. Systematic Comparisons for Composition Profiles, Taxonomic Levels, and Machine Learning Methods for Microbiome-Based Disease Prediction. Front Mol Biosci 2020;7:610845. [PMID: 33392266 PMCID: PMC7772236 DOI: 10.3389/fmolb.2020.610845] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2020] [Accepted: 11/25/2020] [Indexed: 12/12/2022] Open

Diepenbroek M, Bayer B, Schwender K, Schiller R, Lim J, Lagacé R, Anslinger K. Evaluation of the Ion AmpliSeq™ PhenoTrivium Panel: MPS-Based Assay for Ancestry and Phenotype Predictions Challenged by Casework Samples. Genes (Basel) 2020;11:E1398. [PMID: 33255693 DOI: 10.3390/genes11121398] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 11/19/2020] [Accepted: 11/22/2020] [Indexed: 12/21/2022] Open

Pook T, Freudenthal J, Korte A, Simianer H. Using Local Convolutional Neural Networks for Genomic Prediction. Front Genet 2020;11:561497. [PMID: 33281867 PMCID: PMC7689358 DOI: 10.3389/fgene.2020.561497] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 10/12/2020] [Indexed: 11/18/2022] Open

Abstract

The prediction of breeding values and phenotypes is of central importance for both livestock and crop breeding. In this study, we analyze the use of artificial neural networks (ANN) and, in particular, local convolutional neural networks (LCNN) for genomic prediction, as a region-specific filter corresponds much better with our prior genetic knowledge on the genetic architecture of traits than traditional convolutional neural networks. Model performances are evaluated on a simulated maize data panel (n = 10,000; p = 34,595) and real Arabidopsis data (n = 2,039; p = 180,000) for a variety of traits based on their predictive ability. The baseline LCNN, containing one local convolutional layer (kernel size: 10) and two fully connected layers with 64 nodes each, is outperforming commonly proposed ANNs (multi layer perceptrons and convolutional neural networks) for basically all considered traits. For traits with high heritability and large training population as present in the simulated data, LCNN are even outperforming state-of-the-art methods like genomic best linear unbiased prediction (GBLUP), Bayesian models and extended GBLUP, indicated by an increase in predictive ability of up to 24%. However, for small training populations, these state-of-the-art methods outperform all considered ANNs. Nevertheless, the LCNN still outperforms all other considered ANNs by around 10%. Minor improvements to the tested baseline network architecture of the LCNN were obtained by increasing the kernel size and of reducing the stride, whereas the number of subsequent fully connected layers and their node sizes had neglectable impact. Although gains in predictive ability were obtained for large scale data sets by using LCNNs, the practical use of ANNs comes with additional problems, such as the need of genotyping all considered individuals, the lack of estimation of heritability and reliability. Furthermore, breeding values are additive by design, whereas ANN-based estimates are not. However, ANNs also comes with new opportunities, as networks can easily be extended to account for additional inputs (omics, weather etc.) and outputs (multi-trait models), and computing time increases linearly with the number of individuals. With advances in high-throughput phenotyping and cheaper genotyping, ANNs can become a valid alternative for genomic prediction.

Collapse

Lees JA, Mai TT, Galardini M, Wheeler NE, Horsfield ST, Parkhill J, Corander J. Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions. mBio 2020;11:e01344-20. [PMID: 32636251 PMCID: PMC7343994 DOI: 10.1128/mbio.01344-20] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 06/05/2020] [Indexed: 12/19/2022] Open

Abstract

Discovery of genetic variants underlying bacterial phenotypes and the prediction of phenotypes such as antibiotic resistance are fundamental tasks in bacterial genomics. Genome-wide association study (GWAS) methods have been applied to study these relations, but the plastic nature of bacterial genomes and the clonal structure of bacterial populations creates challenges. We introduce an alignment-free method which finds sets of loci associated with bacterial phenotypes, quantifies the total effect of genetics on the phenotype, and allows accurate phenotype prediction, all within a single computationally scalable joint modeling framework. Genetic variants covering the entire pangenome are compactly represented by extended DNA sequence words known as unitigs, and model fitting is achieved using elastic net penalization, an extension of standard multiple regression. Using an extensive set of state-of-the-art bacterial population genomic data sets, we demonstrate that our approach performs accurate phenotype prediction, comparable to popular machine learning methods, while retaining both interpretability and computational efficiency. Compared to those of previous approaches, which test each genotype-phenotype association separately for each variant and apply a significance threshold, the variants selected by our joint modeling approach overlap substantially.IMPORTANCE Being able to identify the genetic variants responsible for specific bacterial phenotypes has been the goal of bacterial genetics since its inception and is fundamental to our current level of understanding of bacteria. This identification has been based primarily on painstaking experimentation, but the availability of large data sets of whole genomes with associated phenotype metadata promises to revolutionize this approach, not least for important clinical phenotypes that are not amenable to laboratory analysis. These models of phenotype-genotype association can in the future be used for rapid prediction of clinically important phenotypes such as antibiotic resistance and virulence by rapid-turnaround or point-of-care tests. However, despite much effort being put into adapting genome-wide association study (GWAS) approaches to cope with bacterium-specific problems, such as strong population structure and horizontal gene exchange, current approaches are not yet optimal. We describe a method that advances methodology for both association and generation of portable prediction models.

Collapse

Chun S, Imakaev M, Hui D, Patsopoulos NA, Neale BM, Kathiresan S, Stitziel NO, Sunyaev SR. Non-parametric Polygenic Risk Prediction via Partitioned GWAS Summary Statistics. Am J Hum Genet 2020;107:46-59. [PMID: 32470373 PMCID: PMC7332650 DOI: 10.1016/j.ajhg.2020.05.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Accepted: 05/01/2020] [Indexed: 02/07/2023] Open

Affiliation(s)

Sung Chun Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
Maxim Imakaev Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
Daniel Hui Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Systems Biology and Computer Science Program, Ann Romney Center for Neurological Diseases, Department of Neurology, Brigham & Women's Hospital, Boston, MA 02115, USA
Nikolaos A Patsopoulos Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Systems Biology and Computer Science Program, Ann Romney Center for Neurological Diseases, Department of Neurology, Brigham & Women's Hospital, Boston, MA 02115, USA
Benjamin M Neale Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
Sekar Kathiresan Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA; Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA
Nathan O Stitziel Cardiovascular Division, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA; Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110, USA; McDonnell Genome Institute, Washington University School of Medicine, Saint Louis, MO 63110, USA.
Shamil R Sunyaev Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA.

Collapse

Livesey BJ, Marsh JA. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol Syst Biol 2020;16:e9380. [PMID: 32627955 PMCID: PMC7336272 DOI: 10.15252/msb.20199380] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Revised: 05/18/2020] [Accepted: 05/26/2020] [Indexed: 12/23/2022] Open

Palencia-Madrid L, Xavier C, de la Puente M, Hohoff C, Phillips C, Kayser M, Parson W. Evaluation of the VISAGE Basic Tool for Appearance and Ancestry Prediction Using PowerSeq Chemistry on the MiSeq FGx System. Genes (Basel) 2020;11:E708. [PMID: 32604780 DOI: 10.3390/genes11060708] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 06/09/2020] [Accepted: 06/11/2020] [Indexed: 01/23/2023] Open

Liu YH, Xu Y, Zhang M, Cui Y, Sze SH, Smith CW, Xu S, Zhang HB. Accurate Prediction of a Quantitative Trait Using the Genes Controlling the Trait for Gene-Based Breeding in Cotton. Front Plant Sci 2020;11:583277. [PMID: 33281846 PMCID: PMC7690289 DOI: 10.3389/fpls.2020.583277] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 10/15/2020] [Indexed: 05/03/2023]

Carraro M, Monzon AM, Chiricosta L, Reggiani F, Aspromonte MC, Bellini M, Pagel K, Jiang Y, Radivojac P, Kundu K, Pal LR, Yin Y, Limongelli I, Andreoletti G, Moult J, Wilson SJ, Katsonis P, Lichtarge O, Chen J, Wang Y, Hu Z, Brenner SE, Ferrari C, Murgia A, Tosatto SC, Leonardi E. Assessment of patient clinical descriptions and pathogenic variants from gene panel sequences in the CAGI-5 intellectual disability challenge. Hum Mutat 2019;40:1330-1345. [PMID: 31144778 PMCID: PMC7341177 DOI: 10.1002/humu.23823] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 05/07/2019] [Accepted: 05/27/2019] [Indexed: 12/15/2022]

Affiliation(s)

Marco Carraro Department of Biomedical Sciences, University of Padua, Padua, Italy
Alexander Miguel Monzon Department of Biomedical Sciences, University of Padua, Padua, Italy
Luigi Chiricosta Department of Biomedical Sciences, University of Padua, Padua, Italy
Francesco Reggiani Department of Biomedical Sciences, University of Padua, Padua, Italy Department of Information Engineering, University of Padua, Padua, Italy
Maria Cristina Aspromonte Department of Woman and Child Health, University of Padua, Padua, Italy
Mariagrazia Bellini Department of Woman and Child Health, University of Padua, Padua, Italy Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
Kymberleigh Pagel Khoury College of Computer and Information Sciences, Northeastern University, 440, Huntington Avenue, Boston, MA 02115, USA
Yuxiang Jiang Khoury College of Computer and Information Sciences, Northeastern University, 440, Huntington Avenue, Boston, MA 02115, USA
Predrag Radivojac Khoury College of Computer and Information Sciences, Northeastern University, 440, Huntington Avenue, Boston, MA 02115, USA
Kunal Kundu Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, MD 20742, USA
Lipika R. Pal Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
Yizhou Yin Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, MD 20742, USA
Ivan Limongelli enGenome srl, via Ferrata 5, Pavia, Italy
Gaia Andreoletti Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
John Moult Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
Stephen J. Wilson Baylor College of Medicine, Department of Molecular and Human Genetics, Houston, TX 77030, USA
Panagiotis Katsonis Baylor College of Medicine, Department of Molecular and Human Genetics, Houston, TX 77030, USA
Olivier Lichtarge Baylor College of Medicine, Department of Molecular and Human Genetics, Houston, TX 77030, USA
Jingqi Chen Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
Yaqiong Wang Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
Zhiqiang Hu Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
Steven E. Brenner Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
Carlo Ferrari Department of Information Engineering, University of Padua, Padua, Italy
Alessandra Murgia Department of Woman and Child Health, University of Padua, Padua, Italy Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
Silvio C.E. Tosatto Department of Biomedical Sciences, University of Padua, Padua, Italy CNR Institute of Neuroscience, Padua, Italy
Emanuela Leonardi Department of Woman and Child Health, University of Padua, Padua, Italy Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy

Collapse

Kasak L, Bakolitsa C, Hu Z, Yu C, Rine J, Dimster-Denk DF, Pandey G, Baets GD, Bromberg Y, Cao C, Capriotti E, Casadio R, Durme JV, Giollo M, Karchin R, Katsonis P, Leonardi E, Lichtarge O, Martelli PL, Masica D, Mooney SD, Olatubosun A, Pal LR, Radivojac P, Rousseau F, Savojardo C, Schymkowitz J, Thusberg J, Tosatto SC, Vihinen M, Väliaho J, Repo S, Moult J, Brenner SE, Friedberg I. Assessing computational predictions of the phenotypic effect of cystathionine-beta-synthase variants. Hum Mutat 2019;40:1530-1545. [PMID: 31301157 PMCID: PMC7325732 DOI: 10.1002/humu.23868] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Revised: 06/22/2019] [Accepted: 07/09/2019] [Indexed: 12/28/2022]

Affiliation(s)

Laura Kasak Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
Constantina Bakolitsa Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Zhiqiang Hu Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Changhua Yu Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Jasper Rine California Institute for Quantitative Biosciences, University of California, Berkeley, CA, USA
Dago F. Dimster-Denk California Institute for Quantitative Biosciences, University of California, Berkeley, CA, USA
Gaurav Pandey Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Greet De Baets Switch Laboratory, VIB Center for Brain and Disease Research, Leuven, Belgium Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium
Yana Bromberg Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, USA
Chen Cao Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, USA Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, MD, USA
Emidio Capriotti Department of Bioengineering, Stanford University, Stanford, CA, USA
Rita Casadio Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
Joost Van Durme Switch Laboratory, VIB Center for Brain and Disease Research, Leuven, Belgium Vrije Universiteit Brussel, Brussels, Belgium
Manuel Giollo Department of Biomedical Sciences, University of Padua, Padua, Italy
Rachel Karchin Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD, USA
Panagiotis Katsonis Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
Emanuela Leonardi Department for Woman and Child Health, University of Padua, Italy
Olivier Lichtarge Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
Pier Luigi Martelli Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
David Masica Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD, USA
Sean D. Mooney Buck Institute for Research on Aging, Novato, CA, USA
Ayodeji Olatubosun Institute of Medical Technology, University of Tampere, Tampere, Finland
Lipika R. Pal Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, USA
Predrag Radivojac School of Informatics and Computing, Indiana University, Bloomington, IN, USA
Frederic Rousseau Switch Laboratory, VIB Center for Brain and Disease Research, Leuven, Belgium Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium
Castrense Savojardo Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
Joost Schymkowitz Switch Laboratory, VIB Center for Brain and Disease Research, Leuven, Belgium Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium
Janita Thusberg Buck Institute for Research on Aging, Novato, CA, USA
Silvio C.E. Tosatto Department of Biomedical Sciences, University of Padua, Padua, Italy
Mauno Vihinen Institute of Medical Technology, University of Tampere, Tampere, Finland
Jouni Väliaho Institute of Medical Technology, University of Tampere, Tampere, Finland
Susanna Repo Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
John Moult Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
Steven E. Brenner Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Iddo Friedberg Department of Microbiology, Miami University, Oxford, OH, USA Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA USA

Collapse

Kasak L, Hunter JM, Udani R, Bakolitsa C, Hu Z, Adhikari AN, Babbi G, Casadio R, Gough J, Guerrero RF, Jiang Y, Joseph T, Katsonis P, Kotte S, Kundu K, Lichtarge O, Martelli PL, Mooney SD, Moult J, Pal LR, Poitras J, Radivojac P, Rao A, Sivadasan N, Sunderam U, VG S, Yin Y, Zaucha J, Brenner SE, Meyn MS. CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases. Hum Mutat 2019;40:1373-1391. [PMID: 31322791 PMCID: PMC7318886 DOI: 10.1002/humu.23874] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 07/15/2019] [Accepted: 07/15/2019] [Indexed: 01/02/2023]

Affiliation(s)

Laura Kasak Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
Jesse M. Hunter Department of Pediatrics and Wisconsin State Lab of Hygiene, University of Wisconsin Madison, WI, USA
Rupa Udani Department of Pediatrics and Wisconsin State Lab of Hygiene, University of Wisconsin Madison, WI, USA
Constantina Bakolitsa Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Zhiqiang Hu Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Aashish N. Adhikari Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
Giulia Babbi Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
Rita Casadio Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
Julian Gough Department of Computer Science, University of Bristol, Bristol, UK
Rafael F. Guerrero Department of Computer Science, Indiana University, IN, USA
Yuxiang Jiang Department of Computer Science, Indiana University, IN, USA
Thomas Joseph Tata Consultancy Services Ltd
Panagiotis Katsonis Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
Sujatha Kotte Tata Consultancy Services Ltd
Kunal Kundu Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, USA Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, MD, USA
Olivier Lichtarge Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA Department of Biochemistry & Molecular Biology, Department of Pharmacology, Computational and Integrative Biomedical Research Center, Baylor College of Medicine, Houston, TX, USA
Pier Luigi Martelli Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
Sean D. Mooney Department of Biomedical Informatics and Medical Education, University of Washington, WA, USA
John Moult Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, USA Department of Cell Biology and Molecular Genetics, University of Maryland, MD, USA
Lipika R. Pal Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, USA
Jennifer Poitras QIAGEN Bioinformatics, Redwood City, CA, USA
Predrag Radivojac Khoury College of Computer Sciences, Northeastern University, MA, USA
Aditya Rao Tata Consultancy Services Ltd
Naveen Sivadasan Tata Consultancy Services Ltd
Uma Sunderam Tata Consultancy Services Ltd
Saipradeep VG Tata Consultancy Services Ltd
Yizhou Yin Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, USA Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, MD, USA
Jan Zaucha Department of Computer Science, University of Bristol, Bristol, UK
Steven E. Brenner Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
M. Stephen Meyn Center for Human Genomics and Precision Medicine, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA Department of Paediatrics, The Hospital for Sick Children, Toronto, Canada

Collapse

McInnes G, Daneshjou R, Katsonis P, Lichtarge O, Srinivasan R, Rana S, Radivojac P, Mooney SD, Pagel KA, Stamboulian M, Jiang Y, Capriotti E, Wang Y, Bromberg Y, Bovo S, Savojardo C, Martelli PL, Casadio R, Pal LR, Moult J, Brenner SE, Altman R. Predicting venous thromboembolism risk from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges. Hum Mutat 2019;40:1314-1320. [PMID: 31140652 DOI: 10.1002/humu.23825] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 05/07/2019] [Accepted: 05/27/2019] [Indexed: 01/14/2023]

Affiliation(s)

Gregory McInnes Biomedical Informatics Training Program, Stanford University, Stanford, California
Roxana Daneshjou Department of Dermatology, Stanford School of Medicine, Stanford, California
Panagiostis Katsonis Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Olivier Lichtarge Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Department of Biochemistry & Molecular Biology, Baylor College of Medicine, Houston, Texas.,Department of Pharmacology, Baylor College of Medicine, Houston, Texas.,Computational and Integrative Biomedical Research Center, Baylor College of Medicine, Houston, Texas
Rajgopal Srinivasan Innovations Labs, Tata Consultancy Services, Hyderabad, India
Sadhna Rana Innovations Labs, Tata Consultancy Services, Hyderabad, India
Predrag Radivojac Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Sean D Mooney Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington
Kymberleigh A Pagel Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
Moses Stamboulian Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
Yuxiang Jiang Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
Emidio Capriotti BioFolD Unit, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
Yanran Wang Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, New Jersey
Yana Bromberg Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, New Jersey
Samuele Bovo Department of Pharmacy and Biotechnology, Bologna Biocomputing Group, University of Bologna, Italy
Castrense Savojardo Department of Pharmacy and Biotechnology, Bologna Biocomputing Group, University of Bologna, Italy
Pier Luigi Martelli Department of Pharmacy and Biotechnology, Bologna Biocomputing Group, University of Bologna, Italy
Rita Casadio Department of Pharmacy and Biotechnology, Bologna Biocomputing Group, University of Bologna, Italy.,Institute of Biomembrane and Bioenergetics, Consiglio Nazionale delle Ricerche, Bari, Italy
Lipika R Pal Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
John Moult Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Steven E Brenner Department of Plant and Microbial biology, University of California Berkeley, Berkeley, California
Russ Altman Departments of Bioengineering, Biomedical Data Science, Genetics, and Medicine, Stanford University, Stanford, California

Collapse

Li Z, Gao N, Martini JWR, Simianer H. Integrating Gene Expression Data Into Genomic Prediction. Front Genet 2019;10:126. [PMID: 30858865 PMCID: PMC6397893 DOI: 10.3389/fgene.2019.00126] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2018] [Accepted: 02/04/2019] [Indexed: 01/14/2023] Open

Kim OD, Rocha M, Maia P. A Review of Dynamic Modeling Approaches and Their Application in Computational Strain Optimization for Metabolic Engineering. Front Microbiol 2018;9:1690. [PMID: 30108559 PMCID: PMC6079213 DOI: 10.3389/fmicb.2018.01690] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Accepted: 07/06/2018] [Indexed: 12/03/2022] Open

Abstract

Mathematical modeling is a key process to describe the behavior of biological networks. One of the most difficult challenges is to build models that allow quantitative predictions of the cells' states along time. Recently, this issue started to be tackled through novel in silico approaches, such as the reconstruction of dynamic models, the use of phenotype prediction methods, and pathway design via efficient strain optimization algorithms. The use of dynamic models, which include detailed kinetic information of the biological systems, potentially increases the scope of the applications and the accuracy of the phenotype predictions. New efforts in metabolic engineering aim at bridging the gap between this approach and other different paradigms of mathematical modeling, as constraint-based approaches. These strategies take advantage of the best features of each method, and deal with the most remarkable limitation—the lack of available experimental information—which affects the accuracy and feasibility of solutions. Parameter estimation helps to solve this problem, but adding more computational cost to the overall process. Moreover, the existing approaches include limitations such as their scalability, flexibility, convergence time of the simulations, among others. The aim is to establish a trade-off between the size of the model and the level of accuracy of the solutions. In this work, we review the state of the art of dynamic modeling and related methods used for metabolic engineering applications, including approaches based on hybrid modeling. We describe approaches developed to undertake issues regarding the mathematical formulation and the underlying optimization algorithms, and that address the phenotype prediction by including available kinetic rate laws of metabolic processes. Then, we discuss how these have been used and combined as the basis to build computational strain optimization methods for metabolic engineering purposes, how they lead to bi-level schemes that can be used in the industry, including a consideration of their limitations.

Collapse

Robertson J, Yoshida C, Kruczkiewicz P, Nadon C, Nichani A, Taboada EN, Nash JHE. Comprehensive assessment of the quality of Salmonella whole genome sequence data available in public sequence databases using the Salmonella in silico Typing Resource (SISTR). Microb Genom 2018;4:e000151. [PMID: 29338812 PMCID: PMC5857378 DOI: 10.1099/mgen.0.000151] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 12/19/2017] [Indexed: 12/16/2022] Open

Lippert C, Sabatini R, Maher MC, Kang EY, Lee S, Arikan O, Harley A, Bernal A, Garst P, Lavrenko V, Yocum K, Wong T, Zhu M, Yang WY, Chang C, Lu T, Lee CWH, Hicks B, Ramakrishnan S, Tang H, Xie C, Piper J, Brewerton S, Turpaz Y, Telenti A, Roby RK, Och FJ, Venter JC. Identification of individuals by trait prediction using whole-genome sequencing data. Proc Natl Acad Sci U S A 2017;114:10166-10171. [PMID: 28874526 PMCID: PMC5617305 DOI: 10.1073/pnas.1711125114] [Citation(s) in RCA: 96] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Ray B, Liu W, Fenyö D. Adaptive Multiview Nonnegative Matrix Factorization Algorithm for Integration of Multimodal Biomedical Data. Cancer Inform 2017;16:1176935117725727. [PMID: 28835735 PMCID: PMC5564898 DOI: 10.1177/1176935117725727] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2017] [Accepted: 07/08/2017] [Indexed: 11/16/2022] Open

Abstract

The amounts and types of available multimodal tumor data are rapidly increasing, and their integration is critical for fully understanding the underlying cancer biology and personalizing treatment. However, the development of methods for effectively integrating multimodal data in a principled manner is lagging behind our ability to generate the data. In this article, we introduce an extension to a multiview nonnegative matrix factorization algorithm (NNMF) for dimensionality reduction and integration of heterogeneous data types and compare the predictive modeling performance of the method on unimodal and multimodal data. We also present a comparative evaluation of our novel multiview approach and current data integration methods. Our work provides an efficient method to extend an existing dimensionality reduction method. We report rigorous evaluation of the method on large-scale quantitative protein and phosphoprotein tumor data from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) acquired using state-of-the-art liquid chromatography mass spectrometry. Exome sequencing and RNA-Seq data were also available from The Cancer Genome Atlas for the same tumors. For unimodal data, in case of breast cancer, transcript levels were most predictive of estrogen and progesterone receptor status and copy number variation of human epidermal growth factor receptor 2 status. For ovarian and colon cancers, phosphoprotein and protein levels were most predictive of tumor grade and stage and residual tumor, respectively. When multiview NNMF was applied to multimodal data to predict outcomes, the improvement in performance is not overall statistically significant beyond unimodal data, suggesting that proteomics data may contain more predictive information regarding tumor phenotypes than transcript levels, probably due to the fact that proteins are the functional gene products and therefore a more direct measurement of the functional state of the tumor. Here, we have applied our proposed approach to multimodal molecular data for tumors, but it is generally applicable to dimensionality reduction and joint analysis of any type of multimodal data.

Collapse

Daneshjou R, Wang Y, Bromberg Y, Bovo S, Martelli PL, Babbi G, Lena PD, Casadio R, Edwards M, Gifford D, Jones DT, Sundaram L, Bhat RR, Li X, Pal LR, Kundu K, Yin Y, Moult J, Jiang Y, Pejaver V, Pagel KA, Li B, Mooney SD, Radivojac P, Shah S, Carraro M, Gasparini A, Leonardi E, Giollo M, Ferrari C, Tosatto SCE, Bachar E, Azaria JR, Ofran Y, Unger R, Niroula A, Vihinen M, Chang B, Wang MH, Franke A, Petersen BS, Pirooznia M, Zandi P, McCombie R, Potash JB, Altman RB, Klein TE, Hoskins RA, Repo S, Brenner SE, Morgan AA. Working toward precision medicine: Predicting phenotypes from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges. Hum Mutat 2017. [PMID: 28634997 DOI: 10.1002/humu.23280] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Affiliation(s)

Roxana Daneshjou Department of Genetics, Stanford School of Medicine, Stanford, California
Yanran Wang Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, New Jersey
Yana Bromberg Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, New Jersey
Samuele Bovo Biocomputing Group, BiGeA/CIG, "Luigi Galvani" Interdepartmental Center for Integrated Studies of Bioinformatics, Biophysics, and Biocomplexity, University of Bologna, Bologna, Italy
Pier L Martelli Biocomputing Group, BiGeA/CIG, "Luigi Galvani" Interdepartmental Center for Integrated Studies of Bioinformatics, Biophysics, and Biocomplexity, University of Bologna, Bologna, Italy
Giulia Babbi Biocomputing Group, BiGeA/CIG, "Luigi Galvani" Interdepartmental Center for Integrated Studies of Bioinformatics, Biophysics, and Biocomplexity, University of Bologna, Bologna, Italy
Pietro Di Lena Biocomputing Group/Department of Computer Science and Engineering, University of Bologna, Bologna, Italy
Rita Casadio Biocomputing Group, BiGeA/CIG, "Luigi Galvani" Interdepartmental Center for Integrated Studies of Bioinformatics, Biophysics, and Biocomplexity, University of Bologna, Bologna, Italy.,"Giorgio Prodi" Interdepartmental Center for Cancer Research, University of Bologna, Bologna, Italy
Matthew Edwards Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts
David Gifford Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts
David T Jones Bioinformatics Group, Department of Computer Science, University College London, London, United Kingdom
Laksshman Sundaram Large-scale Intelligent Systems Laboratory, NSF Center for Big Learning, University of Florida, Gainesville, Florida
Rajendra Rana Bhat Large-scale Intelligent Systems Laboratory, NSF Center for Big Learning, University of Florida, Gainesville, Florida
Xiaolin Li Large-scale Intelligent Systems Laboratory, NSF Center for Big Learning, University of Florida, Gainesville, Florida
Lipika R Pal Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Kunal Kundu Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Yizhou Yin Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
John Moult Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Yuxiang Jiang Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
Vikas Pejaver Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana.,Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington
Kymberleigh A Pagel Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
Biao Li Gilead Sciences, Foster City, California
Sean D Mooney Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington
Predrag Radivojac Department of Computer Science and Informatics, Indiana University, Bloomington, Indiana
Sohela Shah Qiagen Bioinformatics, Redwood City, California
Marco Carraro Department of Biomedical Science, University of Padova, Padova, Italy
Alessandra Gasparini Department of Biomedical Science, University of Padova, Padova, Italy.,Department of Woman and Child Health, University of Padova, Padova, Italy
Emanuela Leonardi Department of Woman and Child Health, University of Padova, Padova, Italy
Manuel Giollo Department of Biomedical Science, University of Padova, Padova, Italy.,Department of Information Engineering, University of Padova, Padova, Italy
Carlo Ferrari Department of Information Engineering, University of Padova, Padova, Italy
Silvio C E Tosatto Department of Biomedical Science, University of Padova, Padova, Italy.,CNR Institute of Neuroscience, Padova, Italy
Eran Bachar The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel
Johnathan R Azaria The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel
Yanay Ofran The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel
Ron Unger The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel
Abhishek Niroula Protein Structure and Bioinformatics Group, Department of Experimental Medical Science, Lund University, Lund, Sweden
Mauno Vihinen Protein Structure and Bioinformatics Group, Department of Experimental Medical Science, Lund University, Lund, Sweden
Billy Chang Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Maggie H Wang Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, Chinese University of Hong Kong, Shatin, N.T., Hong Kong.,CUHK Shenzhen Research Institute, Shenzhen, China
Andre Franke Institute of Clinical Molecular Biology, Christian-Albrechts-University Kiel, Kiel, Germany
Britt-Sabina Petersen Institute of Clinical Molecular Biology, Christian-Albrechts-University Kiel, Kiel, Germany
Mehdi Pirooznia Department of Psychiatry, The Johns Hopkins University School of Medicine, Baltimore, Maryland
Peter Zandi Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
Richard McCombie Cold Spring Harbor Laboratory, Cold Spring Harbor, New York
James B Potash Department of Psychiatry, University of Iowa, Iowa City, Iowa
Russ B Altman Department of Genetics, Stanford School of Medicine, Stanford, California
Teri E Klein Department of Genetics, Stanford School of Medicine, Stanford, California
Roger A Hoskins Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California
Susanna Repo Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California
Steven E Brenner Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California
Alexander A Morgan Stanford School of Medicine, Stanford, California

Collapse

Chandonia JM, Adhikari A, Carraro M, Chhibber A, Cutting GR, Fu Y, Gasparini A, Jones DT, Kramer A, Kundu K, Lam HYK, Leonardi E, Moult J, Pal LR, Searls DB, Shah S, Sunyaev S, Tosatto SCE, Yin Y, Buckley BA. Lessons from the CAGI-4 Hopkins clinical panel challenge. Hum Mutat 2017;38:1155-1168. [PMID: 28397312 DOI: 10.1002/humu.23225] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Revised: 03/24/2017] [Accepted: 03/29/2017] [Indexed: 12/17/2022]

Affiliation(s)

John-Marc Chandonia Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California
Aashish Adhikari Department of Plant and Microbial Biology, University of California, Berkeley, California
Marco Carraro Department of Biomedical Sciences, University of Padova, Padova, Italy
Aparna Chhibber Roche Sequencing Solutions, Belmont, California
Garry R Cutting McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
Yao Fu Roche Sequencing Solutions, Belmont, California
Alessandra Gasparini Department of Biomedical Sciences, University of Padova, Padova, Italy.,Department of Women's and Children's Health, University of Padova, Padova, Italy
David T Jones Department of Computer Science, University College London, London, United Kingdom
Andreas Kramer Qiagen Bioinformatics, Redwood City, California
Kunal Kundu Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Hugo Y K Lam Roche Sequencing Solutions, Belmont, California
Emanuela Leonardi Department of Women's and Children's Health, University of Padova, Padova, Italy
John Moult Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Lipika R Pal Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
David B Searls Independent Consultant, Philadelphia, Pennsylvania
Sohela Shah Qiagen Bioinformatics, Redwood City, California
Shamil Sunyaev Division of Genetics, Department of Medicine, Brigham & Women's Hospital, Harvard Medical School, Boston, Massachusetts.,Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
Silvio C E Tosatto Department of Biomedical Sciences, University of Padova, Padova, Italy.,CNR Institute of Neuroscience, Padova, Italy
Yizhou Yin Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland.,Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Bethany A Buckley McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland

Collapse

Yachison CA, Yoshida C, Robertson J, Nash JHE, Kruczkiewicz P, Taboada EN, Walker M, Reimer A, Christianson S, Nichani A, Nadon C. The Validation and Implications of Using Whole Genome Sequencing as a Replacement for Traditional Serotyping for a National Salmonella Reference Laboratory. Front Microbiol 2017. [PMID: 28649236 PMCID: PMC5465390 DOI: 10.3389/fmicb.2017.01044] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

Abstract

Salmonella serotyping remains the gold-standard tool for the classification of Salmonella isolates and forms the basis of Canada’s national surveillance program for this priority foodborne pathogen. Public health officials have been increasingly looking toward whole genome sequencing (WGS) to provide a large set of data from which all the relevant information about an isolate can be mined. However, rigorous validation and careful consideration of potential implications in the replacement of traditional surveillance methodologies with WGS data analysis tools is needed. Two in silico tools for Salmonella serotyping have been developed, the Salmonella in silico Typing Resource (SISTR) and SeqSero, while seven gene MLST for serovar prediction can be adapted for in silico analysis. All three analysis methods were assessed and compared to traditional serotyping techniques using a set of 813 verified clinical and laboratory isolates, including 492 Canadian clinical isolates and 321 isolates of human and non-human sources. Successful results were obtained for 94.8, 88.2, and 88.3% of the isolates tested using SISTR, SeqSero, and MLST, respectively, indicating all would be suitable for maintaining historical records, surveillance systems, and communication structures currently in place and the choice of the platform used will ultimately depend on the users need. Results also pointed to the need to reframe serotyping in the genomic era as a test to understand the genes that are carried by an isolate, one which is not necessarily congruent with what is antigenically expressed. The adoption of WGS for serotyping will provide the simultaneous collection of information that can be used by multiple programs within the current surveillance paradigm; however, this does not negate the importance of the various programs or the role of serotyping going forward.

Collapse

Niroula A, Vihinen M. Predicting Severity of Disease-Causing Variants. Hum Mutat 2017;38:357-364. [PMID: 28070986 DOI: 10.1002/humu.23173] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Revised: 12/07/2016] [Accepted: 01/06/2017] [Indexed: 12/22/2022]

Klein A, Mazor Y, Karban A, Ben-Itzhak O, Chowers Y, Sabo E. Early histological findings may predict the clinical phenotype in Crohn's colitis. United European Gastroenterol J 2016;5:694-701. [PMID: 28815033 DOI: 10.1177/2050640616676435] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/11/2016] [Accepted: 10/03/2016] [Indexed: 11/17/2022] Open

deAndrés-Galiana EJ, Fernández-Martínez JL, Sonis ST. Design of Biomedical Robots for Phenotype Prediction Problems. J Comput Biol 2016;23:678-92. [PMID: 27347715 DOI: 10.1089/cmb.2016.0008] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open

Lopes MS, Bastiaansen JW, Janss L, Knol EF, Bovenhuis H. Estimation of Additive, Dominance, and Imprinting Genetic Variance Using Genomic Data. G3 (Bethesda) 2015;5:2629-37. [PMID: 26438289 DOI: 10.1534/g3.115.019513] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Porth I, Klápště J, Skyba O, Friedmann MC, Hannemann J, Ehlting J, El-Kassaby YA, Mansfield SD, Douglas CJ. Network analysis reveals the relationship among wood properties, gene expression levels and genotypes of natural Populus trichocarpa accessions. New Phytol 2013;200:727-742. [PMID: 23889128 DOI: 10.1111/nph.12419] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2013] [Accepted: 06/17/2013] [Indexed: 05/21/2023]

Chang R, Shoemaker R, Wang W. A novel knowledge-driven systems biology approach for phenotype prediction upon genetic intervention. IEEE/ACM Trans Comput Biol Bioinform 2011;8:1170-1182. [PMID: 21282866 PMCID: PMC3211072 DOI: 10.1109/tcbb.2011.18] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Wang PI, Marcotte EM. It's the machine that matters: Predicting gene function and phenotype from protein networks. J Proteomics 2010;73:2277-89. [PMID: 20637909 PMCID: PMC2953423 DOI: 10.1016/j.jprot.2010.07.005] [Citation(s) in RCA: 102] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2010] [Revised: 06/22/2010] [Accepted: 07/07/2010] [Indexed: 12/17/2022]

Lamers SL, Salemi M, McGrath MS, Fogel GB. Prediction of R5, X4, and R5X4 HIV-1 coreceptor usage with evolved neural networks. IEEE/ACM Trans Comput Biol Bioinform 2008;5:291-300. [PMID: 18451438 PMCID: PMC3523352 DOI: 10.1109/tcbb.2007.1074] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Matthew R, Banjevic M, Chan AS, Myers L, Wolkowicz R, Haberer J, Singer J. Use of the l1 norm for selection of sparse parameter sets that accurately predict drug response phenotype from viral genetic sequences. AMIA Annu Symp Proc 2005;2005:505-9. [PMID: 16779091 PMCID: PMC1560612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]