Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Vihinen M. How to evaluate performance of prediction methods? Measures and their interpretation in variation effect analysis. BMC Genomics 2012;13 Suppl 4:S2. [PMID: 22759650 PMCID: PMC3303716 DOI: 10.1186/1471-2164-13-s4-s2] [Citation(s) in RCA: 155] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open

For:	Vihinen M. How to evaluate performance of prediction methods? Measures and their interpretation in variation effect analysis. BMC Genomics 2012;13 Suppl 4:S2. [PMID: 22759650 PMCID: PMC3303716 DOI: 10.1186/1471-2164-13-s4-s2] [Citation(s) in RCA: 155] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open

Number

Cited by Other Article(s)

Kolbinger FR, Veldhuizen GP, Zhu J, Truhn D, Kather JN. Reporting guidelines in medical artificial intelligence: a systematic review and meta-analysis. COMMUNICATIONS MEDICINE 2024;4:71. [PMID: 38605106 PMCID: PMC11009315 DOI: 10.1038/s43856-024-00492-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 03/27/2024] [Indexed: 04/13/2024] Open

Pirompud P, Sivapirunthep P, Punyapornwithaya V, Chaosap C. Application of machine learning algorithms to predict dead on arrival of broiler chickens raised without antibiotic program. Poult Sci 2024;103:103504. [PMID: 38335671 PMCID: PMC10864801 DOI: 10.1016/j.psj.2024.103504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 01/20/2024] [Accepted: 01/23/2024] [Indexed: 02/12/2024] Open

Abstract

Understanding the factors of dead-on-arrival (DOA) incidents during pre-slaughter handling is crucial for informed decision-making, improving broiler welfare, and optimizing farm profitability. In this study, 3 different machine learning (ML) algorithms - least absolute shrinkage and selection operator (LASSO), classification tree (CT), and random forest (RF) - were used together with 4 sampling techniques to optimize imbalanced data. The dataset comes from 22,115 broiler truckloads from a large producer in Thailand (2021-2022) and includes 14 independent variables covering the rearing, catching, and transportation stages. The study focuses on DOA% in the range of 0.10 to 1.20%, with a threshold for high DOA% above 0.3%, and records DOA% per truckload during pre-slaughter ante-mortem inspection. With a high DOA rate of 25.2%, the imbalanced dataset prompts the implementation of 4 methods to tune the imbalance parameters: random over sampling (ROS), random under sampling (RUS), both sampling (BOTH), and synthetic sampling or random over sampling example (ROSE). The aim is to improve the performance of the prediction model in classifying and predicting high DOA%. The comparative analysis of the different error metrics shows that RF outperforms the other models in a balanced dataset. In particular, RUS shows a significant improvement in prediction performance across all models compared to the original unbalanced dataset. The identification of the 4 most important variables for predicting high DOA percentages - mortality and culling rate, rearing stocking density, season, and mean body weight - emphasizes their importance for broiler production. This study provides valuable insights into the prediction of DOA status using an ML approach and contributes to the development of more effective strategies to mitigate high DOA percentages in commercial broiler production.

Collapse

Reggiani F, El Rashed Z, Petito M, Pfeffer M, Morabito A, Tanda ET, Spagnolo F, Croce M, Pfeffer U, Amaro A. Machine Learning Methods for Gene Selection in Uveal Melanoma. Int J Mol Sci 2024;25:1796. [PMID: 38339073 PMCID: PMC10855534 DOI: 10.3390/ijms25031796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 01/25/2024] [Accepted: 01/30/2024] [Indexed: 02/12/2024] Open

Khalilzad Z, Tadj C. Use of psychoacoustic spectrum warping, decision template fusion, and neighborhood component analysis in newborn cry diagnostic systems. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;155:901-914. [PMID: 38310608 DOI: 10.1121/10.0024618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 01/10/2024] [Indexed: 02/06/2024]

Porras LM, Padilla N, Moles-Fernández A, Feliubadaló L, Santamariña-Pena M, Sánchez AT, López-Novo A, Blanco A, de la Hoya M, Molina IJ, Osorio A, Pineda M, Rueda D, Ruiz-Ponte C, Vega A, Lázaro C, Díez O, Gutiérrez-Enríquez S, de la Cruz X. A New Set of in Silico Tools to Support the Interpretation of ATM Missense Variants Using Graphical Analysis. J Mol Diagn 2024;26:17-28. [PMID: 37865290 DOI: 10.1016/j.jmoldx.2023.09.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 06/30/2023] [Accepted: 09/20/2023] [Indexed: 10/23/2023] Open

Affiliation(s)

Luz-Marina Porras Research Unit in Clinical and Translational Bioinformatics, Vall d'Hebron Institute of Research, Universitat Autònoma de Barcelona, Barcelona, Spain
Natàlia Padilla Research Unit in Clinical and Translational Bioinformatics, Vall d'Hebron Institute of Research, Universitat Autònoma de Barcelona, Barcelona, Spain
Alejandro Moles-Fernández Hereditary Cancer Genetics Group, Vall d'Hebron Institute of Oncology, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain
Lidia Feliubadaló Hereditary Cancer Program, Catalan Institute of Oncology, Hospitalet de Llobregat, Barcelona, Spain; Program in Molecular Mechanisms and Experimental Therapy in Oncology, Instituto de Investigación Biomédica de Bellvitge (IDIBELL), Hospitalet de Llobregat, Barcelona, Spain; Centro de Investigación Biomédica en Red de Cáncer, Madrid, Spain
Marta Santamariña-Pena Fundación Pública Galega Medicina Xenómica, Santiago de Compostela, Spain; Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain; Centro de Investigación Biomédica en Red de enfermedades Raras, Madrid, Spain
Alysson T Sánchez Hereditary Cancer Program, Oncobell Program, Catalan Institute of Oncology, Instituto de Investigación Biomédica de Bellvitge (IDIBELL), Hospitalet de Llobregat, Barcelona, Spain
Anael López-Novo Fundación Pública Galega Medicina Xenómica, Santiago de Compostela, Spain; Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain
Ana Blanco Fundación Pública Galega Medicina Xenómica, Santiago de Compostela, Spain; Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain; Centro de Investigación Biomédica en Red de enfermedades Raras, Madrid, Spain
Miguel de la Hoya Molecular Oncology Laboratory, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
Ignacio J Molina Instituto de Biopatología y Medicina Regenerativa, Universidad de Granada and Instituto de Investigación Biosanitaria ibs.GRANADA, Granada, Spain
Ana Osorio Familial Cancer Clinical Unit, Human Cancer Genetics Programme, Spanish National Cancer Research Centre, Madrid, Spain; Spanish Network on Rare Diseases, Madrid, Spain
Marta Pineda Hereditary Cancer Program, Catalan Institute of Oncology, Hospitalet de Llobregat, Barcelona, Spain; Program in Molecular Mechanisms and Experimental Therapy in Oncology, Instituto de Investigación Biomédica de Bellvitge (IDIBELL), Hospitalet de Llobregat, Barcelona, Spain; Centro de Investigación Biomédica en Red de Cáncer, Madrid, Spain
Daniel Rueda Hereditary Cancer Laboratory, 12 de Octubre University Hospital, i+12 Research Institute, Madrid, Spain
Clara Ruiz-Ponte Fundación Pública Galega Medicina Xenómica, Santiago de Compostela, Spain; Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain; Centro de Investigación Biomédica en Red de enfermedades Raras, Madrid, Spain
Ana Vega Fundación Pública Galega Medicina Xenómica, Santiago de Compostela, Spain; Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain; Centro de Investigación Biomédica en Red de enfermedades Raras, Madrid, Spain
Conxi Lázaro Hereditary Cancer Program, Catalan Institute of Oncology, Hospitalet de Llobregat, Barcelona, Spain; Program in Molecular Mechanisms and Experimental Therapy in Oncology, Instituto de Investigación Biomédica de Bellvitge (IDIBELL), Hospitalet de Llobregat, Barcelona, Spain; Centro de Investigación Biomédica en Red de Cáncer, Madrid, Spain
Orland Díez Hereditary Cancer Genetics Group, Vall d'Hebron Institute of Oncology, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Area of Clinical and Molecular Genetics, Vall d'Hebron Hospital Universitari, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain
Sara Gutiérrez-Enríquez Hereditary Cancer Genetics Group, Vall d'Hebron Institute of Oncology, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain.
Xavier de la Cruz Research Unit in Clinical and Translational Bioinformatics, Vall d'Hebron Institute of Research, Universitat Autònoma de Barcelona, Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain.

Collapse

Guéniche N, Lakehal Z, Habauzit D, Bruyère A, Fardel O, Le Hégarat L, Huguet A. Combined in silico and in vitro approaches to identify P-glycoprotein-inhibiting pesticides. J Biochem Mol Toxicol 2024;38:e23588. [PMID: 37985955 DOI: 10.1002/jbt.23588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 10/04/2023] [Accepted: 11/10/2023] [Indexed: 11/22/2023]

Shirvanizadeh N, Vihinen M. VariBench, new variation benchmark categories and data sets. FRONTIERS IN BIOINFORMATICS 2023;3:1248732. [PMID: 37795169 PMCID: PMC10546188 DOI: 10.3389/fbinf.2023.1248732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/08/2023] [Indexed: 10/06/2023] Open

Bhandari N, Walambe R, Kotecha K, Kaliya M. Integrative gene expression analysis for the diagnosis of Parkinson's disease using machine learning and explainable AI. Comput Biol Med 2023;163:107140. [PMID: 37315380 DOI: 10.1016/j.compbiomed.2023.107140] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 05/29/2023] [Accepted: 06/04/2023] [Indexed: 06/16/2023]

Yang Y, Chong Z, Vihinen M. PON-Fold: Prediction of Substitutions Affecting Protein Folding Rate. Int J Mol Sci 2023;24:13023. [PMID: 37629203 PMCID: PMC10455311 DOI: 10.3390/ijms241613023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/08/2023] [Accepted: 08/09/2023] [Indexed: 08/27/2023] Open

Aspromonte MC, Conte AD, Zhu S, Tan W, Shen Y, Zhang Y, Li Q, Wang MH, Babbi G, Bovo S, Martelli PL, Casadio R, Althagafi A, Toonsi S, Kulmanov M, Hoehndorf R, Katsonis P, Williams A, Lichtarge O, Xian S, Surento W, Pejaver V, Mooney SD, Sunderam U, Srinivasan R, Murgia A, Piovesan D, Tosatto SCE, Leonardi E. CAGI6 ID-Challenge: Assessment of phenotype and variant predictions in 415 children with Neurodevelopmental Disorders (NDDs). RESEARCH SQUARE 2023:rs.3.rs-3209168. [PMID: 37577579 PMCID: PMC10418555 DOI: 10.21203/rs.3.rs-3209168/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]

Affiliation(s)

Maria Cristina Aspromonte Department of Biomedical Sciences, University of Padova
Alessio Del Conte Department of Biomedical Sciences, University of Padova
Shaowen Zhu Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
Wuwei Tan Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
Yang Shen Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
Yexian Zhang CUHK Shenzhen Research Institute, Shenzhen
Qi Li CUHK Shenzhen Research Institute, Shenzhen
Maggie Haitian Wang CUHK Shenzhen Research Institute, Shenzhen
Giulia Babbi Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
Samuele Bovo Department of Agricultural and Food Sciences, University of Bologna
Pier Luigi Martelli Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
Rita Casadio Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
Azza Althagafi Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Sumyyah Toonsi Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Maxat Kulmanov Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Robert Hoehndorf Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
Panagiotis Katsonis Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
Amanda Williams Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
Olivier Lichtarge Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
Su Xian Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
Wesley Surento Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
Vikas Pejaver Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY 10029
Sean D Mooney Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
Uma Sunderam Innovation Labs, Tata Consultancy Services, Hyderabad
Rajgopal Srinivasan Innovation Labs, Tata Consultancy Services, Hyderabad
Alessandra Murgia Department of Women's and Children's Health, University of Padova
Damiano Piovesan Department of Biomedical Sciences, University of Padova
Silvio C E Tosatto Department of Biomedical Sciences, University of Padova
Emanuela Leonardi Department of Biomedical Sciences, University of Padova

Collapse

Ahn SY, Jung EH, Ahn H, Lee JS, Bak JH, Kim ED, Song JH, Shin HS, Jamiyansharav M, Seo KY. Automatic measurement of mouse visual acuity based on optomotor response: SKY optomotry. Lab Anim 2023;57:412-423. [PMID: 36708198 DOI: 10.1177/00236772221148576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]

Aguirre J, Padilla N, Özkan S, Riera C, Feliubadaló L, de la Cruz X. Choosing Variant Interpretation Tools for Clinical Applications: Context Matters. Int J Mol Sci 2023;24:11872. [PMID: 37511631 PMCID: PMC10380979 DOI: 10.3390/ijms241411872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 07/10/2023] [Accepted: 07/20/2023] [Indexed: 07/30/2023] Open

Rani D, Krishan K, Kanchan T. A methodological comparison of discriminant function analysis and binary logistic regression for estimating sex in forensic research and case-work. MEDICINE, SCIENCE, AND THE LAW 2023;63:227-236. [PMID: 36366800 DOI: 10.1177/00258024221136687] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Wang M, Zhao C, Barr A, Fan H, Yu S, Kapellusch J, Harris Adamson C. Hand Posture and Force Estimation Using Surface Electromyography and an Artificial Neural Network. HUMAN FACTORS 2023;65:382-402. [PMID: 34006135 DOI: 10.1177/00187208211016695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Khalilzad Z, Tadj C. Using CCA-Fused Cepstral Features in a Deep Learning-Based Cry Diagnostic System for Detecting an Ensemble of Pathologies in Newborns. Diagnostics (Basel) 2023;13:diagnostics13050879. [PMID: 36900023 PMCID: PMC10000938 DOI: 10.3390/diagnostics13050879] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 02/14/2023] [Accepted: 02/21/2023] [Indexed: 03/02/2023] Open

A new blood based epigenetic age predictor for adolescents and young adults. Sci Rep 2023;13:2303. [PMID: 36759656 PMCID: PMC9911637 DOI: 10.1038/s41598-023-29381-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 02/03/2023] [Indexed: 02/11/2023] Open

Deyneko IV. Guidelines on the performance evaluation of motif recognition methods in bioinformatics. Front Genet 2023;14:1135320. [PMID: 36824436 PMCID: PMC9941176 DOI: 10.3389/fgene.2023.1135320] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 01/19/2023] [Indexed: 02/09/2023] Open

C L, S P, Kashyap AH, Rahaman A, Niranjan S, Niranjan V. Novel Biomarker Prediction for Lung Cancer Using Random Forest Classifiers. Cancer Inform 2023;22:11769351231167992. [PMID: 37113644 PMCID: PMC10126698 DOI: 10.1177/11769351231167992] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 03/17/2023] [Indexed: 04/29/2023] Open

Abstract

Lung cancer is considered the most common and the deadliest cancer type. Lung cancer could be mainly of 2 types: small cell lung cancer and non-small cell lung cancer. Non-small cell lung cancer is affected by about 85% while small cell lung cancer is only about 14%. Over the last decade, functional genomics has arisen as a revolutionary tool for studying genetics and uncovering changes in gene expression. RNA-Seq has been applied to investigate the rare and novel transcripts that aid in discovering genetic changes that occur in tumours due to different lung cancers. Although RNA-Seq helps to understand and characterise the gene expression involved in lung cancer diagnostics, discovering the biomarkers remains a challenge. Usage of classification models helps uncover and classify the biomarkers based on gene expression levels over the different lung cancers. The current research concentrates on computing transcript statistics from gene transcript files with a normalised fold change of genes and identifying quantifiable differences in gene expression levels between the reference genome and lung cancer samples. The collected data is analysed, and machine learning models were developed to classify genes as causing NSCLC, causing SCLC, causing both or neither. An exploratory data analysis was performed to identify the probability distribution and principal features. Due to the limited number of features available, all of them were used in predicting the class. To address the imbalance in the dataset, an under-sampling algorithm Near Miss was carried out on the dataset. For classification, the research primarily focused on 4 supervised machine learning algorithms: Logistic Regression, KNN classifier, SVM classifier and Random Forest classifier and additionally, 2 ensemble algorithms were considered: XGboost and AdaBoost. Out of these, based on the weighted metrics considered, the Random Forest classifier showing 87% accuracy was considered to be the best performing algorithm and thus was used to predict the biomarkers causing NSCLC and SCLC. The imbalance and limited features in the dataset restrict any further improvement in the model's accuracy or precision. In our present study using the gene expression values (LogFC, P Value) as the feature sets in the Random Forest Classifier BRAF, KRAS, NRAS, EGFR is predicted to be the possible biomarkers causing NSCLC and ATF6, ATF3, PGDFA, PGDFD, PGDFC and PIP5K1C is predicted to be the possible biomarkers causing SCLC from the transcriptome analysis. It gave a precision of 91.3% and 91% recall after fine tuning. Some of the common biomarkers predicted for NSCLC and SCLC were CDK4, CDK6, BAK1, CDKN1A, DDB2.

Collapse

Ma J, Qin T, Xiang J. Disease-gene prediction based on preserving structure network embedding. Front Aging Neurosci 2023;15:1061892. [PMID: 36896421 PMCID: PMC9990751 DOI: 10.3389/fnagi.2023.1061892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 01/30/2023] [Indexed: 02/23/2023] Open

Wang H, Li H, Gao W, Xie J. PrUb-EL: A hybrid framework based on deep learning for identifying ubiquitination sites in Arabidopsis thaliana using ensemble learning strategy. Anal Biochem 2022;658:114935. [PMID: 36206844 DOI: 10.1016/j.ab.2022.114935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/25/2022] [Accepted: 09/26/2022] [Indexed: 12/30/2022]

Abstract

Identification of ubiquitination sites is central to many biological experiments. Ubiquitination is a kind of post-translational protein modification (PTM). It is a key mechanism for increasing protein diversity and plays a vital role in regulating cell function. In recent years, many models have been developed to predict ubiquitination sites in humans, mice and yeast. However, few studies have predicted ubiquitination sites in Arabidopsis thaliana. In view of this, a deep network model named PrUb-EL is proposed to predict ubiquitination sites in Arabidopsis thaliana. Firstly, six features based on the protein sequence are extracted with amino acid index database (AAindex), dipeptide deviates from the expected mean (DDE), dipeptide composition (DPC), blocks substitution matrix (BLOSUM62), enhanced amino acid composition (EAAC) and binary encoding. Secondly, the synthetic minority over-sampling technique (SMOTE) is utilized to process the imbalanced data set. Then a new classifier named DG is presented, which includes Dense block, Residual block and Gated recurrent unit (GRU) block. Finally, each of six feature extraction methods is integrated into the DG model, and the ensemble learning strategy is used to gain the final prediction result. Experimental results show that PrUb-EL has good predictive ability with the accuracy (ACC) and area under the ROC curve (auROC) values of 91.00% and 97.70% using 5-fold cross-validation, respectively. Note that the values of ACC and auROC are 88.58% and 96.09% in the independent test, respectively. Compared with previous studies, our model has significantly improved performance thus it is an excellent method for identifying ubiquitination sites in Arabidopsis thaliana. The datasets and code used for the article are available at https://github.com/Tom-Wangy/PreUb-EL.git.

Collapse

He B, Wang K, Xiang J, Bing P, Tang M, Tian G, Guo C, Xu M, Yang J. DGHNE: network enhancement-based method in identifying disease-causing genes through a heterogeneous biomedical network. Brief Bioinform 2022;23:6712302. [PMID: 36151744 DOI: 10.1093/bib/bbac405] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/01/2022] [Accepted: 08/21/2022] [Indexed: 12/14/2022] Open

Abstract

The identification of disease-causing genes is critical for mechanistic understanding of disease etiology and clinical manipulation in disease prevention and treatment. Yet the existing approaches in tackling this question are inadequate in accuracy and efficiency, demanding computational methods with higher identification power. Here, we proposed a new method called DGHNE to identify disease-causing genes through a heterogeneous biomedical network empowered by network enhancement. First, a disease-disease association network was constructed by the cosine similarity scores between phenotype annotation vectors of diseases, and a new heterogeneous biomedical network was constructed by using disease-gene associations to connect the disease-disease network and gene-gene network. Then, the heterogeneous biomedical network was further enhanced by using network embedding based on the Gaussian random projection. Finally, network propagation was used to identify candidate genes in the enhanced network. We applied DGHNE together with five other methods into the most updated disease-gene association database termed DisGeNet. Compared with all other methods, DGHNE displayed the highest area under the receiver operating characteristic curve and the precision-recall curve, as well as the highest precision and recall, in both the global 5-fold cross-validation and predicting new disease-gene associations. We further performed DGHNE in identifying the candidate causal genes of Parkinson's disease and diabetes mellitus, and the genes connecting hyperglycemia and diabetes mellitus. In all cases, the predicted causing genes were enriched in disease-associated gene ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways, and the gene-disease associations were highly evidenced by independent experimental studies.

Collapse

Bhandari N, Walambe R, Kotecha K, Khare SP. A comprehensive survey on computational learning methods for analysis of gene expression data. Front Mol Biosci 2022;9:907150. [PMID: 36458095 PMCID: PMC9706412 DOI: 10.3389/fmolb.2022.907150] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 09/28/2022] [Indexed: 09/19/2023] Open

Parallel functional annotation of cancer-associated missense mutations in histone methyltransferases. Sci Rep 2022;12:18487. [PMID: 36323913 PMCID: PMC9630446 DOI: 10.1038/s41598-022-23229-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 10/27/2022] [Indexed: 12/03/2022] Open

Comprehensive In Silico Functional Prediction Analysis of CDKL5 by Single Amino Acid Substitution in the Catalytic Domain. Int J Mol Sci 2022;23:ijms232012281. [PMID: 36293137 PMCID: PMC9603577 DOI: 10.3390/ijms232012281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 10/07/2022] [Accepted: 10/11/2022] [Indexed: 11/05/2022] Open

Prado MJ, Ligabue-Braun R, Zaha A, Rossetti MLR, Pandey AV. Variant predictions in congenital adrenal hyperplasia caused by mutations in CYP21A2. Front Pharmacol 2022;13:931089. [PMID: 36278220 PMCID: PMC9579345 DOI: 10.3389/fphar.2022.931089] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 09/14/2022] [Indexed: 11/13/2022] Open

Liu Y, Yeung WSB, Chiu PCN, Cao D. Computational approaches for predicting variant impact: An overview from resources, principles to applications. Front Genet 2022;13:981005. [PMID: 36246661 PMCID: PMC9559863 DOI: 10.3389/fgene.2022.981005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 08/08/2022] [Indexed: 11/13/2022] Open

Yang Y, Zhao J, Zeng L, Vihinen M. ProTstab2 for Prediction of Protein Thermal Stabilities. Int J Mol Sci 2022;23:ijms231810798. [PMID: 36142711 PMCID: PMC9505338 DOI: 10.3390/ijms231810798] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 09/12/2022] [Accepted: 09/13/2022] [Indexed: 11/16/2022] Open

Behrendt A, Golchin P, König F, Mulnaes D, Stalke A, Dröge C, Keitel V, Gohlke H. Vasor: Accurate prediction of variant effects for amino acid substitutions in multidrug resistance protein 3. Hepatol Commun 2022;6:3098-3111. [PMID: 36111625 PMCID: PMC9592774 DOI: 10.1002/hep4.2088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Revised: 07/26/2022] [Accepted: 08/16/2022] [Indexed: 12/14/2022] Open

Khalilzad Z, Kheddache Y, Tadj C. An Entropy-Based Architecture for Detection of Sepsis in Newborn Cry Diagnostic Systems. ENTROPY (BASEL, SWITZERLAND) 2022;24:1194. [PMID: 36141080 PMCID: PMC9498202 DOI: 10.3390/e24091194] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 08/18/2022] [Accepted: 08/22/2022] [Indexed: 06/16/2023]

Exploring the predictive capability of machine learning models in identifying foot and mouth disease outbreak occurrences in cattle farms in an endemic setting of Thailand. Prev Vet Med 2022;207:105706. [DOI: 10.1016/j.prevetmed.2022.105706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 06/09/2022] [Accepted: 07/01/2022] [Indexed: 11/20/2022]

Yang Y, Shao A, Vihinen M. PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms. Front Mol Biosci 2022;9:867572. [PMID: 35782867 PMCID: PMC9245922 DOI: 10.3389/fmolb.2022.867572] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Accepted: 05/02/2022] [Indexed: 01/08/2023] Open

Predicting Parkinson disease related genes based on PyFeat and gradient boosted decision tree. Sci Rep 2022;12:10004. [PMID: 35705654 PMCID: PMC9200794 DOI: 10.1038/s41598-022-14127-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 06/01/2022] [Indexed: 11/10/2022] Open

Yazar M, Ozbek P. Assessment of 13 in silico pathogenicity methods on cancer-related variants. Comput Biol Med 2022;145:105434. [DOI: 10.1016/j.compbiomed.2022.105434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 03/04/2022] [Accepted: 03/20/2022] [Indexed: 11/03/2022]

Kebabci N, Timucin AC, Timucin E. Toward Compilation of Balanced Protein Stability Data Sets: Flattening the ΔΔG Curve through Systematic Enrichment. J Chem Inf Model 2022;62:1345-1355. [PMID: 35201762 DOI: 10.1021/acs.jcim.2c00054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Abstract

Often studies analyzing stability data sets and/or predictors ignore neutral mutations and use a binary classification scheme labeling only destabilizing and stabilizing mutations. Recognizing that highly concentrated neutral mutations interfere with data set quality, we have explored three protein stability data sets: S2648, PON-tstab, and the symmetric S^sym that differ in size and quality. A characteristic leptokurtic shape in the ΔΔG distributions of all three data sets including the curated and symmetric ones was reported due to concentrated neutral mutations. To further investigate the impact of neutral mutations on ΔΔG predictions, we have comprehensively assessed the performance of 11 predictors on the PON-tstab data set. Correlation and error analyses showed that all of the predictors performed the best on the neutral mutations, while their performance became gradually worse as the ΔΔG of the mutations departed further from the neutral zone regardless of the direction, implying a bias toward dense mutations. To this end, after unraveling the role of concentrated neutral mutations in biases of stability data sets, we described a systematic enrichment approach to balance the ΔΔG distributions. Before enrichment, mutations were clustered based on their biochemical and/or structural features, and then three mutations were selected from every 2 kcal/mol of each cluster. Upon implementation of this approach by distinct clustering schemes, we generated five subsets varying in size and ΔΔG distributions. All subsets showed improved ΔΔG and frequency distributions. We ultimately reported that the errors toward enriched subsets were higher than those toward the parent data sets, confirming the enrichment of difficult-to-predict mutations in the subsets. In summary, we elaborated the prediction bias toward a concentrated neutral zone and also implemented a rational strategy to tackle this and other forms of biases. Ultimately, this study equipping us with an extended view of shortcomings of stability data sets is a step taken toward development of an unbiased predictor.

Collapse

Khan MNA, Miah MSU, Shahjalal M, Sarwar TB, Rokon MS. Predicting Young Imposter Syndrome Using Ensemble Learning. COMPLEXITY 2022;2022:1-10. [DOI: 10.1155/2022/8306473] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]

Abstract Background. Imposter syndrome (IS), associated with self-doubt and fear despite clear accomplishments and competencies, is frequently detected in medical students and has a negative impact on their well-being. This study aimed to predict the students’ IS using the machine learning ensemble approach. Methods. This study was a cross-sectional design among medical students in Bangladesh. Data were collected from February to July 2020 through snowball sampling technique across medical colleges in Bangladesh. In this study, we employed three different machine learning techniques such as neural network, random forest, and ensemble learning to compare the accuracy of prediction of the IS. Results. In total, 500 students completed the questionnaire. We used the YIS scale to determine the presence of IS among medical students. The ensemble model has the highest accuracy of this predictive model, with 96.4%, while the individual accuracy of random forest and neural network is 93.5% and 96.3%, respectively. We used different performance matrices to compare the results of the models. Finally, we compared feature importance scores between neural network and random forest model. The top feature of the neural network model is Y7, and the top feature of the random forest model is Y2, which is second among the top features of the neural network model. Conclusions. Imposter syndrome is an emerging mental illness in Bangladesh and requires the immediate attention of researchers. For instance, in order to reduce the impact of IS, identifying key factors responsible for IS is an important step. Machine learning methods can be employed to identify the potential sources responsible for IS. Similarly, determining how each factor contributes to the IS condition among medical students could be a potential future direction. Collapse

Houskeeper HF, Rosenthal IS, Cavanaugh KC, Pawlak C, Trouille L, Byrnes JEK, Bell TW, Cavanaugh KC. Automated satellite remote sensing of giant kelp at the Falkland Islands (Islas Malvinas). PLoS One 2022;17:e0257933. [PMID: 34990455 PMCID: PMC8735600 DOI: 10.1371/journal.pone.0257933] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Accepted: 12/20/2021] [Indexed: 11/30/2022] Open

Abstract

Giant kelp populations that support productive and diverse coastal ecosystems at temperate and subpolar latitudes of both hemispheres are vulnerable to changing climate conditions as well as direct human impacts. Observations of giant kelp forests are spatially and temporally uneven, with disproportionate coverage in the northern hemisphere, despite the size and comparable density of southern hemisphere kelp forests. Satellite imagery enables the mapping of existing and historical giant kelp populations in understudied regions, but automating the detection of giant kelp using satellite imagery requires approaches that are robust to the optical complexity of the shallow, nearshore environment. We present and compare two approaches for automating the detection of giant kelp in satellite datasets: one based on crowd sourcing of satellite imagery classifications and another based on a decision tree paired with a spectral unmixing algorithm (automated using Google Earth Engine). Both approaches are applied to satellite imagery (Landsat) of the Falkland Islands or Islas Malvinas (FLK), an archipelago in the southern Atlantic Ocean that supports expansive giant kelp ecosystems. The performance of each method is evaluated by comparing the automated classifications with a subset of expert-annotated imagery (8 images spanning the majority of our continuous timeseries, cumulatively covering over 2,700 km of coastline, and including all relevant sensors). Using the remote sensing approaches evaluated herein, we present the first continuous timeseries of giant kelp observations in the FLK region using Landsat imagery spanning over three decades. We do not detect evidence of long-term change in the FLK region, although we observe a recent decline in total canopy area from 2017-2021. Using a nitrate model based on nearby ocean state measurements obtained from ships and incorporating satellite sea surface temperature products, we find that the area of giant kelp forests in the FLK region is positively correlated with the nitrate content observed during the prior year. Our results indicate that giant kelp classifications using citizen science are approximately consistent with classifications based on a state-of-the-art automated spectral approach. Despite differences in accuracy and sensitivity, both approaches find high interannual variability that impedes the detection of potential long-term changes in giant kelp canopy area, although recent canopy area declines are notable and should continue to be monitored carefully.

Collapse

Wafula EK, Zhang H, Von Kuster G, Leebens-Mack JH, Honaas LA, dePamphilis CW. PlantTribes2: Tools for comparative gene family analysis in plant genomics. FRONTIERS IN PLANT SCIENCE 2022;13:1011199. [PMID: 36798801 PMCID: PMC9928214 DOI: 10.3389/fpls.2022.1011199] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 12/02/2022] [Indexed: 05/12/2023]

Abstract

Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.

Collapse

Kanda K, Blythe S, Grace R, Elcombe E, Kemp L. Variations in sustained home visiting care for mothers and children experiencing adversity. Public Health Nurs 2021;39:71-81. [PMID: 34862813 PMCID: PMC9299687 DOI: 10.1111/phn.13014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 11/01/2021] [Accepted: 11/03/2021] [Indexed: 11/30/2022]

Kolbert Z, Lindermayr C. Computational prediction of NO-dependent posttranslational modifications in plants: Current status and perspectives. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2021;167:851-861. [PMID: 34536898 DOI: 10.1016/j.plaphy.2021.09.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 09/04/2021] [Accepted: 09/08/2021] [Indexed: 05/11/2023]

Khoruddin NA, Noorizhab MN, Teh LK, Mohd Yusof FZ, Salleh MZ. Pathogenic nsSNPs that increase the risks of cancers among the Orang Asli and Malays. Sci Rep 2021;11:16158. [PMID: 34373545 PMCID: PMC8352870 DOI: 10.1038/s41598-021-95618-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Accepted: 07/26/2021] [Indexed: 02/07/2023] Open

Prabhu BN, Kanchamreddy SH, Sharma AR, Bhat SK, Bhat PV, Kabekkodu SP, Satyamoorthy K, Rai PS. Conceptualization of functional single nucleotide polymorphisms of polycystic ovarian syndrome genes: an in silico approach. J Endocrinol Invest 2021;44:1783-1793. [PMID: 33506367 PMCID: PMC8285346 DOI: 10.1007/s40618-021-01498-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 01/02/2021] [Indexed: 12/24/2022]

Yang Y, Zeng L, Vihinen M. PON-Sol2: Prediction of Effects of Variants on Protein Solubility. Int J Mol Sci 2021;22:8027. [PMID: 34360790 PMCID: PMC8348231 DOI: 10.3390/ijms22158027] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 07/19/2021] [Accepted: 07/22/2021] [Indexed: 01/13/2023] Open

Guéniche N, Huguet A, Bruyere A, Habauzit D, Le Hégarat L, Fardel O. Comparative in silico prediction of P-glycoprotein-mediated transport for 2010-2020 US FDA-approved drugs using six Web-tools. Biopharm Drug Dispos 2021;42:393-398. [PMID: 34272891 DOI: 10.1002/bdd.2299] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Revised: 06/28/2021] [Accepted: 07/08/2021] [Indexed: 01/08/2023]

Özkan S, Padilla N, de la Cruz X. Towards a New, Endophenotype-Based Strategy for Pathogenicity Prediction in BRCA1 and BRCA2: In Silico Modeling of the Outcome of HDR/SGE Assays for Missense Variants. Int J Mol Sci 2021;22:6226. [PMID: 34207612 PMCID: PMC8229251 DOI: 10.3390/ijms22126226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 05/27/2021] [Accepted: 06/04/2021] [Indexed: 11/28/2022] Open

Adelaja A, Taylor B, Sheu KM, Liu Y, Luecke S, Hoffmann A. Six distinct NFκB signaling codons convey discrete information to distinguish stimuli and enable appropriate macrophage responses. Immunity 2021;54:916-930.e7. [PMID: 33979588 PMCID: PMC8184127 DOI: 10.1016/j.immuni.2021.04.011] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Revised: 12/21/2020] [Accepted: 04/13/2021] [Indexed: 12/12/2022]

Xiang J, Zhang J, Zheng R, Li X, Li M. NIDM: network impulsive dynamics on multiplex biological network for disease-gene prediction. Brief Bioinform 2021;22:6236070. [PMID: 33866352 DOI: 10.1093/bib/bbab080] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 02/11/2021] [Accepted: 02/21/2021] [Indexed: 12/12/2022] Open

Zhao X, Yao H, Li X. Unearthing of Key Genes Driving the Pathogenesis of Alzheimer's Disease via Bioinformatics. Front Genet 2021;12:641100. [PMID: 33936168 PMCID: PMC8085575 DOI: 10.3389/fgene.2021.641100] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 03/15/2021] [Indexed: 01/23/2023] Open

Podlewska S, Kurczab R. Mutual Support of Ligand- and Structure-Based Approaches-To What Extent We Can Optimize the Power of Predictive Model? Case Study of Opioid Receptors. Molecules 2021;26:molecules26061607. [PMID: 33799356 PMCID: PMC7998793 DOI: 10.3390/molecules26061607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 03/10/2021] [Accepted: 03/11/2021] [Indexed: 11/16/2022] Open

Abstract

The process of modern drug design would not exist in the current form without computational methods. They are part of every stage of the drug design pipeline, supporting the search and optimization of new bioactive substances. Nevertheless, despite the great help that is offered by in silico strategies, the power of computational methods strongly depends on the input data supplied at the stage of the predictive model construction. The studies on the efficiency of the computational protocols most often focus on global efficiency. They use general parameters that refer to the whole dataset, such as accuracy, precision, mean squared error, etc. In the study, we examined machine learning predictions obtained for opioid receptors (mu, kappa, delta) and focused on cases for which the predictions were the most accurate and the least accurate. Moreover, by using docking, we tried to explain prediction errors. We attempted to develop a rule of thumb, which can help in the prediction of compound activity towards opioid receptors via docking, especially those that have been incorrectly predicted by machine learning. We found out that although the combination of ligand- and structure-based path can be beneficial for the prediction accuracy, there still remain cases that cannot be reliably predicted by any available modeling method. In addition to challenging ligand- and structure-based predictions, we also examined the role of the application of machine-learning methods in comparison to simple statistical methods for both standard ligand-based representations (molecular fingerprints) and interaction fingerprints. All approaches were confronted in both classification (where compounds were assigned to the group of active and inactive group constructed on the basis of K_i values) and regression (where exact K_i value was predicted) experiments.

Collapse

Hyperspectral Image Spectral–Spatial Classification Method Based on Deep Adaptive Feature Fusion. REMOTE SENSING 2021. [DOI: 10.3390/rs13040746] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Prasanna A, Niranjan V. Clin-mNGS: Automated Pipeline for Pathogen Detection from Clinical Metagenomic Data. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200608130029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Abstract Background: Since bacteria are the earliest known organisms, there has been significant interest in their variety and biology, most certainly concerning human health. Recent advances in Metagenomics sequencing (mNGS), a culture-independent sequencing technology, have facilitated an accelerated development in clinical microbiology and our understanding of pathogens. Objective: For the implementation of mNGS in routine clinical practice to become feasible, a practical and scalable strategy for the study of mNGS data is essential. This study presents a robust automated pipeline to analyze clinical metagenomic data for pathogen identification and classification. Method: The proposed Clin-mNGS pipeline is an integrated, open-source, scalable, reproducible, and user-friendly framework scripted using the Snakemake workflow management software. The implementation avoids the hassle of manual installation and configuration of the multiple commandline tools and dependencies. The approach directly screens pathogens from clinical raw reads and generates consolidated reports for each sample. Results: The pipeline is demonstrated using publicly available data and is tested on a desktop Linux system and a High-performance cluster. The study compares variability in results from different tools and versions. The versions of the tools are made user modifiable. The pipeline results in quality check, filtered reads, host subtraction, assembled contigs, assembly metrics, relative abundances of bacterial species, antimicrobial resistance genes, plasmid finding, and virulence factors identification. The results obtained from the pipeline are evaluated based on sensitivity and positive predictive value. Conclusion: Clin-mNGS is an automated Snakemake pipeline validated for the analysis of microbial clinical metagenomics reads to perform taxonomic classification and antimicrobial resistance prediction. Collapse