Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chicco D, Tötsch N, Jurman G. The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Min 2021;14:13. [PMID: 33541410 PMCID: PMC7863449 DOI: 10.1186/s13040-021-00244-z] [Citation(s) in RCA: 177] [Impact Index Per Article: 59.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 01/18/2021] [Indexed: 01/28/2023] Open

For:	Chicco D, Tötsch N, Jurman G. The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Min 2021;14:13. [PMID: 33541410 PMCID: PMC7863449 DOI: 10.1186/s13040-021-00244-z] [Citation(s) in RCA: 177] [Impact Index Per Article: 59.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 01/18/2021] [Indexed: 01/28/2023] Open

Number

Cited by Other Article(s)

Yadav S, Vora DS, Sundar D, Dhanjal JK. TCR-ESM: Employing protein language embeddings to predict TCR-peptide-MHC binding. Comput Struct Biotechnol J 2024;23:165-173. [PMID: 38146434 PMCID: PMC10749252 DOI: 10.1016/j.csbj.2023.11.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 11/19/2023] [Accepted: 11/20/2023] [Indexed: 12/27/2023] Open

Li S, Hamdi M, Dutta K, Fraum TJ, Luo J, Laforest R, Shoghi KI. FAST (fast analytical simulator of tracer)-PET: an accurate and efficient PET analytical simulation tool. Phys Med Biol 2024;69:165020. [PMID: 39047765 DOI: 10.1088/1361-6560/ad6743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 07/23/2024] [Indexed: 07/27/2024]

Abstract

Objective.Simulation of positron emission tomography (PET) images is an essential tool in the development and validation of quantitative imaging workflows and advanced image processing pipelines. Existing Monte Carlo or analytical PET simulators often compromise on either efficiency or accuracy. We aim to develop and validate fast analytical simulator of tracer (FAST)-PET, a novel analytical framework, to simulate PET images accurately and efficiently.Approach. FAST-PET simulates PET images by performing precise forward projection, scatter, and random estimation that match the scanner geometry and statistics. Although the same process should be applicable to other scanner models, we focus on the Siemens Biograph Vision-600 in this work. Calibration and validation of FAST-PET were performed through comparison with an experimental scan of a National Electrical Manufacturers Association (NEMA) Image Quality (IQ) phantom. Further validation was conducted between FAST-PET and Geant4 Application for Tomographic Emission (GATE) quantitatively in clinical image simulations in terms of intensity-based and texture-based features and task-based tumor segmentation.Main results.According to the NEMA IQ phantom simulation, FAST-PET's simulated images exhibited partial volume effects and noise levels comparable to experimental images, with a relative bias of the recovery coefficient RC within 10% for all spheres and a coefficient of variation for the background region within 6% across various acquisition times. FAST-PET generated clinical PET images exhibit high quantitative accuracy and texture comparable to GATE (correlation coefficients of all features over 0.95) but with ∼100-fold lower computation time. The tumor segmentation masks comparison between both methods exhibited significant overlap and shape similarity with high concordance CCC > 0.97 across measures.Significance.FAST-PET generated PET images with high quantitative accuracy comparable to GATE, making it ideal for applications requiring extensive PET image simulations such as virtual imaging trials, and the development and validation of image processing pipelines.

Collapse

Helander M. "Dead or Alive?" Assessment of the Binary End-of-Event Outcome Indicator for the NEMSIS Public Research Dataset. PREHOSP EMERG CARE 2024:1-15. [PMID: 39106451 DOI: 10.1080/10903127.2024.2389551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 07/23/2024] [Accepted: 07/26/2024] [Indexed: 08/09/2024]

Abstract

OBJECTIVES

The National Emergency Medical Services Information Services (NEMSIS) provides a robust set of data to evaluate prehospital care. However, a major limitation is that the vast majority of the records lack a definitive outcome. We aimed to evaluate the performance of a recently proposed method ('MLB' method) to impute missing end-of-EMS-event outcomes ("dead" or "alive") for patient care reports in the NEMSIS public research dataset.

METHODS

This study reproduced the recently published method for patient outcome imputation in the NEMSIS database and replicated the results for years 2017 through 2022 (n = 686,075). We performed statistical analyses leveraging an array of established performance metrics for binary classification in the machine learning literature. Evaluation metrics included overall accuracy, true positive rate, true negative rate, balanced accuracy, precision, F1 score, Cohen's Kappa coefficient, Matthews' coefficient, Hamming loss, the Jaccard similarity score, and the receiver operating characteristic/area under the curve.

RESULTS

Extended metrics show consistently good imputation performance from year-to-year but reveal weakness in accurately indicating the minority class: e.g., after adjustments for conflicting labels, "dead" prediction accuracy was 77.7% for 2018 and 61.8% over the six-year NEMSIS sub-sample, even though overall accuracy was 98.8%. Slight over-fitting is also present.

CONCLUSIONS

We found that the recently published MLB method produced reasonably good "dead" or "alive" indicators. We recommend reporting of True Positive Rate ("dead" prediction accuracy) and True Negative Rate ("alive" prediction accuracy) when applying the imputation method for analyses of NEMSIS data. More attention by EMS clinicians to complete documentation of target NEMSIS elements can further improve the method's performance.

Collapse

Zhang Y, Tian Y, Yan A. A SAR and QSAR study on 3CLpro inhibitors of SARS-CoV-2 using machine learning methods. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024:1-33. [PMID: 39077983 DOI: 10.1080/1062936x.2024.2375513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 06/27/2024] [Indexed: 07/31/2024]

Susanty M, Mursalim MKN, Hertadi R, Purwarianti A, LE Rajab T. Leveraging protein language model embeddings and logistic regression for efficient and accurate in-silico acidophilic proteins classification. Comput Biol Chem 2024;112:108163. [PMID: 39098138 DOI: 10.1016/j.compbiolchem.2024.108163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 07/02/2024] [Accepted: 07/24/2024] [Indexed: 08/06/2024]

Abbasian Ardakani A, Airom O, Khorshidi H, Bureau NJ, Salvi M, Molinari F, Acharya UR. Interpretation of Artificial Intelligence Models in Healthcare: A Pictorial Guide for Clinicians. JOURNAL OF ULTRASOUND IN MEDICINE : OFFICIAL JOURNAL OF THE AMERICAN INSTITUTE OF ULTRASOUND IN MEDICINE 2024. [PMID: 39032010 DOI: 10.1002/jum.16524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 06/19/2024] [Accepted: 07/01/2024] [Indexed: 07/22/2024]

Kehrein J, Bunker A, Luxenhofer R. POxload: Machine Learning Estimates Drug Loadings of Polymeric Micelles. Mol Pharm 2024;21:3356-3374. [PMID: 38805643 DOI: 10.1021/acs.molpharmaceut.4c00086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/30/2024]

Eken A, Nassehi F, Eroğul O. Diagnostic machine learning applications on clinical populations using functional near infrared spectroscopy: a review. Rev Neurosci 2024;35:421-449. [PMID: 38308531 DOI: 10.1515/revneuro-2023-0117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Accepted: 01/12/2024] [Indexed: 02/04/2024]

Parvanovova P, Hnilicova P, Kolisek M, Tatarkova Z, Halasova E, Kurca E, Holubcikova S, Koprusakova MT, Baranovicova E. Disturbances in Muscle Energy Metabolism in Patients with Amyotrophic Lateral Sclerosis. Metabolites 2024;14:356. [PMID: 39057679 PMCID: PMC11278632 DOI: 10.3390/metabo14070356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 06/17/2024] [Accepted: 06/19/2024] [Indexed: 07/28/2024] Open

Abstract

Amyotrophic lateral sclerosis (ALS) is a fatal neuromuscular disease type of motor neuron disorder characterized by degeneration of the upper and lower motor neurons resulting in dysfunction of the somatic muscles of the body. The ALS condition is manifested in progressive skeletal muscle atrophy and spasticity. It leads to death, mostly due to respiratory failure. Within the pathophysiology of the disease, muscle energy metabolism seems to be an important part. In our study, we used blood plasma from 25 patients with ALS diagnosed by definitive El Escorial criteria according to ALSFR-R (Revised Amyotrophic Lateral Sclerosis Functional Rating Scale) criteria and 25 age and sex-matched subjects. Aside from standard clinical biochemical parameters, we used the NMR (nuclear magnetic resonance) metabolomics approach to determine relative plasma levels of metabolites. We observed a decrease in total protein level in blood; however, despite accelerated skeletal muscle catabolism characteristic for ALS patients, we did not detect changes in plasma levels of essential amino acids. When focused on alterations in energy metabolism within muscle, compromised creatine uptake was accompanied by decreased plasma creatinine. We did not observe changes in plasma levels of BCAAs (branched chain amino acids; leucine, isoleucine, valine); however, the observed decrease in plasma levels of all three BCKAs (branched chain alpha-keto acids derived from BCAAs) suggests enhanced utilization of BCKAs as energy substrate. Glutamine, found to be increased in blood plasma in ALS patients, besides serving for ammonia detoxification, could also be considered a potential TCA (tricarboxylic acid) cycle contributor in times of decreased pyruvate utilization. When analyzing the data by using a cross-validated Random Forest algorithm, it finished with an AUC of 0.92, oob error of 8%, and an MCC (Matthew's correlation coefficient) of 0.84 when relative plasma levels of metabolites were used as input variables. Although the discriminatory power of the system used was promising, additional features are needed to create a robust discriminatory model.

Collapse

Almotairi S, Badr E, Abdelbaky I, Elhakeem M, Abdul Salam M. Hybrid transformer-CNN model for accurate prediction of peptide hemolytic potential. Sci Rep 2024;14:14263. [PMID: 38902287 PMCID: PMC11190137 DOI: 10.1038/s41598-024-63446-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 05/29/2024] [Indexed: 06/22/2024] Open

Salvatti BA, Chagas MA, Fernandes PO, Ladeira YFX, Bozzi AS, Valadares VS, Valente AP, de Miranda AS, Rocha WR, Maltarollo VG, Moraes AH. Understanding the Enzyme (S)-Norcoclaurine Synthase Promiscuity to Aldehydes and Ketones. J Chem Inf Model 2024;64:4462-4474. [PMID: 38776464 DOI: 10.1021/acs.jcim.3c01773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]

Abstract

The (S)-norcoclaurine synthase from Thalictrum flavum (TfNCS) stereoselectively catalyzes the Pictet-Spengler reaction between dopamine and 4-hydroxyphenylacetaldehyde to give (S)-norcoclaurine. TfNCS can catalyze the Pictet-Spengler reaction with various aldehydes and ketones, leading to diverse tetrahydroisoquinolines. This substrate promiscuity positions TfNCS as a highly promising enzyme for synthesizing fine chemicals. Understanding carbonyl-containing substrates' structural and electronic signatures that influence TfNCS activity can help expand its applications in the synthesis of different compounds and aid in protein optimization strategies. In this study, we investigated the influence of the molecular properties of aldehydes and ketones on their reactivity in the TfNCS-catalyzed Pictet-Spengler reaction. Initially, we compiled a library of reactive and unreactive compounds from previous publications. We also performed enzymatic assays using nuclear magnetic resonance to identify some reactive and unreactive carbonyl compounds, which were then included in the library. Subsequently, we employed QSAR and DFT calculations to establish correlations between substrate-candidate structures and reactivity. Our findings highlight correlations of structural and stereoelectronic features, including the electrophilicity of the carbonyl group, to the reactivity of aldehydes and ketones toward the TfNCS-catalyzed Pictet-Spengler reaction. Interestingly, experimental data of seven compounds out of fifty-three did not correlate with the electrophilicity of the carbonyl group. For these seven compounds, we identified unfavorable interactions between them and the TfNCS. Our results demonstrate the applications of in silico techniques in understanding enzyme promiscuity and specificity, with a particular emphasis on machine learning methodologies, DFT electronic structure calculations, and molecular dynamic (MD) simulations.

Collapse

Bian Z, Bao T, Sun X, Wang N, Mu Q, Jiang T, Yu Z, Ding J, Wang T, Zhou Q. Machine Learning Tools to Assist the Synthesis of Antibacterial Carbon Dots. Int J Nanomedicine 2024;19:5213-5226. [PMID: 38855729 PMCID: PMC11162209 DOI: 10.2147/ijn.s451680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 05/03/2024] [Indexed: 06/11/2024] Open

Niu Y, Li Z, Chen Z, Huang W, Tan J, Tian F, Yang T, Fan Y, Wei J, Mu J. Efficient screening of pharmacological broad-spectrum anti-cancer peptides utilizing advanced bidirectional Encoder representation from Transformers strategy. Heliyon 2024;10:e30373. [PMID: 38765108 PMCID: PMC11101728 DOI: 10.1016/j.heliyon.2024.e30373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 04/24/2024] [Accepted: 04/24/2024] [Indexed: 05/21/2024] Open

Abstract

In the vanguard of oncological advancement, this investigation delineates the integration of deep learning paradigms to refine the screening process for Anticancer Peptides (ACPs), epitomizing a new frontier in broad-spectrum oncolytic therapeutics renowned for their targeted antitumor efficacy and specificity. Conventional methodologies for ACP identification are marred by prohibitive time and financial exigencies, representing a formidable impediment to the evolution of precision oncology. In response, our research heralds the development of a groundbreaking screening apparatus that marries Natural Language Processing (NLP) with the Pseudo Amino Acid Composition (PseAAC) technique, thereby inaugurating a comprehensive ACP compendium for the extraction of quintessential primary and secondary structural attributes. This innovative methodological approach is augmented by an optimized BERT model, meticulously calibrated for ACP detection, which conspicuously surpasses existing BERT variants and traditional machine learning algorithms in both accuracy and selectivity. Subjected to rigorous validation via five-fold cross-validation and external assessment, our model exhibited exemplary performance, boasting an average Area Under the Curve (AUC) of 0.9726 and an F1 score of 0.9385, with external validation further affirming its prowess (AUC of 0.9848 and F1 of 0.9371). These findings vividly underscore the method's unparalleled efficacy and prospective utility in the precise identification and prognostication of ACPs, significantly ameliorating the financial and temporal burdens traditionally associated with ACP research and development. Ergo, this pioneering screening paradigm promises to catalyze the discovery and clinical application of ACPs, constituting a seminal stride towards the realization of more efficacious and economically viable precision oncology interventions.

Collapse

Adenis L, Mailler S, Menut L, Achim P, Generoso S. Lagrangian and Eulerian modelling of ¹⁰⁶Ru atmospheric transport in 2017 over northern hemisphere. JOURNAL OF ENVIRONMENTAL RADIOACTIVITY 2024;275:107416. [PMID: 38520991 DOI: 10.1016/j.jenvrad.2024.107416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/13/2024] [Accepted: 03/13/2024] [Indexed: 03/25/2024]

Susanty M, Naim Mursalim MK, Hertadi R, Purwarianti A, Rajab TLE. Classifying alkaliphilic proteins using embeddings from protein language model. Comput Biol Med 2024;173:108385. [PMID: 38547659 DOI: 10.1016/j.compbiomed.2024.108385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 03/22/2024] [Accepted: 03/24/2024] [Indexed: 04/17/2024]

Castillo-Mendieta K, Agüero-Chapin G, Marquez E, Perez-Castillo Y, Barigye SJ, Pérez-Cárdenas M, Peréz-Giménez F, Marrero-Ponce Y. Multiquery Similarity Searching Models: An Alternative Approach for Predicting Hemolytic Activity from Peptide Sequence. Chem Res Toxicol 2024;37:580-589. [PMID: 38501392 DOI: 10.1021/acs.chemrestox.3c00408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]

Affiliation(s)

Kevin Castillo-Mendieta School of Biological Sciences and Engineering, Yachay Tech University, Hda. San José s/n y Proyecto Yachay, Urcuquí 100119, Ecuador
Guillermin Agüero-Chapin CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, Terminal de Cruzeiros do Porto de Leixões, University of Porto, Av. General Norton de Matos s/n, 4450-208 Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
Edgar Marquez Grupo de Investigaciones en Química y Biología, Departamento de Química y Biología, Facultad de Ciencias Básicas, Universidad del Norte, Carrera 51B, Km 5, vía Puerto Colombia, Barranquilla 081007, Colombia
Yunierkis Perez-Castillo Bio-Chemoinformatics Research Group and Escuela de Ciencias Físicas y Matemáticas. Universidad de Las Américas, Quito 170504, Ecuador
Stephen J Barigye Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid (UAM), 28049 Madrid, Spain
Mariela Pérez-Cárdenas School of Biological Sciences and Engineering, Yachay Tech University, Hda. San José s/n y Proyecto Yachay, Urcuquí 100119, Ecuador
Facundo Peréz-Giménez Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia 46100, Spain
Yovani Marrero-Ponce Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia 46100, Spain Facultad de Ingeniería, Universidad Panamericana, Augusto Rodin No. 498, Insurgentes Mixcoac, Benito Juárez, CDMX, Mexico 03920, Mexico Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas; and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Universidad San Francisco de Quito (USFQ), Quito, Pichincha 170157, Ecuador

Collapse

Giudice L, Mohamed A, Malm T. StellarPath: Hierarchical-vertical multi-omics classifier synergizes stable markers and interpretable similarity networks for patient profiling. PLoS Comput Biol 2024;20:e1012022. [PMID: 38607982 PMCID: PMC11042724 DOI: 10.1371/journal.pcbi.1012022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 04/24/2024] [Accepted: 03/25/2024] [Indexed: 04/14/2024] Open

Charest N, Lowe CN, Ramsland C, Meyer B, Samano V, Williams AJ. Improving predictions of compound amenability for liquid chromatography-mass spectrometry to enhance non-targeted analysis. Anal Bioanal Chem 2024;416:2565-2579. [PMID: 38530399 PMCID: PMC11228616 DOI: 10.1007/s00216-024-05229-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 02/14/2024] [Accepted: 02/16/2024] [Indexed: 03/28/2024]

Abstract

Mass-spectrometry-based non-targeted analysis (NTA), in which mass spectrometric signals are assigned chemical identities based on a systematic collation of evidence, is a growing area of interest for toxicological risk assessment. Successful NTA results in better identification of potentially hazardous pollutants within the environment, facilitating the development of targeted analytical strategies to best characterize risks to human and ecological health. A supporting component of the NTA process involves assessing whether suspected chemicals are amenable to the mass spectrometric method, which is necessary in order to assign an observed signal to the chemical structure. Prior work from this group involved the development of a random forest model for predicting the amenability of 5517 unique chemical structures to liquid chromatography-mass spectrometry (LC-MS). This work improves the interpretability of the group's prior model of the same endpoint, as well as integrating 1348 more data points across negative and positive ionization modes. We enhance interpretability by feature engineering, a machine learning practice that reduces the input dimensionality while attempting to preserve performance statistics. We emphasize the importance of interpretable machine learning models within the context of building confidence in NTA identification. The novel data were curated by the labeling of compounds as amenable or unamenable by expert curators, resulting in an enhanced set of chemical compounds to expand the applicability domain of the prior model. The balanced accuracy benchmark of the newly developed model is comparable to performance previously reported (mean CV BA is 0.84 vs. 0.82 in positive mode, and 0.85 vs. 0.82 in negative mode), while on a novel external set, derived from this work's data, the Matthews correlation coefficients (MCC) for the novel models are 0.66 and 0.68 for positive and negative mode, respectively. Our group's prior published models scored MCC of 0.55 and 0.54 on the same external sets. This demonstrates appreciable improvement over the chemical space captured by the expanded dataset. This work forms part of our ongoing efforts to develop models with higher interpretability and higher performance to support NTA efforts.

Collapse

Reiter T, Schoedel R. Never miss a beep: Using mobile sensing to investigate (non-)compliance in experience sampling studies. Behav Res Methods 2024;56:4038-4060. [PMID: 37932624 PMCID: PMC11133120 DOI: 10.3758/s13428-023-02252-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/16/2023] [Indexed: 11/08/2023]

Winter NR, Blanke J, Leenings R, Ernsting J, Fisch L, Sarink K, Barkhau C, Emden D, Thiel K, Flinkenflügel K, Winter A, Goltermann J, Meinert S, Dohm K, Repple J, Gruber M, Leehr EJ, Opel N, Grotegerd D, Redlich R, Nitsch R, Bauer J, Heindel W, Gross J, Risse B, Andlauer TFM, Forstner AJ, Nöthen MM, Rietschel M, Hofmann SG, Pfarr JK, Teutenberg L, Usemann P, Thomas-Odenthal F, Wroblewski A, Brosch K, Stein F, Jansen A, Jamalabadi H, Alexander N, Straube B, Nenadić I, Kircher T, Dannlowski U, Hahn T. A Systematic Evaluation of Machine Learning-Based Biomarkers for Major Depressive Disorder. JAMA Psychiatry 2024;81:386-395. [PMID: 38198165 PMCID: PMC10782379 DOI: 10.1001/jamapsychiatry.2023.5083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/05/2023] [Indexed: 01/11/2024]

Abstract

Importance

Biological psychiatry aims to understand mental disorders in terms of altered neurobiological pathways. However, for one of the most prevalent and disabling mental disorders, major depressive disorder (MDD), no informative biomarkers have been identified.

Objective

To evaluate whether machine learning (ML) can identify a multivariate biomarker for MDD.

Design, Setting, and Participants

This study used data from the Marburg-Münster Affective Disorders Cohort Study, a case-control clinical neuroimaging study. Patients with acute or lifetime MDD and healthy controls aged 18 to 65 years were recruited from primary care and the general population in Münster and Marburg, Germany, from September 11, 2014, to September 26, 2018. The Münster Neuroimaging Cohort (MNC) was used as an independent partial replication sample. Data were analyzed from April 2022 to June 2023.

Exposure

Patients with MDD and healthy controls.

Main Outcome and Measure

Diagnostic classification accuracy was quantified on an individual level using an extensive ML-based multivariate approach across a comprehensive range of neuroimaging modalities, including structural and functional magnetic resonance imaging and diffusion tensor imaging as well as a polygenic risk score for depression.

Results

Of 1801 included participants, 1162 (64.5%) were female, and the mean (SD) age was 36.1 (13.1) years. There were a total of 856 patients with MDD (47.5%) and 945 healthy controls (52.5%). The MNC replication sample included 1198 individuals (362 with MDD [30.1%] and 836 healthy controls [69.9%]). Training and testing a total of 4 million ML models, mean (SD) accuracies for diagnostic classification ranged between 48.1% (3.6%) and 62.0% (4.8%). Integrating neuroimaging modalities and stratifying individuals based on age, sex, treatment, or remission status does not enhance model performance. Findings were replicated within study sites and also observed in structural magnetic resonance imaging within MNC. Under simulated conditions of perfect reliability, performance did not significantly improve. Analyzing model errors suggests that symptom severity could be a potential focus for identifying MDD subgroups.

Conclusion and Relevance

Despite the improved predictive capability of multivariate compared with univariate neuroimaging markers, no informative individual-level MDD biomarker-even under extensive ML optimization in a large sample of diagnosed patients-could be identified.

Collapse

Affiliation(s)

Nils R. Winter Institute for Translational Psychiatry, University of Münster, Münster, Germany Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Münster, Münster, Germany
Julian Blanke Institute for Translational Psychiatry, University of Münster, Münster, Germany
Ramona Leenings Institute for Translational Psychiatry, University of Münster, Münster, Germany Faculty of Mathematics and Computer Science, University of Münster, Münster, Germany
Jan Ernsting Institute for Translational Psychiatry, University of Münster, Münster, Germany Faculty of Mathematics and Computer Science, University of Münster, Münster, Germany Institute for Geoinformatics, University of Münster, Münster, Germany
Lukas Fisch Institute for Translational Psychiatry, University of Münster, Münster, Germany
Kelvin Sarink Institute for Translational Psychiatry, University of Münster, Münster, Germany
Carlotta Barkhau Institute for Translational Psychiatry, University of Münster, Münster, Germany
Daniel Emden Institute for Translational Psychiatry, University of Münster, Münster, Germany
Katharina Thiel Institute for Translational Psychiatry, University of Münster, Münster, Germany
Kira Flinkenflügel Institute for Translational Psychiatry, University of Münster, Münster, Germany
Alexandra Winter Institute for Translational Psychiatry, University of Münster, Münster, Germany
Janik Goltermann Institute for Translational Psychiatry, University of Münster, Münster, Germany
Susanne Meinert Institute for Translational Psychiatry, University of Münster, Münster, Germany Institute for Translational Neuroscience, University of Münster, Münster, Germany
Katharina Dohm Institute for Translational Psychiatry, University of Münster, Münster, Germany
Jonathan Repple Institute for Translational Psychiatry, University of Münster, Münster, Germany Department of Psychiatry, Psychosomatic Medicine and Psychotherapy, University Hospital Frankfurt, Goethe University, Frankfurt am Main, Germany
Marius Gruber Institute for Translational Psychiatry, University of Münster, Münster, Germany Department of Psychiatry, Psychosomatic Medicine and Psychotherapy, University Hospital Frankfurt, Goethe University, Frankfurt am Main, Germany
Elisabeth J. Leehr Institute for Translational Psychiatry, University of Münster, Münster, Germany
Nils Opel Institute for Translational Psychiatry, University of Münster, Münster, Germany Department of Psychiatry and Psychotherapy, University Hospital Jena, Jena, Germany Center for Intervention and Research on Adaptive and Maladaptive Brain Circuits Underlying Mental Health, Jena, Germany German Center for Mental Health (DZPG), Jena, Germany
Dominik Grotegerd Institute for Translational Psychiatry, University of Münster, Münster, Germany
Ronny Redlich Institute for Translational Psychiatry, University of Münster, Münster, Germany Center for Intervention and Research on Adaptive and Maladaptive Brain Circuits Underlying Mental Health, Jena, Germany Department of Psychology, University of Halle, Halle, Germany German Center for Mental Health (DZPG), Halle, Germany
Robert Nitsch Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Münster, Münster, Germany Institute for Translational Neuroscience, University of Münster, Münster, Germany
Jochen Bauer Clinic for Radiology, University of Münster, University Hospital Münster, Münster, Germany
Walter Heindel Clinic for Radiology, University of Münster, University Hospital Münster, Münster, Germany
Joachim Gross Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Münster, Münster, Germany Institute for Biomagnetism and Biosignalanalysis, University of Münster, Münster, Germany
Benjamin Risse Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Münster, Münster, Germany Faculty of Mathematics and Computer Science, University of Münster, Münster, Germany Institute for Geoinformatics, University of Münster, Münster, Germany
Till F. M. Andlauer Department of Neurology, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
Andreas J. Forstner Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany Institute of Neuroscience and Medicine (INM-1), Research Centre Jülich, Jülich, Germany
Markus M. Nöthen Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
Marcella Rietschel Department of Genetic Epidemiology, Central Institute of Mental Health, Faculty of Medicine Mannheim, University of Heidelberg, Mannheim, Germany
Stefan G. Hofmann Department of Clinical Psychology, Philipps-University Marburg, Marburg, Germany
Julia-Katharina Pfarr Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Lea Teutenberg Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Paula Usemann Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Florian Thomas-Odenthal Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Adrian Wroblewski Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Katharina Brosch Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Frederike Stein Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Andreas Jansen Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany Core Facility Brain Imaging, Faculty of Medicine, Philipps-University Marburg, Marburg, Germany
Hamidreza Jamalabadi Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany
Nina Alexander Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Benjamin Straube Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Igor Nenadić Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Tilo Kircher Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany Center for Mind, Brain and Behavior (CMBB), Marburg, Germany
Udo Dannlowski Institute for Translational Psychiatry, University of Münster, Münster, Germany Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Münster, Münster, Germany
Tim Hahn Institute for Translational Psychiatry, University of Münster, Münster, Germany Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Münster, Münster, Germany

Collapse

Guan J, Yao L, Xie P, Chung CR, Huang Y, Chiang YC, Lee TY. A two-stage computational framework for identifying antiviral peptides and their functional types based on contrastive learning and multi-feature fusion strategy. Brief Bioinform 2024;25:bbae208. [PMID: 38706321 PMCID: PMC11070730 DOI: 10.1093/bib/bbae208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Revised: 03/14/2024] [Accepted: 04/17/2024] [Indexed: 05/07/2024] Open

Román L, Melis-Arcos F, Pröschle T, Saa PA, Garrido D. Genome-scale metabolic modeling of the human milk oligosaccharide utilization by Bifidobacterium longum subsp. infantis. mSystems 2024;9:e0071523. [PMID: 38363147 PMCID: PMC10949479 DOI: 10.1128/msystems.00715-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 01/10/2024] [Indexed: 02/17/2024] Open

Abstract

Bifidobacterium longum subsp. infantis is a representative and dominant species in the infant gut and is considered a beneficial microbe. This organism displays multiple adaptations to thrive in the infant gut, regarded as a model for human milk oligosaccharides (HMOs) utilization. These carbohydrates are abundant in breast milk and include different molecules based on lactose. They contain fucose, sialic acid, and N-acetylglucosamine. Bifidobacterium metabolism is complex, and a systems view of relevant metabolic pathways and exchange metabolites during HMO consumption is missing. To address this limitation, a refined genome-scale network reconstruction of this bacterium is presented using a previous reconstruction of B. infantis ATCC 15967 as a template. The latter was expanded based on an extensive revision of genome annotations, current literature, and transcriptomic data integration. The metabolic reconstruction (iLR578) accounted for 578 genes, 1,047 reactions, and 924 metabolites. Starting from this reconstruction, we built context-specific genome-scale metabolic models using RNA-seq data from cultures growing in lactose and three HMOs. The models revealed notable differences in HMO metabolism depending on the functional characteristics of the substrates. Particularly, fucosyl-lactose showed a divergent metabolism due to a fucose moiety. High yields of lactate and acetate were predicted under growth rate maximization in all conditions, whereas formate, ethanol, and 1,2-propanediol were substantially lower. Similar results were also obtained under near-optimal growth on each substrate when varying the empirically observed acetate-to-lactate production ratio. Model predictions displayed reasonable agreement between central carbon metabolism fluxes and expression data across all conditions. Flux coupling analysis revealed additional connections between succinate exchange and arginine and sulfate metabolism and a strong coupling between central carbon reactions and adenine metabolism. More importantly, specific networks of coupled reactions under each carbon source were derived and analyzed. Overall, the presented network reconstruction constitutes a valuable platform for probing the metabolism of this prominent infant gut bifidobacteria.IMPORTANCEThis work presents a detailed reconstruction of the metabolism of Bifidobacterium longum subsp. infantis, a prominent member of the infant gut microbiome, providing a systems view of its metabolism of human milk oligosaccharides.

Collapse

Woods AI, Primrose DM, Paiva J, Blanco AN, Alberto MF, Sánchez-Luceros A. Clinical relevance of genetic variants in the von Willebrand factor according to in-silico methods. Am J Med Genet A 2024;194:e63430. [PMID: 37872709 DOI: 10.1002/ajmg.a.63430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/03/2023] [Accepted: 09/22/2023] [Indexed: 10/25/2023]

Khaleel HA, Alhilfi RA, Rawaf S, Tabche C. Identify future epidemic threshold and intensity for influenza-like illness in Iraq by using the moving epidemic method. IJID REGIONS 2024;10:126-131. [PMID: 38260712 PMCID: PMC10801321 DOI: 10.1016/j.ijregi.2023.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 12/16/2023] [Accepted: 12/18/2023] [Indexed: 01/24/2024]

Ban D, Housley SN, Matyunina LV, McDonald LD, Bae-Jump VL, Benigno BB, Skolnick J, McDonald JF. A personalized probabilistic approach to ovarian cancer diagnostics. Gynecol Oncol 2024;182:168-175. [PMID: 38266403 PMCID: PMC10960662 DOI: 10.1016/j.ygyno.2023.12.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 12/18/2023] [Accepted: 12/29/2023] [Indexed: 01/26/2024]

Mollura M, Chicco D, Paglialonga A, Barbieri R. Identifying prognostic factors for survival in intensive care unit patients with SIRS or sepsis by machine learning analysis on electronic health records. PLOS DIGITAL HEALTH 2024;3:e0000459. [PMID: 38489347 PMCID: PMC10942078 DOI: 10.1371/journal.pdig.0000459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 02/05/2024] [Indexed: 03/17/2024]

Abstract

BACKGROUND

Systemic inflammatory response syndrome (SIRS) and sepsis are the most common causes of in-hospital death. However, the characteristics associated with the improvement in the patient conditions during the ICU stay were not fully elucidated for each population as well as the possible differences between the two.

GOAL

The aim of this study is to highlight the differences between the prognostic clinical features for the survival of patients diagnosed with SIRS and those of patients diagnosed with sepsis by using a multi-variable predictive modeling approach with a reduced set of easily available measurements collected at the admission to the intensive care unit (ICU).

METHODS

Data were collected from 1,257 patients (816 non-sepsis SIRS and 441 sepsis) admitted to the ICU. We compared the performance of five machine learning models in predicting patient survival. Matthews correlation coefficient (MCC) was used to evaluate model performances and feature importance, and by applying Monte Carlo stratified Cross-Validation.

RESULTS

Extreme Gradient Boosting (MCC = 0.489) and Logistic Regression (MCC = 0.533) achieved the highest results for SIRS and sepsis cohorts, respectively. In order of importance, APACHE II, mean platelet volume (MPV), eosinophil counts (EoC), and C-reactive protein (CRP) showed higher importance for predicting sepsis patient survival, whereas, SOFA, APACHE II, platelet counts (PLTC), and CRP obtained higher importance in the SIRS cohort.

CONCLUSION

By using complete blood count parameters as predictors of ICU patient survival, machine learning models can accurately predict the survival of SIRS and sepsis ICU patients. Interestingly, feature importance highlights the role of CRP and APACHE II in both SIRS and sepsis populations. In addition, MPV and EoC are shown to be important features for the sepsis population only, whereas SOFA and PLTC have higher importance for SIRS patients.

Collapse

Zacometti C, Sammarco G, Massaro A, Lefevre S, Frégière-Salomon A, Lafeuille JL, Candalino IF, Piro R, Tata A, Suman M. Authenticity assessment of ground black pepper by combining headspace gas-chromatography ion mobility spectrometry and machine learning. Food Res Int 2024;179:114023. [PMID: 38342542 DOI: 10.1016/j.foodres.2024.114023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/08/2024] [Accepted: 01/12/2024] [Indexed: 02/13/2024]

Abstract

Currently, the authentication of ground black pepper is a major concern, creating a need for a rapid, highly sensitive and specific detection tool to prevent the introduction of adulterated batches into the food chain. To this aim, head space gas-chromatography ion mobility spectrometry (HS-GC-IMS), combined with machine learning, is tested in this initial, proof-of-concept study. A broad variety of authentic samples originating from eight countries and three continents were collected and spiked with a range of adulterants, both endogenous sub-products and an assortment of exogenous materials. The method is characterized by no sample preparation and requires 20 min for chromatographic separation and ion mobility data acquisition. After an explorative analysis of the data, those were submitted to two different machine learning algorithms (partial least squared discriminant analysis-PLS-DA and support vector machine-SVM). While the PLS-DA model did not provide fully satisfactory performances, the combination of HS-GC-IMS and SVM successfully classified the samples as authentic, exogenously-adulterated or endogenously-adulterated with an overall accuracy of 90 % and 96 % on withheld test set 1 and withheld test set 2, respectively (at a 95 % confidence level). Some limitations, expected to be mitigated by further research, were encountered in the correct classification of endogenously adulterated ground black pepper. Correct categorization of the ground black pepper samples was not adversely affected by the operator or the time span of data collection (the method development and model challenge were carried out by two operators over 6 months of the study, using ground black pepper harvested between 2015 and 2019). Therefore, HS-GC-IMS, coupled to an intelligent tool, is proposed to: (i) aid in industrial decision-making before utilization of a new batch of ground black pepper in the production chain; (ii) reduce the use of time-consuming conventional analyses and; (iii) increase the number of ground black pepper samples analyzed within an industrial quality control frame.

Collapse

Faadiya AN, Widyaningrum R, Arindra PK, Diba SF. The diagnostic performance of impacted third molars in the mandible: A review of deep learning on panoramic radiographs. Saudi Dent J 2024;36:404-412. [PMID: 38525176 PMCID: PMC10960107 DOI: 10.1016/j.sdentj.2023.11.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 03/26/2024] Open

Abstract

Background

Mandibular third molar is prone to impaction, resulting in its inability to erupt into the oral cavity. The radiographic examination is required to support the odontectomy of impacted teeth. The use of computer-aided diagnosis based on deep learning is emerging in the field of medical and dentistry with the advancement of artificial intelligence (AI) technology. This review describes the performance and prospects of deep learning for the detection, classification, and evaluation of third molar-mandibular canal relationships on panoramic radiographs.

Methods

This work was conducted using three databases: PubMed, Google Scholar, and Science Direct. Following the literature selection, 49 articles were reviewed, with the 12 main articles discussed in this review.

Results

Several models of deep learning are currently used for segmentation and classification of third molar impaction with or without the combination of other techniques. Deep learning has demonstrated significant diagnostic performance in identifying mandibular impacted third molars (ITM) on panoramic radiographs, with an accuracy range of 78.91% to 90.23%. Meanwhile, the accuracy of deep learning in determining the relationship between ITM and the mandibular canal (MC) ranges from 72.32% to 99%.

Conclusion

Deep learning-based AI with high performance for the detection, classification, and evaluation of the relationship of ITM to the MC using panoramic radiographs has been developed over the past decade. However, deep learning must be improved using large datasets, and the evaluation of diagnostic performance for deep learning models should be aligned with medical diagnostic test protocols. Future studies involving collaboration among oral radiologists, clinicians, and computer scientists are required to identify appropriate AI development models that are accurate, efficient, and applicable to clinical services.

Collapse

Singhal V, Chou N, Lee J, Yue Y, Liu J, Chock WK, Lin L, Chang YC, Teo EML, Aow J, Lee HK, Chen KH, Prabhakar S. BANKSY unifies cell typing and tissue domain segmentation for scalable spatial omics data analysis. Nat Genet 2024;56:431-441. [PMID: 38413725 PMCID: PMC10937399 DOI: 10.1038/s41588-024-01664-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 01/16/2024] [Indexed: 02/29/2024]

Affiliation(s)

Vipul Singhal Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Nigel Chou Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Joseph Lee Faculty of Science, National University of Singapore, Singapore, Republic of Singapore
Yifei Yue Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore, Republic of Singapore
Jinyue Liu Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Wan Kee Chock Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Li Lin Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Yun-Ching Chang Veranome Biosystems, Mountain View, CA, USA
Erica Mei Ling Teo Veranome Biosystems, Mountain View, CA, USA
Jonathan Aow Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Hwee Kuan Lee Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore School of Computing, National University of Singapore, Singapore, Republic of Singapore Singapore Eye Research Institute, Singapore, Republic of Singapore International Research Laboratory on Artificial Intelligence, Singapore, Republic of Singapore School of Biological Sciences, Nanyang Technological University, Singapore, Republic of Singapore Singapore Institute for Clinical Sciences, Agency for Science, Technology and Research, Singapore, Republic of Singapore
Kok Hao Chen Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore.
Shyam Prabhakar Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore. Population and Global Health, Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Republic of Singapore. Cancer Science Institute of Singapore, National University of Singapore, Singapore, Republic of Singapore.

Collapse

Alipour M, Seok S, Mednick SC, Malerba P. A classification-based generative approach to selective targeting of global slow oscillations during sleep. Front Hum Neurosci 2024;18:1342975. [PMID: 38415278 PMCID: PMC10896842 DOI: 10.3389/fnhum.2024.1342975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 01/30/2024] [Indexed: 02/29/2024] Open

Abstract

Background

Given sleep's crucial role in health and cognition, numerous sleep-based brain interventions are being developed, aiming to enhance cognitive function, particularly memory consolidation, by improving sleep. Research has shown that Transcranial Alternating Current Stimulation (tACS) during sleep can enhance memory performance, especially when used in a closed-loop (cl-tACS) mode that coordinates with sleep slow oscillations (SOs, 0.5-1.5Hz). However, sleep tACS research is characterized by mixed results across individuals, which are often attributed to individual variability.

Objective/Hypothesis

This study targets a specific type of SOs, widespread on the electrode manifold in a short delay ("global SOs"), due to their close relationship with long-term memory consolidation. We propose a model-based approach to optimize cl-tACS paradigms, targeting global SOs not only by considering their temporal properties but also their spatial profile.

Methods

We introduce selective targeting of global SOs using a classification-based approach. We first estimate the current elicited by various stimulation paradigms, and optimize parameters to match currents found in natural sleep during a global SO. Then, we employ an ensemble classifier trained on sleep data to identify effective paradigms. Finally, the best stimulation protocol is determined based on classification performance.

Results

Our study introduces a model-driven cl-tACS approach that specifically targets global SOs, with the potential to extend to other brain dynamics. This method establishes a connection between brain dynamics and stimulation optimization.

Conclusion

Our research presents a novel approach to optimize cl-tACS during sleep, with a focus on targeting global SOs. This approach holds promise for improving cl-tACS not only for global SOs but also for other physiological events, benefiting both research and clinical applications in sleep and cognition.

Collapse

Harkany T, Tretiakov E, Varela L, Jarc J, Rebernik P, Newbold S, Keimpema E, Verkhratsky A, Horvath T, Romanov R. Molecularly stratified hypothalamic astrocytes are cellular foci for obesity. RESEARCH SQUARE 2024:rs.3.rs-3748581. [PMID: 38405925 PMCID: PMC10889077 DOI: 10.21203/rs.3.rs-3748581/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]

Abstract

Astrocytes safeguard the homeostasis of the central nervous system1,2. Despite their prominent morphological plasticity under conditions that challenge the brain's adaptive capacity3-5, the classification of astrocytes, and relating their molecular make-up to spatially devolved neuronal operations that specify behavior or metabolism, remained mostly futile6,7. Although it seems unexpected in the era of single-cell biology, the lack of a major advance in stratifying astrocytes under physiological conditions rests on the incompatibility of 'neurocentric' algorithms that rely on stable developmental endpoints, lifelong transcriptional, neurotransmitter, and neuropeptide signatures for classification6-8 with the dynamic functional states, anatomic allocation, and allostatic plasticity of astrocytes1. Simplistically, therefore, astrocytes are still grouped as 'resting' vs. 'reactive', the latter referring to pathological states marked by various inducible genes3,9,10. Here, we introduced a machine learning-based feature recognition algorithm that benefits from the cumulative power of published single-cell RNA-seq data on astrocytes as a reference map to stepwise eliminate pleiotropic and inducible cellular features. For the healthy hypothalamus, this walk-back approach revealed gene regulatory networks (GRNs) that specified subsets of astrocytes, and could be used as landmarking tools for their anatomical assignment. The core molecular censuses retained by astrocyte subsets were sufficient to stratify them by allostatic competence, chiefly their signaling and metabolic interplay with neurons. Particularly, we found differentially expressed mitochondrial genes in insulin-sensing astrocytes and demonstrated their reciprocal signaling with neurons that work antagonistically within the food intake circuitry. As a proof-of-concept, we showed that disrupting Mfn2 expression in astrocytes reduced their ability to support dynamic circuit reorganization, a time-locked feature of satiety in the hypothalamus, thus leading to obesity in mice. Overall, our results suggest that astrocytes in the healthy brain are fundamentally more heterogeneous than previously thought and topologically mirror the specificity of local neurocircuits.

Collapse

Coverdell TC, Sampson M, Zubirán R, Wolska A, Donato LJ, Meeusen JW, Jaffe AS, Remaley AT. An improved method for estimating low LDL-C based on the enhanced Sampson-NIH equation. Lipids Health Dis 2024;23:43. [PMID: 38331834 PMCID: PMC10851542 DOI: 10.1186/s12944-024-02018-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 01/13/2024] [Indexed: 02/10/2024] Open

Abstract

BACKGROUND

The accurate measurement of Low-density lipoprotein cholesterol (LDL-C) is critical in the decision to utilize the new lipid-lowering therapies like PCSK9-inhibitors (PCSK9i) for high-risk cardiovascular disease patients that do not achieve sufficiently low LDL-C on statin therapy.

OBJECTIVE

To improve the estimation of low LDL-C by developing a new equation that includes apolipoprotein B (apoB) as an independent variable, along with the standard lipid panel test results.

METHODS

Using β-quantification (BQ) as the reference method, which was performed on a large dyslipidemic population (N = 24,406), the following enhanced Sampson-NIH equation (eS LDL-C) was developed by least-square regression analysis: [Formula: see text] RESULTS: The eS LDL-C equation was the most accurate equation for a broad range of LDL-C values based on regression related parameters and the mean absolute difference (mg/dL) from the BQ reference method (eS LDL-C: 4.51, Sampson-NIH equation [S LDL-C]: 6.07; extended Martin equation [eM LDL-C]: 6.64; Friedewald equation [F LDL-C]: 8.3). It also had the best area-under-the-curve accuracy score by Regression Error Characteristic plots for LDL-C < 100 mg/dL (eS LDL-C: 0.953; S LDL-C: 0.920; eM LDL-C: 0.915; F LDL-C: 0.874) and was the best equation for categorizing patients as being below or above the 70 mg/dL LDL-C treatment threshold for adding new lipid-lowering drugs by kappa score analysis when compared to BQ LDL-C for TG < 800 mg/dL (eS LDL-C: 0.870 (0.853-0.887); S LDL-C:0.763 (0.749-0.776); eM LDL-C:0.706 (0.690-0.722); F LDL-C:0.687 (0.672-0.701). Approximately a third of patients with an F LDL-C < 70 mg/dL had falsely low test results, but about 80% were correctly reclassified as higher (≥ 70 mg/dL) by the eS LDL-C equation, making them potentially eligible for PCSK9i treatment. The M LDL-C and S LDL-C equations had less false low results below 70 mg/dL than the F LDL-C equation but reclassification by the eS LDL-C equation still also increased the net number of patients correctly classified.

CONCLUSIONS

The use of the eS LDL-C equation as a confirmatory test improves the identification of high-risk cardiovascular disease patients, who could benefit from new lipid-lowering therapies but have falsely low LDL-C, as determined by the standard LDL-C equations used in current practice.

Collapse

Unlu O, Shin J, Mailly CJ, Oates MF, Tucci MR, Varugheese M, Wagholikar K, Wang F, Scirica BM, Blood AJ, Aronson SJ. Retrieval Augmented Generation Enabled Generative Pre-Trained Transformer 4 (GPT-4) Performance for Clinical Trial Screening. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.02.08.24302376. [PMID: 38370719 PMCID: PMC10871450 DOI: 10.1101/2024.02.08.24302376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]

Abstract

Background

Subject screening is a key aspect of all clinical trials; however, traditionally, it is a labor-intensive and error-prone task, demanding significant time and resources. With the advent of large language models (LLMs) and related technologies, a paradigm shift in natural language processing capabilities offers a promising avenue for increasing both quality and efficiency of screening efforts. This study aimed to test the Retrieval-Augmented Generation (RAG) process enabled Generative Pretrained Transformer Version 4 (GPT-4) to accurately identify and report on inclusion and exclusion criteria for a clinical trial.

Methods

The Co-Operative Program for Implementation of Optimal Therapy in Heart Failure (COPILOT-HF) trial aims to recruit patients with symptomatic heart failure. As part of the screening process, a list of potentially eligible patients is created through an electronic health record (EHR) query. Currently, structured data in the EHR can only be used to determine 5 out of 6 inclusion and 5 out of 17 exclusion criteria. Trained, but non-licensed, study staff complete manual chart review to determine patient eligibility and record their assessment of the inclusion and exclusion criteria. We obtained the structured assessments completed by the study staff and clinical notes for the past two years and developed a workflow of clinical note-based question answering system powered by RAG architecture and GPT-4 that we named RECTIFIER (RAG-Enabled Clinical Trial Infrastructure for Inclusion Exclusion Review). We used notes from 100 patients as a development dataset, 282 patients as a validation dataset, and 1894 patients as a test set. An expert clinician completed a blinded review of patients' charts to answer the eligibility questions and determine the "gold standard" answers. We calculated the sensitivity, specificity, accuracy, and Matthews correlation coefficient (MCC) for each question and screening method. We also performed bootstrapping to calculate the confidence intervals for each statistic.

Results

Both RECTIFIER and study staff answers closely aligned with the expert clinician answers across criteria with accuracy ranging between 97.9% and 100% (MCC 0.837 and 1) for RECTIFIER and 91.7% and 100% (MCC 0.644 and 1) for study staff. RECTIFIER performed better than study staff to determine the inclusion criteria of "symptomatic heart failure" with an accuracy of 97.9% vs 91.7% and an MCC of 0.924 vs 0.721, respectively. Overall, the sensitivity and specificity of determining eligibility for the RECTIFIER was 92.3% (CI) and 93.9% (CI), and study staff was 90.1% (CI) and 83.6% (CI), respectively.

Conclusion

GPT-4 based solutions have the potential to improve efficiency and reduce costs in clinical trial screening. When incorporating new tools such as RECTIFIER, it is important to consider the potential hazards of automating the screening process and set up appropriate mitigation strategies such as final clinician review before patient engagement.

Collapse

Affiliation(s)

Ozan Unlu Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Division of Cardiovascular Medicine, Brigham and Women's Hospital, Boston, MA Department of Biomedical Informatics, Harvard Medical School, Boston, MA Harvard Medical School, Boston, MA
Jiyeon Shin Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Mass General Brigham Personalized Medicine, Cambridge, MA
Charlotte J Mailly Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Mass General Brigham Personalized Medicine, Cambridge, MA
Michael F Oates Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Mass General Brigham Personalized Medicine, Cambridge, MA
Michela R Tucci Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA
Matthew Varugheese Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA
Kavishwar Wagholikar Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Research Information Science and Computing, Mass General Brigham, Somerville, MA
Fei Wang Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Mass General Brigham Personalized Medicine, Cambridge, MA
Benjamin M Scirica Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Division of Cardiovascular Medicine, Brigham and Women's Hospital, Boston, MA Harvard Medical School, Boston, MA
Alexander J Blood Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Division of Cardiovascular Medicine, Brigham and Women's Hospital, Boston, MA Harvard Medical School, Boston, MA
Samuel J Aronson Accelerator for Clinical Transformation, Brigham and Women's Hospital, Boston, MA Mass General Brigham Personalized Medicine, Cambridge, MA

Collapse

Guha S, Ibrahim A, Wu Q, Geng P, Chou Y, Yang H, Ma J, Lu L, Wang D, Schwartz LH, Xie CM, Zhao B. Machine learning-based identification of contrast-enhancement phase of computed tomography scans. PLoS One 2024;19:e0294581. [PMID: 38306329 PMCID: PMC10836663 DOI: 10.1371/journal.pone.0294581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 11/04/2023] [Indexed: 02/04/2024] Open

Jia J, Wu G, Li M. iGly-IDN: Identifying Lysine Glycation Sites in Proteins Based on Improved DenseNet. J Comput Biol 2024;31:161-174. [PMID: 38016151 DOI: 10.1089/cmb.2023.0112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023] Open

Lee S, Lee I. Comprehensive assessment of machine learning methods for diagnosing gastrointestinal diseases through whole metagenome sequencing data. Gut Microbes 2024;16:2375679. [PMID: 38972064 PMCID: PMC11229738 DOI: 10.1080/19490976.2024.2375679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 06/28/2024] [Indexed: 07/09/2024] Open

Karthikeyan S, Vazquez-Zapien GJ, Martinez-Cuazitl A, Delgado-Macuil RJ, Rivera-Alatorre DE, Garibay-Gonzalez F, Delgado-Gonzalez J, Valencia-Trujillo D, Guerrero-Ruiz M, Atriano-Colorado C, Lopez-Reyes A, Lopez-Mezquita DJ, Mata-Miranda MM. Two-trace two-dimensional correlation spectra (2T2D-COS) analysis using FTIR spectra to monitor the immune response by COVID-19. J Mol Med (Berl) 2024;102:53-67. [PMID: 37947852 DOI: 10.1007/s00109-023-02390-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 09/22/2023] [Accepted: 10/20/2023] [Indexed: 11/12/2023]

Abstract

There is a growing trend in using saliva for SARS-CoV-2 detection with reasonable accuracy. We have studied the responses of IgA, IgG, and IgM in human saliva by directly comparing disease with control analyzing two-trace two-dimensional correlation spectra (2T2D-COS) employing Fourier transform infrared (FTIR) spectra. It explores the molecular-level variation between control and COVID-19 saliva samples. The advantage of 2T2D spectra is that it helps in discriminating remarkably subtle features between two simple pairs of spectra. It gives spectral information from highly overlapped bands associated with different systems. The clinical findings from 2T2D show the decrease of IgG and IgM salivary antibodies in the 50, 60, 65, and 75-years COVID-19 samples. Among the various COVID-19 populations studied the female 30-years group reveals defense mechanisms exhibited by IgM and IgA. Lipids and fatty acids decrease, resulting in lipid oxidation due to the SARS-CoV-2 in the samples studied. Study shows salivary thiocyanate plays defense against SARS-CoV-2 in the male population in 25 and 35 age groups. The receiver operation characteristics statistical method shows a sensitivity of 98% and a specificity of 94% for the samples studied. The measure of accuracy computed as F score and G score has a high value, supporting our study's validation. Thus, 2T2D-COS analysis can potentially monitor the progression of immunoglobulin's response function to COVID-19 with reasonable accuracy, which could help diagnose clinical trials. KEY MESSAGES: The molecular profile of salivary antibodies is well resolved and identified from 2T2D-COS FTIR spectra. The IgG antibody plays a significant role in the defense mechanism against SARS-CoV-2 in 25-40 years. 2T2D-COS reveals the absence of salivary thiocyanate in the 40-75 years COVID-19 population. The receiver operation characteristic (ROC) analysis validates our study with high sensitivity and specificity.

Collapse

Affiliation(s)

Sivakumaran Karthikeyan Department of Physics, Dr. Ambedkar Government Arts College, Chennai, Tamil Nadu, 600039, India.
Gustavo J Vazquez-Zapien Centro de Investigación y Desarrollo del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, 11400, Mexico. Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico.
Adriana Martinez-Cuazitl Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico Escuela Nacional de Medicina y Homeopatía, Instituto Politécnico Nacional, Mexico City, 07320, Mexico
Raul J Delgado-Macuil Centro de Investigación en Biotecnología Aplicada, Instituto Politécnico Nacional, Tlaxcala, 90700, Mexico
Daniel E Rivera-Alatorre Centro de Investigación y Desarrollo del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, 11400, Mexico
Francisco Garibay-Gonzalez Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico
Josemaria Delgado-Gonzalez Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico
Daniel Valencia-Trujillo Servicio de Microbiología Clínica, Instituto Nacional de Enfermedades Respiratorias, Mexico City, 14080, Mexico
Melissa Guerrero-Ruiz Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico
Consuelo Atriano-Colorado Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico
Alberto Lopez-Reyes Laboratorio de Gerociencias, Instituto Nacional de Rehabilitación Luis Guillermo Ibarra Ibarra, Secretaría de Salud, Mexico City, 14389, Mexico
Dante J Lopez-Mezquita Hospital Central Militar, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico
Monica M Mata-Miranda Escuela Militar de Medicina, Centro Militar de Ciencias de la Salud, Secretaría de la Defensa Nacional, Mexico City, 11200, Mexico.

Collapse

Almeida RL, Maltarollo VG, Coelho FGF. Overcoming class imbalance in drug discovery problems: Graph neural networks and balancing approaches. J Mol Graph Model 2024;126:108627. [PMID: 37801808 DOI: 10.1016/j.jmgm.2023.108627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 09/12/2023] [Accepted: 09/12/2023] [Indexed: 10/08/2023]

Zavorsky GS, Agostoni P. Two is better than one: the double diffusion technique in classifying heart failure. ERJ Open Res 2024;10:00644-2023. [PMID: 38226067 PMCID: PMC10789268 DOI: 10.1183/23120541.00644-2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/15/2023] [Indexed: 01/17/2024] Open

Guan J, Yao L, Chung CR, Xie P, Zhang Y, Deng J, Chiang YC, Lee TY. Predicting Anti-inflammatory Peptides by Ensemble Machine Learning and Deep Learning. J Chem Inf Model 2023;63:7886-7898. [PMID: 38054927 DOI: 10.1021/acs.jcim.3c01602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]

Cassidy RM, Flores EM, Trinh Nguyen AK, Cheruvu SS, Uribe RA, Krachler AM, Odem MA. Systematic analysis of proximal midgut- and anorectal-originating contractions in larval zebrafish using event feature detection and supervised machine learning algorithms. Neurogastroenterol Motil 2023;35:e14675. [PMID: 37743702 PMCID: PMC10841157 DOI: 10.1111/nmo.14675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 07/16/2023] [Accepted: 08/28/2023] [Indexed: 09/26/2023]

Abstract

BACKGROUND

Zebrafish larvae are translucent, allowing in vivo analysis of gut development and physiology, including gut motility. While recent progress has been made in measuring gut motility in larvae, challenges remain which can influence results, such as how data are interpreted, opportunities for technical user error, and inconsistencies in methods.

METHODS

To overcome these challenges, we noninvasively introduced Nile Red fluorescent dye to fill the intraluminal gut space in zebrafish larvae and collected serial confocal microscopic images of gut fluorescence. We automated the detection of fluorescent-contrasted contraction events against the median-subtracted signal and compared it to manually annotated gut contraction events across anatomically defined gut regions. Supervised machine learning (multiple logistic regression) was then used to discriminate between true contraction events and noise. To demonstrate, we analyzed motility in larvae under control and reserpine-treated conditions. We also used automated event detection analysis to compare unfed and fed larvae.

KEY RESULTS

Automated analysis retained event features for proximal midgut-originating retrograde and anterograde contractions and anorectal-originating retrograde contractions. While manual annotation showed reserpine disrupted gut motility, machine learning only achieved equivalent contraction discrimination in controls and failed to accurately identify contractions after reserpine due to insufficient intraluminal fluorescence. Automated analysis also showed feeding had no effect on the frequency of anorectal-originating contractions.

CONCLUSIONS & INFERENCES

Automated event detection analysis rapidly and accurately annotated contraction events, including the previously neglected phenomenon of anorectal contractions. However, challenges remain to discriminate contraction events based on intraluminal fluorescence under treatment conditions that disrupt functional motility.

Collapse

Michelsen C, Jørgensen CC, Heltberg M, Jensen MH, Lucchetti A, Petersen PB, Petersen T, Kehlet H, Madsen F, Hansen TB, Gromov K, Jakobsen T, Varnum C, Overgaard S, Rathsach M, Hansen L. Machine-learning vs. logistic regression for preoperative prediction of medical morbidity after fast-track hip and knee arthroplasty-a comparative study. BMC Anesthesiol 2023;23:391. [PMID: 38030979 PMCID: PMC10685559 DOI: 10.1186/s12871-023-02354-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 11/21/2023] [Indexed: 12/01/2023] Open

Abstract

BACKGROUND

Machine-learning models may improve prediction of length of stay (LOS) and morbidity after surgery. However, few studies include fast-track programs, and most rely on administrative coding with limited follow-up and information on perioperative care. This study investigates potential benefits of a machine-learning model for prediction of postoperative morbidity in fast-track total hip (THA) and knee arthroplasty (TKA).

METHODS

Cohort study in consecutive unselected primary THA/TKA between 2014-2017 from seven Danish centers with established fast-track protocols. Preoperative comorbidity and prescribed medication were recorded prospectively and information on length of stay and readmissions was obtained through the Danish National Patient Registry and medical records. We used a machine-learning model (Boosted Decision Trees) based on boosted decision trees with 33 preoperative variables for predicting "medical" morbidity leading to LOS > 4 days or 90-days readmissions and compared to a logistical regression model based on the same variables. We also evaluated two parsimonious models, using the ten most important variables in the full machine-learning and logistic regression models. Data collected between 2014-2016 (n:18,013) was used for model training and data from 2017 (n:3913) was used for testing. Model performances were analyzed using precision, area under receiver operating (AUROC) and precision recall curves (AUPRC), as well as the Mathews Correlation Coefficient. Variable importance was analyzed using Shapley Additive Explanations values.

RESULTS

Using a threshold of 20% "risk-patients" (n:782), precision, AUROC and AUPRC were 13.6%, 76.3% and 15.5% vs. 12.4%, 74.7% and 15.6% for the machine-learning and logistic regression model, respectively. The parsimonious machine-learning model performed better than the full logistic regression model. Of the top ten variables, eight were shared between the machine-learning and logistic regression models, but with a considerable age-related variation in importance of specific types of medication.

CONCLUSION

A machine-learning model using preoperative characteristics and prescriptions slightly improved identification of patients in high-risk of "medical" complications after fast-track THA and TKA compared to a logistic regression model. Such algorithms could help find a manageable population of patients who may benefit most from intensified perioperative care.

Collapse

Zhang Y, Aaronson KD, Gryak J, Wittrup E, Minoccheri C, Golbus JR, Najarian K. Predicting need for heart failure advanced therapies using an interpretable tropical geometry-based fuzzy neural network. PLoS One 2023;18:e0295016. [PMID: 38015947 PMCID: PMC10684094 DOI: 10.1371/journal.pone.0295016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 11/13/2023] [Indexed: 11/30/2023] Open

Abstract

BACKGROUND

Timely referral for advanced therapies (i.e., heart transplantation, left ventricular assist device) is critical for ensuring optimal outcomes for heart failure patients. Using electronic health records, our goal was to use data from a single hospitalization to develop an interpretable clinical decision-making system for predicting the need for advanced therapies at the subsequent hospitalization.

METHODS

Michigan Medicine heart failure patients from 2013-2021 with a left ventricular ejection fraction ≤ 35% and at least two heart failure hospitalizations within one year were used to train an interpretable machine learning model constructed using fuzzy logic and tropical geometry. Clinical knowledge was used to initialize the model. The performance and robustness of the model were evaluated with the mean and standard deviation of the area under the receiver operating curve (AUC), the area under the precision-recall curve (AUPRC), and the F1 score of the ensemble. We inferred membership functions from the model for continuous clinical variables, extracted decision rules, and then evaluated their relative importance.

RESULTS

The model was trained and validated using data from 557 heart failure hospitalizations from 300 patients, of whom 193 received advanced therapies. The mean (standard deviation) of AUC, AUPRC, and F1 scores of the proposed model initialized with clinical knowledge was 0.747 (0.080), 0.642 (0.080), and 0.569 (0.067), respectively, showing superior predictive performance or increased interpretability over other machine learning methods. The model learned critical risk factors predicting the need for advanced therapies in the subsequent hospitalization. Furthermore, our model displayed transparent rule sets composed of these critical concepts to justify the prediction.

CONCLUSION

These results demonstrate the ability to successfully predict the need for advanced heart failure therapies by generating transparent and accessible clinical rules although further research is needed to prospectively validate the risk factors identified by the model.

Collapse

Hülpüsch C, Rauer L, Nussbaumer T, Schwierzeck V, Bhattacharyya M, Erhart V, Traidl-Hoffmann C, Reiger M, Neumann AU. Benchmarking MicrobIEM - a user-friendly tool for decontamination of microbiome sequencing data. BMC Biol 2023;21:269. [PMID: 37996810 PMCID: PMC10666409 DOI: 10.1186/s12915-023-01737-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 10/16/2023] [Indexed: 11/25/2023] Open

Affiliation(s)

Claudia Hülpüsch Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany Chair of Environmental Medicine, Technical University of Munich, Munich, Germany CK CARE, Christine Kühne Center for Allergy Research and Education, Davos, Switzerland
Luise Rauer Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany Chair of Environmental Medicine, Technical University of Munich, Munich, Germany Institute of Environmental Medicine, Helmholtz Munich, Augsburg, Germany
Thomas Nussbaumer Institute of Environmental Medicine, Helmholtz Munich, Augsburg, Germany
Vera Schwierzeck Institute of Environmental Medicine, Helmholtz Munich, Augsburg, Germany Institute of Hygiene, University Hospital Muenster, Muenster, Germany
Madhumita Bhattacharyya Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany Chair of Environmental Medicine, Technical University of Munich, Munich, Germany
Veronika Erhart Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany
Claudia Traidl-Hoffmann Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany Chair of Environmental Medicine, Technical University of Munich, Munich, Germany CK CARE, Christine Kühne Center for Allergy Research and Education, Davos, Switzerland Institute of Environmental Medicine, Helmholtz Munich, Augsburg, Germany ZIEL - Institute for Food & Health, Technical University of Munich, Freising-Weihenstephan, Germany
Matthias Reiger Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany Chair of Environmental Medicine, Technical University of Munich, Munich, Germany Institute of Environmental Medicine, Helmholtz Munich, Augsburg, Germany
Avidan U Neumann Environmental Medicine, Faculty of Medicine, University of Augsburg, Stenglinstr. 2, 86156, Augsburg, Germany. Institute of Environmental Medicine, Helmholtz Munich, Augsburg, Germany.

Collapse

Tsuyuzaki K, Ishii M, Nikaido I. Sctensor detects many-to-many cell-cell interactions from single cell RNA-sequencing data. BMC Bioinformatics 2023;24:420. [PMID: 37936079 PMCID: PMC10631077 DOI: 10.1186/s12859-023-05490-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/21/2023] [Indexed: 11/09/2023] Open

Phan TV, Nguyen VTV, Le MT, Nguyen BGD, Vu TT, Thai KM. Identification of efflux pump inhibitors for Pseudomonas aeruginosa MexAB-OprM via ligand-based pharmacophores, 2D-QSAR, molecular docking, and molecular dynamics approaches. Mol Divers 2023:10.1007/s11030-023-10758-9. [PMID: 37919619 DOI: 10.1007/s11030-023-10758-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 10/24/2023] [Indexed: 11/04/2023]

Chicco D, Haupt R, Garaventa A, Uva P, Luksch R, Cangelosi D. Computational intelligence analysis of high-risk neuroblastoma patient health records reveals time to maximum response as one of the most relevant factors for outcome prediction. Eur J Cancer 2023;193:113291. [PMID: 37708628 DOI: 10.1016/j.ejca.2023.113291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 07/24/2023] [Accepted: 08/09/2023] [Indexed: 09/16/2023]

Henriques SC, Paixão P, Almeida L, Silva NE. Predictive Potential of C_max Bioequivalence in Pilot Bioavailability/Bioequivalence Studies, through the Alternative ƒ₂ Similarity Factor Method. Pharmaceutics 2023;15:2498. [PMID: 37896259 PMCID: PMC10610255 DOI: 10.3390/pharmaceutics15102498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 10/08/2023] [Accepted: 10/18/2023] [Indexed: 10/29/2023] Open

Abstract

Pilot bioavailability/bioequivalence (BA/BE) studies are downsized trials that can be conducted prior to the definitive pivotal trial. In these trials, 12 to 18 subjects are usually enrolled, although, in principle, a sample size is not formally calculated. In a previous work, authors recommended the use of an alternative approach to the average bioequivalence methodology to evaluate pilot studies' data, using the geometric mean (Gmean) ƒ2 factor with a cut off of 35, which has shown to be an appropriate method to assess the potential bioequivalence for the maximum observed concentration (Cmax) metric under the assumptions of a true Test-to-Reference Geometric Mean Ratio (GMR) of 100% and an inter-occasion variability (IOV) in the range of 10% to 45%. In this work, the authors evaluated the proposed ƒ2 factor in comparison with the standard average bioequivalence in more extreme scenarios, using a true GMR of 90% or 111% for truly bioequivalent formulations, and 80% or 125% for truly bioinequivalent formulations, in order to better derive conclusions on the potential of this analysis method. Several scenarios of pilot BA/BE crossover studies were simulated through population pharmacokinetic modelling, accounting for different IOV levels. A redefined decision tree is proposed, suggesting a fixed sample size of 20 subjects for pilot studies in the case of intra-subject coefficient of variation (ISCV%) > 20% or unknown variability, and suggesting the assessment of study results through the average bioequivalence analysis, and additionally through Gmean ƒ2 factor method in the case of the 90% confidence interval (CI) for GMR is outside the regulatory acceptance bioequivalence interval of [80.00-125.00]%. Using this alternative approach, the certainty levels to proceed with pivotal studies, depending on Gmean ƒ2 values and variability scenarios tested (20-60% IOV), were assessed, which is expected to be helpful in terms of the decision to proceed with pivotal bioequivalence studies.

Collapse

Foody GM. Challenges in the real world use of classification accuracy metrics: From recall and precision to the Matthews correlation coefficient. PLoS One 2023;18:e0291908. [PMID: 37792898 PMCID: PMC10550141 DOI: 10.1371/journal.pone.0291908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Accepted: 09/07/2023] [Indexed: 10/06/2023] Open

Abstract

The accuracy of a classification is fundamental to its interpretation, use and ultimately decision making. Unfortunately, the apparent accuracy assessed can differ greatly from the true accuracy. Mis-estimation of classification accuracy metrics and associated mis-interpretations are often due to variations in prevalence and the use of an imperfect reference standard. The fundamental issues underlying the problems associated with variations in prevalence and reference standard quality are revisited here for binary classifications with particular attention focused on the use of the Matthews correlation coefficient (MCC). A key attribute claimed of the MCC is that a high value can only be attained when the classification performed well on both classes in a binary classification. However, it is shown here that the apparent magnitude of a set of popular accuracy metrics used in fields such as computer science medicine and environmental science (Recall, Precision, Specificity, Negative Predictive Value, J, F1, likelihood ratios and MCC) and one key attribute (prevalence) were all influenced greatly by variations in prevalence and use of an imperfect reference standard. Simulations using realistic values for data quality in applications such as remote sensing showed each metric varied over the range of possible prevalence and at differing levels of reference standard quality. The direction and magnitude of accuracy metric mis-estimation were a function of prevalence and the size and nature of the imperfections in the reference standard. It was evident that the apparent MCC could be substantially under- or over-estimated. Additionally, a high apparent MCC arose from an unquestionably poor classification. As with some other metrics of accuracy, the utility of the MCC may be overstated and apparent values need to be interpreted with caution. Apparent accuracy and prevalence values can be mis-leading and calls for the issues to be recognised and addressed should be heeded.

Collapse

Ghorbanali Z, Zare-Mirakabad F, Salehi N, Akbari M, Masoudi-Nejad A. DrugRep-HeSiaGraph: when heterogenous siamese neural network meets knowledge graphs for drug repurposing. BMC Bioinformatics 2023;24:374. [PMID: 37789314 PMCID: PMC10548718 DOI: 10.1186/s12859-023-05479-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 09/12/2023] [Indexed: 10/05/2023] Open