Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Heydari AA, Davalos OA, Zhao L, Hoyer KK, Sindi SS. ACTIVA: realistic single-cell RNA-seq generation with automatic cell-type identification using introspective variational autoencoders. Bioinformatics 2022;38:2194-2201. [PMID: 35179571 PMCID: PMC9004654 DOI: 10.1093/bioinformatics/btac095] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 01/19/2022] [Accepted: 02/15/2022] [Indexed: 02/04/2023] Open

For:	Heydari AA, Davalos OA, Zhao L, Hoyer KK, Sindi SS. ACTIVA: realistic single-cell RNA-seq generation with automatic cell-type identification using introspective variational autoencoders. Bioinformatics 2022;38:2194-2201. [PMID: 35179571 PMCID: PMC9004654 DOI: 10.1093/bioinformatics/btac095] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 01/19/2022] [Accepted: 02/15/2022] [Indexed: 02/04/2023] Open

Number

Cited by Other Article(s)

Kong T, Yu T, Zhao J, Hu Z, Xiong N, Wan J, Dong X, Pan Y, Zheng H, Zhang L. scGAA: a general gated axial-attention model for accurate cell-type annotation of single-cell RNA-seq data. Sci Rep 2024;14:22308. [PMID: 39333739 PMCID: PMC11436728 DOI: 10.1038/s41598-024-73356-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2024] [Accepted: 09/17/2024] [Indexed: 09/30/2024] Open

Qi Y, Wang X, Qin LX. Optimizing Sample Size for Supervised Machine Learning with Bulk Transcriptomic Sequencing: A Learning Curve Approach. ARXIV 2024:arXiv:2409.06180v1. [PMID: 39314504 PMCID: PMC11419172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]

Friedrich VD, Pennitz P, Wyler E, Adler JM, Postmus D, Müller K, Teixeira Alves LG, Prigann J, Pott F, Vladimirova D, Hoefler T, Goekeri C, Landthaler M, Goffinet C, Saliba AE, Scholz M, Witzenrath M, Trimpert J, Kirsten H, Nouailles G. Neural network-assisted humanisation of COVID-19 hamster transcriptomic data reveals matching severity states in human disease. EBioMedicine 2024:105312. [PMID: 39317638 DOI: 10.1016/j.ebiom.2024.105312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 08/12/2024] [Accepted: 08/13/2024] [Indexed: 09/26/2024] Open

Abstract

BACKGROUND

Translating findings from animal models to human disease is essential for dissecting disease mechanisms, developing and testing precise therapeutic strategies. The coronavirus disease 2019 (COVID-19) pandemic has highlighted this need, particularly for models showing disease severity-dependent immune responses.

METHODS

Single-cell transcriptomics (scRNAseq) is well poised to reveal similarities and differences between species at the molecular and cellular level with unprecedented resolution. However, computational methods enabling detailed matching are still scarce. Here, we provide a structured scRNAseq-based approach that we applied to scRNAseq from blood leukocytes originating from humans and hamsters affected with moderate or severe COVID-19.

FINDINGS

Integration of data from patients with COVID-19 with two hamster models that develop moderate (Syrian hamster, Mesocricetus auratus) or severe (Roborovski hamster, Phodopus roborovskii) disease revealed that most cellular states are shared across species. A neural network-based analysis using variational autoencoders quantified the overall transcriptomic similarity across species and severity levels, showing highest similarity between neutrophils of Roborovski hamsters and patients with severe COVID-19, while Syrian hamsters better matched patients with moderate disease, particularly in classical monocytes. We further used transcriptome-wide differential expression analysis to identify which disease stages and cell types display strongest transcriptional changes.

INTERPRETATION

Consistently, hamsters' response to COVID-19 was most similar to humans in monocytes and neutrophils. Disease-linked pathways found in all species specifically related to interferon response or inhibition of viral replication. Analysis of candidate genes and signatures supported the results. Our structured neural network-supported workflow could be applied to other diseases, allowing better identification of suitable animal models with similar pathomechanisms across species.

FUNDING

This work was supported by German Federal Ministry of Education and Research, (BMBF) grant IDs: 01ZX1304B, 01ZX1604B, 01ZX1906A, 01ZX1906B, 01KI2124, 01IS18026B and German Research Foundation (DFG) grant IDs: 14933180, 431232613.

Collapse

Affiliation(s)

Vincent D Friedrich University of Leipzig, Institute for Medical Informatics, Statistics, and Epidemiology, Leipzig, Germany; Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Leipzig, Germany
Peter Pennitz Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Infectious Diseases, Respiratory Medicine and Critical Care, Berlin, Germany
Emanuel Wyler Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
Julia M Adler Freie Universität Berlin, Institut für Virologie, Berlin, Germany
Dylan Postmus Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Institute of Virology, Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany; Liverpool School of Tropical Medicine, Department of Tropical Disease Biology, Liverpool, United Kingdom
Kristina Müller University of Leipzig, Institute for Medical Informatics, Statistics, and Epidemiology, Leipzig, Germany
Luiz Gustavo Teixeira Alves Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
Julia Prigann Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Institute of Virology, Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany; Gladstone Institutes, San Francisco, USA
Fabian Pott Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Institute of Virology, Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
Daria Vladimirova Freie Universität Berlin, Institut für Virologie, Berlin, Germany
Thomas Hoefler Freie Universität Berlin, Institut für Virologie, Berlin, Germany
Cengiz Goekeri Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Infectious Diseases, Respiratory Medicine and Critical Care, Berlin, Germany; Cyprus International University, Faculty of Medicine, Nicosia, Cyprus
Markus Landthaler Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany; Humboldt-Universität zu Berlin, Institut fuer Biologie, Berlin, Germany
Christine Goffinet Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Institute of Virology, Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany; Liverpool School of Tropical Medicine, Department of Tropical Disease Biology, Liverpool, United Kingdom
Antoine-Emmanuel Saliba Faculty of Medicine, Institute of Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, Germany; Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Center for Infection Research (HZI), Würzburg, Germany
Markus Scholz University of Leipzig, Institute for Medical Informatics, Statistics, and Epidemiology, Leipzig, Germany; University of Leipzig, Faculty of Mathematics and Computer Science, Leipzig, Germany
Martin Witzenrath Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Infectious Diseases, Respiratory Medicine and Critical Care, Berlin, Germany; German Center for Lung Research (DZL), Berlin, Germany
Jakob Trimpert Freie Universität Berlin, Institut für Virologie, Berlin, Germany
Holger Kirsten University of Leipzig, Institute for Medical Informatics, Statistics, and Epidemiology, Leipzig, Germany.
Geraldine Nouailles Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Infectious Diseases, Respiratory Medicine and Critical Care, Berlin, Germany.

Collapse

Jin W, Xia Y, Thela SR, Liu Y, Chen L. In silico generation and augmentation of regulatory variants from massively parallel reporter assay using conditional variational autoencoder. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600715. [PMID: 38979263 PMCID: PMC11230389 DOI: 10.1101/2024.06.25.600715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]

Vallevik VB, Babic A, Marshall SE, Elvatun S, Brøgger HMB, Alagaratnam S, Edwin B, Veeraragavan NR, Befring AK, Nygård JF. Can I trust my fake data - A comprehensive quality assessment framework for synthetic tabular data in healthcare. Int J Med Inform 2024;185:105413. [PMID: 38493547 DOI: 10.1016/j.ijmedinf.2024.105413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 02/17/2024] [Accepted: 03/11/2024] [Indexed: 03/19/2024]

Abstract

BACKGROUND

Ensuring safe adoption of AI tools in healthcare hinges on access to sufficient data for training, testing and validation. Synthetic data has been suggested in response to privacy concerns and regulatory requirements and can be created by training a generator on real data to produce a dataset with similar statistical properties. Competing metrics with differing taxonomies for quality evaluation have been proposed, resulting in a complex landscape. Optimising quality entails balancing considerations that make the data fit for use, yet relevant dimensions are left out of existing frameworks.

METHOD

We performed a comprehensive literature review on the use of quality evaluation metrics on synthetic data within the scope of synthetic tabular healthcare data using deep generative methods. Based on this and the collective team experiences, we developed a conceptual framework for quality assurance. The applicability was benchmarked against a practical case from the Dutch National Cancer Registry.

CONCLUSION

We present a conceptual framework for quality assuranceof synthetic data for AI applications in healthcare that aligns diverging taxonomies, expands on common quality dimensions to include the dimensions of Fairness and Carbon footprint, and proposes stages necessary to support real-life applications. Building trust in synthetic data by increasing transparency and reducing the safety risk will accelerate the development and uptake of trustworthy AI tools for the benefit of patients.

DISCUSSION

Despite the growing emphasis on algorithmic fairness and carbon footprint, these metrics were scarce in the literature review. The overwhelming focus was on statistical similarity using distance metrics while sequential logic detection was scarce. A consensus-backed framework that includes all relevant quality dimensions can provide assurance for safe and responsible real-life applications of synthetic data. As the choice of appropriate metrics are highly context dependent, further research is needed on validation studies to guide metric choices and support the development of technical standards.

Collapse

Selvarajoo K, Maurer-Stroh S. Towards multi-omics synthetic data integration. Brief Bioinform 2024;25:bbae213. [PMID: 38711370 PMCID: PMC11074586 DOI: 10.1093/bib/bbae213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 04/19/2024] [Indexed: 05/08/2024] Open

Salimi A, Jang JH, Lee JY. Leveraging attention-enhanced variational autoencoders: Novel approach for investigating latent space of aptamer sequences. Int J Biol Macromol 2024;255:127884. [PMID: 37926303 DOI: 10.1016/j.ijbiomac.2023.127884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 10/27/2023] [Accepted: 11/02/2023] [Indexed: 11/07/2023]

Erfanian N, Heydari AA, Feriz AM, Iañez P, Derakhshani A, Ghasemigol M, Farahpour M, Razavi SM, Nasseri S, Safarpour H, Sahebkar A. Deep learning applications in single-cell genomics and transcriptomics data analysis. Biomed Pharmacother 2023;165:115077. [PMID: 37393865 DOI: 10.1016/j.biopha.2023.115077] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 06/22/2023] [Accepted: 06/23/2023] [Indexed: 07/04/2023] Open

Davalos OA, Heydari AA, Fertig EJ, Sindi SS, Hoyer KK. Boosting Single-Cell RNA Sequencing Analysis with Simple Neural Attention. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.29.542760. [PMID: 37398136 PMCID: PMC10312486 DOI: 10.1101/2023.05.29.542760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]

Liu C, Huang H, Yang P. Multi-task learning from multimodal single-cell omics with Matilda. Nucleic Acids Res 2023;51:e45. [PMID: 36912104 PMCID: PMC10164589 DOI: 10.1093/nar/gkad157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 01/28/2023] [Accepted: 02/21/2023] [Indexed: 03/14/2023] Open

Jiao L, Wang G, Dai H, Li X, Wang S, Song T. scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings. Biomolecules 2023;13:biom13040611. [PMID: 37189359 DOI: 10.3390/biom13040611] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 03/05/2023] [Accepted: 03/10/2023] [Indexed: 03/31/2023] Open

Heydari AA, Sindi SS. Deep learning in spatial transcriptomics: Learning from the next next-generation sequencing. BIOPHYSICS REVIEWS 2023;4:011306. [PMID: 38505815 PMCID: PMC10903438 DOI: 10.1063/5.0091135] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 12/19/2022] [Indexed: 03/21/2024]

Abstract

Spatial transcriptomics (ST) technologies are rapidly becoming the extension of single-cell RNA sequencing (scRNAseq), holding the potential of profiling gene expression at a single-cell resolution while maintaining cellular compositions within a tissue. Having both expression profiles and tissue organization enables researchers to better understand cellular interactions and heterogeneity, providing insight into complex biological processes that would not be possible with traditional sequencing technologies. Data generated by ST technologies are inherently noisy, high-dimensional, sparse, and multi-modal (including histological images, count matrices, etc.), thus requiring specialized computational tools for accurate and robust analysis. However, many ST studies currently utilize traditional scRNAseq tools, which are inadequate for analyzing complex ST datasets. On the other hand, many of the existing ST-specific methods are built upon traditional statistical or machine learning frameworks, which have shown to be sub-optimal in many applications due to the scale, multi-modality, and limitations of spatially resolved data (such as spatial resolution, sensitivity, and gene coverage). Given these intricacies, researchers have developed deep learning (DL)-based models to alleviate ST-specific challenges. These methods include new state-of-the-art models in alignment, spatial reconstruction, and spatial clustering, among others. However, DL models for ST analysis are nascent and remain largely underexplored. In this review, we provide an overview of existing state-of-the-art tools for analyzing spatially resolved transcriptomics while delving deeper into the DL-based approaches. We discuss the new frontiers and the open questions in this field and highlight domains in which we anticipate transformational DL applications.

Collapse

Liu Q, Liang Y, Wang D, Li J. LFSC: A linear fast semi-supervised clustering algorithm that integrates reference-bulk and single-cell transcriptomes. Front Genet 2022;13:1068075. [PMID: 36531230 PMCID: PMC9754124 DOI: 10.3389/fgene.2022.1068075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 11/09/2022] [Indexed: 08/29/2023] Open

Brendel M, Su C, Bai Z, Zhang H, Elemento O, Wang F. Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022;20:814-835. [PMID: 36528240 PMCID: PMC10025684 DOI: 10.1016/j.gpb.2022.11.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 08/17/2022] [Accepted: 11/24/2022] [Indexed: 12/23/2022]