Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Yang ZK, Pan L, Zhang Y, Luo H, Gao F. Data-driven identification of SARS-CoV-2 subpopulations using PhenoGraph and binary-coded genomic data. Brief Bioinform 2021;22:bbab307. [PMID: 34382087 PMCID: PMC8385964 DOI: 10.1093/bib/bbab307] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 07/01/2021] [Accepted: 07/17/2021] [Indexed: 01/08/2023] Open

For:	Yang ZK, Pan L, Zhang Y, Luo H, Gao F. Data-driven identification of SARS-CoV-2 subpopulations using PhenoGraph and binary-coded genomic data. Brief Bioinform 2021;22:bbab307. [PMID: 34382087 PMCID: PMC8385964 DOI: 10.1093/bib/bbab307] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 07/01/2021] [Accepted: 07/17/2021] [Indexed: 01/08/2023] Open

Number

Cited by Other Article(s)

Chen J, Ionita M, Feng Y, Lu Y, Orzechowski P, Garai S, Hassinger K, Bao J, Wen J, Duong-Tran D, Wagenaar J, McKeague ML, Painter MM, Mathew D, Pattekar A, Meyer NJ, Wherry EJ, Greenplate AR, Shen L. Automated Cytometric Gating with Human-Level Performance Using Bivariate Segmentation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.06.592739. [PMID: 38766268 PMCID: PMC11100732 DOI: 10.1101/2024.05.06.592739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]

Abstract

Recent advances in cytometry technology have enabled high-throughput data collection with multiple single-cell protein expression measurements. The significant biological and technical variance between samples in cytometry has long posed a formidable challenge during the gating process, especially for the initial gates which deal with unpredictable events, such as debris and technical artifacts. Even with the same experimental machine and protocol, the target population, as well as the cell population that needs to be excluded, may vary across different measurements. To address this challenge and mitigate the labor-intensive manual gating process, we propose a deep learning framework UNITO to rigorously identify the hierarchical cytometric subpopulations. The UNITO framework transformed a cell-level classification task into an image-based semantic segmentation problem. For reproducibility purposes, the framework was applied to three independent cohorts and successfully detected initial gates that were required to identify single cellular events as well as subsequent cell gates. We validated the UNITO framework by comparing its results with previous automated methods and the consensus of at least four experienced immunologists. UNITO outperformed existing automated methods and differed from human consensus by no more than each individual human. Most critically, UNITO framework functions as a fully automated pipeline after training and does not require human hints or prior knowledge. Unlike existing multi-channel classification or clustering pipelines, UNITO can reproduce a similar contour compared to manual gating for each intermediate gating to achieve better interpretability and provide post hoc visual inspection. Beyond acting as a pioneering framework that uses image segmentation to do auto-gating, UNITO gives a fast and interpretable way to assign the cell subtype membership, and the speed of UNITO will not be impacted by the number of cells from each sample. The pre-gating and gating inference takes approximately 2 minutes for each sample using our pre-defined 9 gates system, and it can also adapt to any sequential prediction with different configurations.

Collapse

Affiliation(s)

Jiong Chen Department of Bioengineering, University of Pennsylvania School of Engineering and Applied Science, PA, USA Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
Matei Ionita Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Yanbo Feng Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
Yinfeng Lu Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA Department of Mathematics, University of Pennsylvania School of Arts and Sciences, PA, USA
Patryk Orzechowski Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA Department of Automatics and Robotics, AGH University of Science and Technology, al. Mickiewicza 30, Krakow, 30-059, Poland
Sumita Garai Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
Kenneth Hassinger Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA
Jingxuan Bao Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
Junhao Wen Laboratory of AI and Biomedical Science, Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, CA, USA
Duy Duong-Tran Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA Department of Mathematics, United States Naval Academy, Annapolis, MD, USA
Joost Wagenaar Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA
Michelle L. McKeague Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Mark M. Painter Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Divij Mathew Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Ajinkya Pattekar Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Nuala J. Meyer Division of Pulmonary and Critical Care Medicine, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
E. John Wherry Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Allison R. Greenplate Department of Systems Pharmacology & Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, PA, USA Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, PA, USA
Li Shen Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA

Collapse

Li X, Zhang Y, Wang J, Han J, Shen T. Long-term dynamic shifts in genomic base content and evolutionary trajectories of SARS-CoV-2 variants. J Med Virol 2023;95:e29128. [PMID: 37772482 DOI: 10.1002/jmv.29128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/30/2023] [Accepted: 09/15/2023] [Indexed: 09/30/2023]

Zheng P, Zhou C, Ding Y, Liu B, Lu L, Zhu F, Duan S. Nanopore sequencing technology and its applications. MedComm (Beijing) 2023;4:e316. [PMID: 37441463 PMCID: PMC10333861 DOI: 10.1002/mco2.316] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 05/29/2023] [Accepted: 05/31/2023] [Indexed: 07/15/2023] Open

Miao M, De Clercq E, Li G. Towards Efficient and Accurate SARS-CoV-2 Genome Sequence Typing Based on Supervised Learning Approaches. Microorganisms 2022;10:microorganisms10091785. [PMID: 36144387 PMCID: PMC9505117 DOI: 10.3390/microorganisms10091785] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 08/24/2022] [Accepted: 09/01/2022] [Indexed: 11/16/2022] Open

Munis AM, Andersson M, Mobbs A, Hyde SC, Gill DR. Genomic diversity of SARS-CoV-2 in Oxford during United Kingdom's first national lockdown. Sci Rep 2021;11:21484. [PMID: 34728747 PMCID: PMC8564533 DOI: 10.1038/s41598-021-01022-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 10/18/2021] [Indexed: 12/15/2022] Open

Di Pasquale A, Radomski N, Mangone I, Calistri P, Lorusso A, Cammà C. SARS-CoV-2 surveillance in Italy through phylogenomic inferences based on Hamming distances derived from pan-SNPs, -MNPs and -InDels. BMC Genomics 2021;22:782. [PMID: 34717546 PMCID: PMC8556844 DOI: 10.1186/s12864-021-08112-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 10/20/2021] [Indexed: 01/08/2023] Open

Abstract

BACKGROUND

Faced with the ongoing global pandemic of coronavirus disease, the 'National Reference Centre for Whole Genome Sequencing of microbial pathogens: database and bioinformatic analysis' (GENPAT) formally established at the 'Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise' (IZSAM) in Teramo (Italy) is in charge of the SARS-CoV-2 surveillance at the genomic scale. In a context of SARS-CoV-2 surveillance requiring correct and fast assessment of epidemiological clusters from substantial amount of samples, the present study proposes an analytical workflow for identifying accurately the PANGO lineages of SARS-CoV-2 samples and building of discriminant minimum spanning trees (MST) bypassing the usual time consuming phylogenomic inferences based on multiple sequence alignment (MSA) and substitution model.

RESULTS

GENPAT constituted two collections of SARS-CoV-2 samples. The first collection consisted of SARS-CoV-2 positive swabs collected by IZSAM from the Abruzzo region (Italy), then sequenced by next generation sequencing (NGS) and analyzed in GENPAT (n = 1592), while the second collection included samples from several Italian provinces and retrieved from the reference Global Initiative on Sharing All Influenza Data (GISAID) (n = 17,201). The main results of the present work showed that (i) GENPAT and GISAID detected the same PANGO lineages, (ii) the PANGO lineages B.1.177 (i.e. historical in Italy) and B.1.1.7 (i.e. 'UK variant') are major concerns today in several Italian provinces, and the new MST-based method (iii) clusters most of the PANGO lineages together, (iv) with a higher dicriminatory power than PANGO lineages, (v) and faster that the usual phylogenomic methods based on MSA and substitution model.

CONCLUSIONS

The genome sequencing efforts of Italian provinces, combined with a structured national system of NGS data management, provided support for surveillance SARS-CoV-2 in Italy. We propose to build phylogenomic trees of SARS-CoV-2 variants through an accurate, discriminant and fast MST-based method avoiding the typical time consuming steps related to MSA and substitution model-based phylogenomic inference.

Collapse