1
|
Kan Z, Wen J, Bonato V, Webster J, Yang W, Ivanov V, Kim KH, Roh W, Liu C, Mu XJ, Lapira-Miller J, Oyer J, VanArsdale T, Rejto PA, Bienkowska J. Real-world clinical multi-omics analyses reveal bifurcation of ER-independent and ER-dependent drug resistance to CDK4/6 inhibitors. Nat Commun 2025; 16:932. [PMID: 39843429 PMCID: PMC11754447 DOI: 10.1038/s41467-025-55914-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 01/02/2025] [Indexed: 01/30/2025] Open
Abstract
To better understand drug resistance mechanisms to CDK4/6 inhibitors and inform precision medicine, we analyze real-world multi-omics data from 400 HR+/HER2- metastatic breast cancer patients treated with CDK4/6 inhibitors plus endocrine therapies, including 200 pre-treatment and 227 post-progression samples. The prevalences of ESR1 and RB1 alterations significantly increase in post-progression samples. Integrative clustering analysis identifies three subgroups harboring different resistance mechanisms: ER driven, ER co-driven and ER independent. The ER independent subgroup, growing from 5% pre-treatment to 21% post-progression, is characterized by down-regulated estrogen signaling and enrichment of resistance markers including TP53 mutations, CCNE1 over-expression and Her2/Basal subtypes. Trajectory inference analyses identify a pseudotime variable strongly correlated with ER independence and disease progression; and revealed bifurcated evolutionary trajectories for ER-independent vs. ER-dependent drug resistance mechanisms. Machine learning models predict therapeutic dependency on ESR1 and CDK4 among ER-dependent tumors and CDK2 dependency among ER-independent tumors, confirmed by experimental validation.
Collapse
Affiliation(s)
- Zhengyan Kan
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA.
| | - Ji Wen
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | | | | | - Wenjing Yang
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | - Vladimir Ivanov
- Global Biometrics & Data Management, Pfizer Inc., Cambridge, MA, USA
| | | | - Whijae Roh
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | - Chaoting Liu
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | | | | | - Jon Oyer
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | - Todd VanArsdale
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | - Paul A Rejto
- Oncology Research & Development, Pfizer Inc., San Diego, CA, USA
| | | |
Collapse
|
2
|
Qiu X, Zhu DY, Lu Y, Yao J, Jing Z, Min KH, Cheng M, Pan H, Zuo L, King S, Fang Q, Zheng H, Wang M, Wang S, Zhang Q, Yu S, Liao S, Liu C, Wu X, Lai Y, Hao S, Zhang Z, Wu L, Zhang Y, Li M, Tu Z, Lin J, Yang Z, Li Y, Gu Y, Ellison D, Chen A, Liu L, Weissman JS, Ma J, Xu X, Liu S, Bai Y. Spatiotemporal modeling of molecular holograms. Cell 2024; 187:7351-7373.e61. [PMID: 39532097 DOI: 10.1016/j.cell.2024.10.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 05/29/2024] [Accepted: 10/08/2024] [Indexed: 11/16/2024]
Abstract
Quantifying spatiotemporal dynamics during embryogenesis is crucial for understanding congenital diseases. We developed Spateo (https://github.com/aristoteleo/spateo-release), a 3D spatiotemporal modeling framework, and applied it to a 3D mouse embryogenesis atlas at E9.5 and E11.5, capturing eight million cells. Spateo enables scalable, partial, non-rigid alignment, multi-slice refinement, and mesh correction to create molecular holograms of whole embryos. It introduces digitization methods to uncover multi-level biology from subcellular to whole organ, identifying expression gradients along orthogonal axes of emergent 3D structures, e.g., secondary organizers such as midbrain-hindbrain boundary (MHB). Spateo further jointly models intercellular and intracellular interaction to dissect signaling landscapes in 3D structures, including the zona limitans intrathalamica (ZLI). Lastly, Spateo introduces "morphometric vector fields" of cell migration and integrates spatial differential geometry to unveil molecular programs underlying asymmetrical murine heart organogenesis and others, bridging macroscopic changes with molecular dynamics. Thus, Spateo enables the study of organ ecology at a molecular level in 3D space over time.
Collapse
Affiliation(s)
- Xiaojie Qiu
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA; Basic Sciences and Engineering Initiative, Betty Irene Moore Children's Heart Center, Lucile Packard Children's Hospital, Stanford, CA, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA; Stanford Cardiovascular Institute, Stanford University, Stanford, CA, USA.
| | - Daniel Y Zhu
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Yifan Lu
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA; Basic Sciences and Engineering Initiative, Betty Irene Moore Children's Heart Center, Lucile Packard Children's Hospital, Stanford, CA, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA; Electronic Information School, Wuhan University, Wuhan 430072, China
| | - Jiajun Yao
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Zehua Jing
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kyung Hoi Min
- Ginkgo Bioworks, The Innovation and Design Building, Boston, MA 02210, USA
| | - Mengnan Cheng
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China
| | | | - Lulu Zuo
- BGI Research, Shenzhen 518083, China
| | - Samuel King
- Department of Bioengineering, Stanford University School of Medicine, Stanford, CA, USA
| | - Qi Fang
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China
| | - Huiwen Zheng
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mingyue Wang
- BGI Research, Hangzhou 310030, China; Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Shuai Wang
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qingquan Zhang
- Department of Medicine, Division of Cardiology, University of California, San Diego, La Jolla, CA, USA
| | - Sichao Yu
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
| | - Sha Liao
- BGI Research, Shenzhen 518083, China; STOmics Tech Co., Ltd, Shenzhen 518083, China; BGI Research, Chongqing 401329, China
| | - Chao Liu
- BGI Research, Wuhan 430074, China
| | - Xinchao Wu
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Yiwei Lai
- BGI Research, Shenzhen 518083, China
| | | | - Zhewei Zhang
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Liang Wu
- BGI Research, Chongqing 401329, China
| | | | - Mei Li
- STOmics Tech Co., Ltd, Shenzhen 518083, China
| | - Zhencheng Tu
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jinpei Lin
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China
| | - Zhuoxuan Yang
- BGI Research, Hangzhou 310030, China; School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | | | - Ying Gu
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | | | - Ao Chen
- BGI Research, Shenzhen 518083, China; STOmics Tech Co., Ltd, Shenzhen 518083, China; BGI Research, Chongqing 401329, China
| | - Longqi Liu
- BGI Research, Hangzhou 310030, China; Shenzhen Bay Laboratory, Shenzhen 518132, China; Shenzhen Key Laboratory of Single-Cell Omics, BGI-Shenzhen, Shenzhen 518120, China
| | - Jonathan S Weissman
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA; Koch Institute for Integrative Cancer Research at MIT, MIT, Cambridge, MA, USA
| | - Jiayi Ma
- Electronic Information School, Wuhan University, Wuhan 430072, China.
| | - Xun Xu
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China.
| | - Shiping Liu
- BGI Research, Hangzhou 310030, China; Shenzhen Bay Laboratory, Shenzhen 518132, China; Shenzhen Key Laboratory of Single-Cell Omics, BGI-Shenzhen, Shenzhen 518120, China; The Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangzhou, Guangdong, China.
| | - Yinqi Bai
- BGI Research, Sanya 572025, China; Hainan Technology Innovation Center for Marine Biological Resources Utilization (Preparatory Period), BGI Research, Sanya 572025, China.
| |
Collapse
|
3
|
Kotov A, Seal S, Alkobtawi M, Kappès V, Ruiz SM, Arbès H, Harland RM, Peshkin L, Monsoro-Burq AH. A time-resolved single-cell roadmap of the logic driving anterior neural crest diversification from neural border to migration stages. Proc Natl Acad Sci U S A 2024; 121:e2311685121. [PMID: 38683994 PMCID: PMC11087755 DOI: 10.1073/pnas.2311685121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Accepted: 03/12/2024] [Indexed: 05/02/2024] Open
Abstract
Neural crest cells exemplify cellular diversification from a multipotent progenitor population. However, the full sequence of early molecular choices orchestrating the emergence of neural crest heterogeneity from the embryonic ectoderm remains elusive. Gene-regulatory-networks (GRN) govern early development and cell specification toward definitive neural crest. Here, we combine ultradense single-cell transcriptomes with machine-learning and large-scale transcriptomic and epigenomic experimental validation of selected trajectories, to provide the general principles and highlight specific features of the GRN underlying neural crest fate diversification from induction to early migration stages using Xenopus frog embryos as a model. During gastrulation, a transient neural border zone state precedes the choice between neural crest and placodes which includes multiple converging gene programs. During neurulation, transcription factor connectome, and bifurcation analyses demonstrate the early emergence of neural crest fates at the neural plate stage, alongside an unbiased multipotent-like lineage persisting until epithelial-mesenchymal transition stage. We also decipher circuits driving cranial and vagal neural crest formation and provide a broadly applicable high-throughput validation strategy for investigating single-cell transcriptomes in vertebrate GRNs in development, evolution, and disease.
Collapse
Affiliation(s)
- Aleksandr Kotov
- Université Paris-Saclay, Département de Biologie, Faculté des Sciences d’Orsay, Signalisation Radiobiology and Cancer, CNRS UMR 3347, INSERM U1021, OrsayF-91405, France
- Institut Curie Research Division, Paris Science et Lettres Research University, OrsayF-91405, France
| | - Subham Seal
- Université Paris-Saclay, Département de Biologie, Faculté des Sciences d’Orsay, Signalisation Radiobiology and Cancer, CNRS UMR 3347, INSERM U1021, OrsayF-91405, France
- Institut Curie Research Division, Paris Science et Lettres Research University, OrsayF-91405, France
| | - Mansour Alkobtawi
- Université Paris-Saclay, Département de Biologie, Faculté des Sciences d’Orsay, Signalisation Radiobiology and Cancer, CNRS UMR 3347, INSERM U1021, OrsayF-91405, France
- Institut Curie Research Division, Paris Science et Lettres Research University, OrsayF-91405, France
| | - Vincent Kappès
- Université Paris-Saclay, Département de Biologie, Faculté des Sciences d’Orsay, Signalisation Radiobiology and Cancer, CNRS UMR 3347, INSERM U1021, OrsayF-91405, France
- Institut Curie Research Division, Paris Science et Lettres Research University, OrsayF-91405, France
| | - Sofia Medina Ruiz
- Molecular and Cell Biology Department, Genetics, Genomics and Development Division, University of California Berkeley, CA94720
| | - Hugo Arbès
- Université Paris-Saclay, Département de Biologie, Faculté des Sciences d’Orsay, Signalisation Radiobiology and Cancer, CNRS UMR 3347, INSERM U1021, OrsayF-91405, France
- Institut Curie Research Division, Paris Science et Lettres Research University, OrsayF-91405, France
| | - Richard M. Harland
- Molecular and Cell Biology Department, Genetics, Genomics and Development Division, University of California Berkeley, CA94720
| | - Leonid Peshkin
- Systems Biology Division, Harvard Medical School, Boston, MA02115
| | - Anne H. Monsoro-Burq
- Université Paris-Saclay, Département de Biologie, Faculté des Sciences d’Orsay, Signalisation Radiobiology and Cancer, CNRS UMR 3347, INSERM U1021, OrsayF-91405, France
- Institut Curie Research Division, Paris Science et Lettres Research University, OrsayF-91405, France
- Institut Universitaire de France, ParisF-75005, France
| |
Collapse
|
4
|
Morizet D, Foucher I, Alunni A, Bally-Cuif L. Reconstruction of macroglia and adult neurogenesis evolution through cross-species single-cell transcriptomic analyses. Nat Commun 2024; 15:3306. [PMID: 38632253 PMCID: PMC11024210 DOI: 10.1038/s41467-024-47484-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 03/29/2024] [Indexed: 04/19/2024] Open
Abstract
Macroglia fulfill essential functions in the adult vertebrate brain, producing and maintaining neurons and regulating neuronal communication. However, we still know little about their emergence and diversification. We used the zebrafish D. rerio as a distant vertebrate model with moderate glial diversity as anchor to reanalyze datasets covering over 600 million years of evolution. We identify core features of adult neurogenesis and innovations in the mammalian lineage with a potential link to the rarity of radial glia-like cells in adult humans. Our results also suggest that functions associated with astrocytes originated in a multifunctional cell type fulfilling both neural stem cell and astrocytic functions before these diverged. Finally, we identify conserved elements of macroglial cell identity and function and their time of emergence during evolution.
Collapse
Affiliation(s)
- David Morizet
- Institut Pasteur, Université Paris Cité, CNRS UMR3738, Zebrafish Neurogenetics Unit, Team supported by the Ligue Nationale Contre le Cancer, F-75015, Paris, France.
- Sorbonne Université, Collège doctoral, F-75005, Paris, France.
| | - Isabelle Foucher
- Institut Pasteur, Université Paris Cité, CNRS UMR3738, Zebrafish Neurogenetics Unit, Team supported by the Ligue Nationale Contre le Cancer, F-75015, Paris, France
| | - Alessandro Alunni
- Institut Pasteur, Université Paris Cité, CNRS UMR3738, Zebrafish Neurogenetics Unit, Team supported by the Ligue Nationale Contre le Cancer, F-75015, Paris, France
- Institut des Neurosciences Paris-Saclay, Université Paris-Saclay, CNRS UMR9197, F-91190, Gif-sur-Yvette, France
| | - Laure Bally-Cuif
- Institut Pasteur, Université Paris Cité, CNRS UMR3738, Zebrafish Neurogenetics Unit, Team supported by the Ligue Nationale Contre le Cancer, F-75015, Paris, France.
| |
Collapse
|
5
|
Croizer H, Mhaidly R, Kieffer Y, Gentric G, Djerroudi L, Leclere R, Pelon F, Robley C, Bohec M, Meng A, Meseure D, Romano E, Baulande S, Peltier A, Vincent-Salomon A, Mechta-Grigoriou F. Deciphering the spatial landscape and plasticity of immunosuppressive fibroblasts in breast cancer. Nat Commun 2024; 15:2806. [PMID: 38561380 PMCID: PMC10984943 DOI: 10.1038/s41467-024-47068-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 03/19/2024] [Indexed: 04/04/2024] Open
Abstract
Although heterogeneity of FAP+ Cancer-Associated Fibroblasts (CAF) has been described in breast cancer, their plasticity and spatial distribution remain poorly understood. Here, we analyze trajectory inference, deconvolute spatial transcriptomics at single-cell level and perform functional assays to generate a high-resolution integrated map of breast cancer (BC), with a focus on inflammatory and myofibroblastic (iCAF/myCAF) FAP+ CAF clusters. We identify 10 spatially-organized FAP+ CAF-related cellular niches, called EcoCellTypes, which are differentially localized within tumors. Consistent with their spatial organization, cancer cells drive the transition of detoxification-associated iCAF (Detox-iCAF) towards immunosuppressive extracellular matrix (ECM)-producing myCAF (ECM-myCAF) via a DPP4- and YAP-dependent mechanism. In turn, ECM-myCAF polarize TREM2+ macrophages, regulatory NK and T cells to induce immunosuppressive EcoCellTypes, while Detox-iCAF are associated with FOLR2+ macrophages in an immuno-protective EcoCellType. FAP+ CAF subpopulations accumulate differently according to the invasive BC status and predict invasive recurrence of ductal carcinoma in situ (DCIS), which could help in identifying low-risk DCIS patients eligible for therapeutic de-escalation.
Collapse
Affiliation(s)
- Hugo Croizer
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Rana Mhaidly
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Yann Kieffer
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Geraldine Gentric
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Lounes Djerroudi
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
- Department of Diagnostic and Theragnostic Medicine, Institut Curie Hospital Group, 26, Rue d'Ulm, F-75248, Paris, France
| | - Renaud Leclere
- Department of Diagnostic and Theragnostic Medicine, Institut Curie Hospital Group, 26, Rue d'Ulm, F-75248, Paris, France
| | - Floriane Pelon
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Catherine Robley
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Mylene Bohec
- Institut Curie, PSL Research University, ICGex Next-Generation Sequencing Platform, 75005, Paris, France
- Institut Curie, PSL Research University, Single Cell Initiative, 75005, Paris, France
| | - Arnaud Meng
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Didier Meseure
- Department of Diagnostic and Theragnostic Medicine, Institut Curie Hospital Group, 26, Rue d'Ulm, F-75248, Paris, France
| | - Emanuela Romano
- Department of Medical Oncology, Center for Cancer Immunotherapy, Institut Curie, 26, Rue d'Ulm, F-75248, Paris, France
| | - Sylvain Baulande
- Institut Curie, PSL Research University, ICGex Next-Generation Sequencing Platform, 75005, Paris, France
- Institut Curie, PSL Research University, Single Cell Initiative, 75005, Paris, France
| | - Agathe Peltier
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France
| | - Anne Vincent-Salomon
- Department of Diagnostic and Theragnostic Medicine, Institut Curie Hospital Group, 26, Rue d'Ulm, F-75248, Paris, France
| | - Fatima Mechta-Grigoriou
- Institut Curie, Stress and Cancer Laboratory, Equipe Labélisée par la Ligue Nationale Contre le Cancer, PSL Research University, 26, Rue d'Ulm, F-75248, Paris, France.
- Inserm, U830, 26, Rue d'Ulm, F-75005, Paris, France.
| |
Collapse
|
6
|
Ualiyeva S, Lemire E, Wong C, Perniss A, Boyd A, Avilés EC, Minichetti DG, Maxfield A, Roditi R, Matsumoto I, Wang X, Deng W, Barrett NA, Buchheit KM, Laidlaw TM, Boyce JA, Bankova LG, Haber AL. A nasal cell atlas reveals heterogeneity of tuft cells and their role in directing olfactory stem cell proliferation. Sci Immunol 2024; 9:eabq4341. [PMID: 38306414 PMCID: PMC11127180 DOI: 10.1126/sciimmunol.abq4341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 12/08/2023] [Indexed: 02/04/2024]
Abstract
The olfactory neuroepithelium serves as a sensory organ for odors and forms part of the nasal mucosal barrier. Olfactory sensory neurons are surrounded and supported by epithelial cells. Among them, microvillous cells (MVCs) are strategically positioned at the apical surface, but their specific functions are enigmatic, and their relationship to the other specialized epithelial cells is unclear. Here, we establish that the family of MVCs comprises tuft cells and ionocytes in both mice and humans. Integrating analysis of the respiratory and olfactory epithelia, we define the distinct receptor expression of TRPM5+ tuft-MVCs compared with Gɑ-gustducinhigh respiratory tuft cells and characterize a previously undescribed population of glandular DCLK1+ tuft cells. To establish how allergen sensing by tuft-MVCs might direct olfactory mucosal responses, we used an integrated single-cell transcriptional and protein analysis. Inhalation of Alternaria induced mucosal epithelial effector molecules including Chil4 and a distinct pathway leading to proliferation of the quiescent olfactory horizontal basal stem cell (HBC) pool, both triggered in the absence of olfactory apoptosis. Alternaria- and ATP-elicited HBC proliferation was dependent on TRPM5+ tuft-MVCs, identifying these specialized epithelial cells as regulators of olfactory stem cell responses. Together, our data provide high-resolution characterization of nasal tuft cell heterogeneity and identify a function of TRPM5+ tuft-MVCs in directing the olfactory mucosal response to allergens.
Collapse
Affiliation(s)
- Saltanat Ualiyeva
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Evan Lemire
- Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA
| | - Caitlin Wong
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Alexander Perniss
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Amelia Boyd
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Evelyn C. Avilés
- Department of Neurobiology, Harvard Medical School, Boston, MA; currently at Faculty of Biological Sciences, Pontificia Universidad Católica de Chile
| | - Dante G. Minichetti
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Alice Maxfield
- Division of Otolaryngology-Head and Neck Surgery, Brigham and Women’s Hospital and Department of Otolaryngology-Head and Neck Surgery, Harvard Medical School, Boston, MA
| | - Rachel Roditi
- Division of Otolaryngology-Head and Neck Surgery, Brigham and Women’s Hospital and Department of Otolaryngology-Head and Neck Surgery, Harvard Medical School, Boston, MA
| | | | - Xin Wang
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Wenjiang Deng
- Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA
| | - Nora A. Barrett
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Kathleen M. Buchheit
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Tanya M. Laidlaw
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Joshua A. Boyce
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Lora G. Bankova
- Division of Allergy and Clinical Immunology, Jeff and Penny Vinik Center for Allergic Disease Research, Brigham & Women’s Hospital and Department of Medicine, Harvard Medical School, Boston, MA
| | - Adam L. Haber
- Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA
| |
Collapse
|
7
|
Joodaki M, Shaigan M, Parra V, Bülow RD, Kuppe C, Hölscher DL, Cheng M, Nagai JS, Goedertier M, Bouteldja N, Tesar V, Barratt J, Roberts IS, Coppo R, Kramann R, Boor P, Costa IG. Detection of PatIent-Level distances from single cell genomics and pathomics data with Optimal Transport (PILOT). Mol Syst Biol 2024; 20:57-74. [PMID: 38177382 PMCID: PMC10883279 DOI: 10.1038/s44320-023-00003-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/20/2023] [Accepted: 11/24/2023] [Indexed: 01/06/2024] Open
Abstract
Although clinical applications represent the next challenge in single-cell genomics and digital pathology, we still lack computational methods to analyze single-cell or pathomics data to find sample-level trajectories or clusters associated with diseases. This remains challenging as single-cell/pathomics data are multi-scale, i.e., a sample is represented by clusters of cells/structures, and samples cannot be easily compared with each other. Here we propose PatIent Level analysis with Optimal Transport (PILOT). PILOT uses optimal transport to compute the Wasserstein distance between two individual single-cell samples. This allows us to perform unsupervised analysis at the sample level and uncover trajectories or cellular clusters associated with disease progression. We evaluate PILOT and competing approaches in single-cell genomics or pathomics studies involving various human diseases with up to 600 samples/patients and millions of cells or tissue structures. Our results demonstrate that PILOT detects disease-associated samples from large and complex single-cell or pathomics data. Moreover, PILOT provides a statistical approach to find changes in cell populations, gene expression, and tissue structures related to the trajectories or clusters supporting interpretation of predictions.
Collapse
Affiliation(s)
- Mehdi Joodaki
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Mina Shaigan
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Victor Parra
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Roman D Bülow
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Christoph Kuppe
- Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Aachen, Germany
| | - David L Hölscher
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Mingbo Cheng
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - James S Nagai
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Michaël Goedertier
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Nassim Bouteldja
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Vladimir Tesar
- Department of Nephrology, 1st Faculty of Medicine and General University Hospital, Charles University, Prague, Czech Republic
| | - Jonathan Barratt
- John Walls Renal Unit, University Hospital of Leicester National Health Service Trust, Leicester, UK
- Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
| | - Ian Sd Roberts
- Department of Cellular Pathology, Oxford University Hospitals National Health Services Foundation Trust, Oxford, UK
| | - Rosanna Coppo
- Fondazione Ricerca Molinette, Regina Margherita Children's University Hospital, Torino, Italy
| | - Rafael Kramann
- Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Aachen, Germany
- Department of Internal Medicine, Nephrology and Transplantation, Erasmus Medical Center, Rotterdam, Netherlands
| | - Peter Boor
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany.
| | - Ivan G Costa
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany.
| |
Collapse
|
8
|
Lobón-Iglesias MJ, Andrianteranagna M, Han ZY, Chauvin C, Masliah-Planchon J, Manriquez V, Tauziede-Espariat A, Turczynski S, Bouarich-Bourimi R, Frah M, Dufour C, Blauwblomme T, Cardoen L, Pierron G, Maillot L, Guillemot D, Reynaud S, Bourneix C, Pouponnot C, Surdez D, Bohec M, Baulande S, Delattre O, Piaggio E, Ayrault O, Waterfall JJ, Servant N, Beccaria K, Dangouloff-Ros V, Bourdeaut F. Imaging and multi-omics datasets converge to define different neural progenitor origins for ATRT-SHH subgroups. Nat Commun 2023; 14:6669. [PMID: 37863903 PMCID: PMC10589300 DOI: 10.1038/s41467-023-42371-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 10/09/2023] [Indexed: 10/22/2023] Open
Abstract
Atypical teratoid rhabdoid tumors (ATRT) are divided into MYC, TYR and SHH subgroups, suggesting diverse lineages of origin. Here, we investigate the imaging of human ATRT at diagnosis and the precise anatomic origin of brain tumors in the Rosa26-CreERT2::Smarcb1flox/flox model. This cross-species analysis points to an extra-cerebral origin for MYC tumors. Additionally, we clearly distinguish SHH ATRT emerging from the cerebellar anterior lobe (CAL) from those emerging from the basal ganglia (BG) and intra-ventricular (IV) regions. Molecular characteristics point to the midbrain-hindbrain boundary as the origin of CAL SHH ATRT, and to the ganglionic eminence as the origin of BG/IV SHH ATRT. Single-cell RNA sequencing on SHH ATRT supports these hypotheses. Trajectory analyses suggest that SMARCB1 loss induces a de-differentiation process mediated by repressors of the neuronal program such as REST, ID and the NOTCH pathway.
Collapse
Affiliation(s)
- María-Jesús Lobón-Iglesias
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
| | - Mamy Andrianteranagna
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
- INSERM U900, Bioinformatics, Biostatistics, Epidemiology and Computational Systems Unit, Institut Curie, Mines Paris Tech, PSL Research University, Institut Curie Research Center, Paris, France
| | - Zhi-Yan Han
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
| | - Céline Chauvin
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
| | - Julien Masliah-Planchon
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
| | - Valeria Manriquez
- INSERM U932, Immunity and Cancer, PSL Research University, Institut Curie Research Center, Paris, France
| | - Arnault Tauziede-Espariat
- Department of Neuropathology, GHU Paris-Psychiatry and Neurosciences, Sainte-Anne Hospital, Paris, France
- Paris Psychiatry and Neurosciences Institute (IPNP), UMR S1266, INSERM, IMA-BRAIN, Paris, France
| | - Sandrina Turczynski
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
| | - Rachida Bouarich-Bourimi
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
| | - Magali Frah
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France
| | - Christelle Dufour
- Department of Children and Adolescents Oncology, Gustave Roussy, Paris Saclay University, Villejuif, France
| | - Thomas Blauwblomme
- Department of Pediatric Neurosurgery-AP-HP, Necker Sick Kids Hospital, Université de Paris, Paris, France
| | | | - Gaelle Pierron
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
| | - Laetitia Maillot
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
| | - Delphine Guillemot
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
| | - Stéphanie Reynaud
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
| | - Christine Bourneix
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
| | - Célio Pouponnot
- CNRS UMR 3347, INSERM U1021, Institut Curie, PSL Research University, Université Paris-Saclay, Orsay, France
| | - Didier Surdez
- INSERM U830, Diversity and Plasticity of Childhood Tumors Lab, PSL Research University, SIREDO Oncology Center, Institut Curie Research Center, Paris, France
- Balgrist University Hospital, Faculty of Medicine, University of Zurich (UZH), Zurich, Switzerland
| | - Mylene Bohec
- Institut Curie, PSL University, Single Cell Initiative, ICGex Next-Generation Sequencing Platform, PSL University, 75005, Paris, France
| | - Sylvain Baulande
- Institut Curie, PSL University, Single Cell Initiative, ICGex Next-Generation Sequencing Platform, PSL University, 75005, Paris, France
| | - Olivier Delattre
- Somatic Genetic Unit, Department of Pathology and Diagnostic and Theranostic Medecine, Institut Curie Hospital, Paris, France
- INSERM U830, Diversity and Plasticity of Childhood Tumors Lab, PSL Research University, SIREDO Oncology Center, Institut Curie Research Center, Paris, France
| | - Eliane Piaggio
- INSERM U932, Immunity and Cancer, PSL Research University, Institut Curie Research Center, Paris, France
| | - Olivier Ayrault
- CNRS UMR 3347, INSERM U1021, Institut Curie, PSL Research University, Université Paris-Saclay, Orsay, France
| | - Joshua J Waterfall
- INSERM U830, Integrative Functional Genomics of Cancer Lab, PSL Research University, Institut Curie Research Center, Paris, France
- Department of Translational Research, PSL Research University, Institut Curie Research Center, Paris, France
| | - Nicolas Servant
- INSERM U900, Bioinformatics, Biostatistics, Epidemiology and Computational Systems Unit, Institut Curie, Mines Paris Tech, PSL Research University, Institut Curie Research Center, Paris, France
| | - Kevin Beccaria
- Department of Pediatric Neurosurgery-AP-HP, Necker Sick Kids Hospital, Université de Paris, Paris, France
| | - Volodia Dangouloff-Ros
- Pediatric Radiology Department, AP-HP, Necker Sick Kids Hospital and Paris Cite Universiy INSERM 1299 and UMR 1163, Institut Imagine, Paris, France
| | - Franck Bourdeaut
- INSERM U830, Laboratory of Translational Research In Pediatric Oncology, PSL Research University, SIREDO Oncology center, Institut Curie Research Center, Paris, France.
- Department of Pediatric Oncology, SIREDO Oncology Center, Institut Curie Hospital, Paris, and Université de Paris, Paris, France.
| |
Collapse
|
9
|
Wu YS, Taniar D, Adhinugraha K, Tsai LK, Pai TW. Detection of Amyotrophic Lateral Sclerosis (ALS) Comorbidity Trajectories Based on Principal Tree Model Analytics. Biomedicines 2023; 11:2629. [PMID: 37893003 PMCID: PMC10604752 DOI: 10.3390/biomedicines11102629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 09/11/2023] [Accepted: 09/22/2023] [Indexed: 10/29/2023] Open
Abstract
The multifaceted nature and swift progression of Amyotrophic Lateral Sclerosis (ALS) pose considerable challenges to our understanding of its evolution and interplay with comorbid conditions. This study seeks to elucidate the temporal dynamics of ALS progression and its interaction with associated diseases. We employed a principal tree-based model to decipher patterns within clinical data derived from a population-based database in Taiwan. The disease progression was portrayed as branched trajectories, each path representing a series of distinct stages. Each stage embodied the cumulative occurrence of co-existing diseases, depicted as nodes on the tree, with edges symbolizing potential transitions between these linked nodes. Our model identified eight distinct ALS patient trajectories, unveiling unique patterns of disease associations at various stages of progression. These patterns may suggest underlying disease mechanisms or risk factors. This research re-conceptualizes ALS progression as a migration through diverse stages, instead of the perspective of a sequence of isolated events. This new approach illuminates patterns of disease association across different progression phases. The insights obtained from this study hold the potential to inform doctors regarding the development of personalized treatment strategies, ultimately enhancing patient prognosis and quality of life.
Collapse
Affiliation(s)
- Yang-Sheng Wu
- Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei 106, Taiwan;
| | - David Taniar
- Department of Software Systems & Cybersecurity, Monash University, Melbourne, VIC 3800, Australia;
| | - Kiki Adhinugraha
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3086, Australia;
| | - Li-Kai Tsai
- Department of Neurology and Stroke Center, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei 100, Taiwan;
| | - Tun-Wen Pai
- Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei 106, Taiwan;
| |
Collapse
|
10
|
Smolander J, Junttila S, Elo LL. Cell-connectivity-guided trajectory inference from single-cell data. Bioinformatics 2023; 39:btad515. [PMID: 37624916 PMCID: PMC10474950 DOI: 10.1093/bioinformatics/btad515] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 07/24/2023] [Accepted: 08/23/2023] [Indexed: 08/27/2023] Open
Abstract
MOTIVATION Single-cell RNA-sequencing enables cell-level investigation of cell differentiation, which can be modelled using trajectory inference methods. While tremendous effort has been put into designing these methods, inferring accurate trajectories automatically remains difficult. Therefore, the standard approach involves testing different trajectory inference methods and picking the trajectory giving the most biologically sensible model. As the default parameters are often suboptimal, their tuning requires methodological expertise. RESULTS We introduce Totem, an open-source, easy-to-use R package designed to facilitate inference of tree-shaped trajectories from single-cell data. Totem generates a large number of clustering results, estimates their topologies as minimum spanning trees, and uses them to measure the connectivity of the cells. Besides automatic selection of an appropriate trajectory, cell connectivity enables to visually pinpoint branching points and milestones relevant to the trajectory. Furthermore, testing different trajectories with Totem is fast, easy, and does not require in-depth methodological knowledge. AVAILABILITY AND IMPLEMENTATION Totem is available as an R package at https://github.com/elolab/Totem.
Collapse
Affiliation(s)
- Johannes Smolander
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
| | - Sini Junttila
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
| | - Laura L Elo
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
- Institute of Biomedicine, University of Turku, 20520 Turku, Finland
| |
Collapse
|
11
|
Yuan F, Gasser GN, Lemire E, Montoro DT, Jagadeesh K, Zhang Y, Duan Y, Ievlev V, Wells KL, Rotti PG, Shahin W, Winter M, Rosen BH, Evans I, Cai Q, Yu M, Walsh SA, Acevedo MR, Pandya DN, Akurathi V, Dick DW, Wadas TJ, Joo NS, Wine JJ, Birket S, Fernandez CM, Leung HM, Tearney GJ, Verkman AS, Haggie PM, Scott K, Bartels D, Meyerholz DK, Rowe SM, Liu X, Yan Z, Haber AL, Sun X, Engelhardt JF. Transgenic ferret models define pulmonary ionocyte diversity and function. Nature 2023; 621:857-867. [PMID: 37730992 PMCID: PMC10533402 DOI: 10.1038/s41586-023-06549-9] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Accepted: 08/17/2023] [Indexed: 09/22/2023]
Abstract
Speciation leads to adaptive changes in organ cellular physiology and creates challenges for studying rare cell-type functions that diverge between humans and mice. Rare cystic fibrosis transmembrane conductance regulator (CFTR)-rich pulmonary ionocytes exist throughout the cartilaginous airways of humans1,2, but limited presence and divergent biology in the proximal trachea of mice has prevented the use of traditional transgenic models to elucidate ionocyte functions in the airway. Here we describe the creation and use of conditional genetic ferret models to dissect pulmonary ionocyte biology and function by enabling ionocyte lineage tracing (FOXI1-CreERT2::ROSA-TG), ionocyte ablation (FOXI1-KO) and ionocyte-specific deletion of CFTR (FOXI1-CreERT2::CFTRL/L). By comparing these models with cystic fibrosis ferrets3,4, we demonstrate that ionocytes control airway surface liquid absorption, secretion, pH and mucus viscosity-leading to reduced airway surface liquid volume and impaired mucociliary clearance in cystic fibrosis, FOXI1-KO and FOXI1-CreERT2::CFTRL/L ferrets. These processes are regulated by CFTR-dependent ionocyte transport of Cl- and HCO3-. Single-cell transcriptomics and in vivo lineage tracing revealed three subtypes of pulmonary ionocytes and a FOXI1-lineage common rare cell progenitor for ionocytes, tuft cells and neuroendocrine cells during airway development. Thus, rare pulmonary ionocytes perform critical CFTR-dependent functions in the proximal airway that are hallmark features of cystic fibrosis airway disease. These studies provide a road map for using conditional genetics in the first non-rodent mammal to address gene function, cell biology and disease processes that have greater evolutionary conservation between humans and ferrets.
Collapse
Affiliation(s)
- Feng Yuan
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Grace N Gasser
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Evan Lemire
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | | | | | - Yan Zhang
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Yifan Duan
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Vitaly Ievlev
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Kristen L Wells
- Barbara Davis Center for Childhood Diabetes, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Pavana G Rotti
- Synthetic Biology Center, Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Weam Shahin
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Michael Winter
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Bradley H Rosen
- Division of Pulmonary, Critical Care, Occupational, and Sleep Medicine, Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Idil Evans
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Qian Cai
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Miao Yu
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Susan A Walsh
- Department of Radiology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Michael R Acevedo
- Department of Radiology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Darpan N Pandya
- Department of Radiology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Vamsidhar Akurathi
- Department of Radiology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - David W Dick
- Department of Radiology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Thaddeus J Wadas
- Department of Radiology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Nam Soo Joo
- Cystic Fibrosis Research Laboratory, Department of Psychology, Stanford University, Stanford, CA, USA
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Jeffrey J Wine
- Cystic Fibrosis Research Laboratory, Department of Psychology, Stanford University, Stanford, CA, USA
| | - Susan Birket
- Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Courtney M Fernandez
- Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Hui Min Leung
- Wellman Center for Photomedicine, Massachusetts General Hospital, Boston, MA, USA
| | - Guillermo J Tearney
- Wellman Center for Photomedicine, Massachusetts General Hospital, Boston, MA, USA
| | - Alan S Verkman
- Department of Medicine, UCSF, San Francisco, CA, USA
- Department of Physiology, UCSF, San Francisco, CA, USA
| | - Peter M Haggie
- Department of Medicine, UCSF, San Francisco, CA, USA
- Department of Physiology, UCSF, San Francisco, CA, USA
| | - Kathleen Scott
- Office of Animal Resources, University of Iowa, Iowa City, IA, USA
| | - Douglas Bartels
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | | | - Steven M Rowe
- Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Xiaoming Liu
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Ziying Yan
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Adam L Haber
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
| | - Xingshen Sun
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA.
| | - John F Engelhardt
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA.
| |
Collapse
|
12
|
Lysov M, Pukhkiy K, Vasiliev E, Getmanskaya A, Turlapov V. Ensuring Explainability and Dimensionality Reduction in a Multidimensional HSI World for Early XAI-Diagnostics of Plant Stress. ENTROPY (BASEL, SWITZERLAND) 2023; 25:e25050801. [PMID: 37238556 DOI: 10.3390/e25050801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 05/08/2023] [Accepted: 05/08/2023] [Indexed: 05/28/2023]
Abstract
This work is mostly devoted to the search for effective solutions to the problem of early diagnosis of plant stress (given an example of wheat and its drought stress), which would be based on explainable artificial intelligence (XAI). The main idea is to combine the benefits of two of the most popular agricultural data sources, hyperspectral images (HSI) and thermal infrared images (TIR), in a single XAI model. Our own dataset of a 25-day experiment was used, which was created via both (1) an HSI camera Specim IQ (400-1000 nm, 204, 512 × 512) and (2) a TIR camera Testo 885-2 (320 × 240, res. 0.1 °C). The HSI were a source of the k-dimensional high-level features of plants (k ≤ K, where K is the number of HSI channels) for the learning process. Such combination was implemented as a single-layer perceptron (SLP) regressor, which is the main feature of the XAI model and receives as input an HSI pixel-signature belonging to the plant mask, which then automatically through the mask receives a mark from the TIR. The correlation of HSI channels with the TIR image on the plant's mask on the days of the experiment was studied. It was established that HSI channel 143 (820 nm) was the most correlated with TIR. The problem of training the HSI signatures of plants with their corresponding temperature value via the XAI model was solved. The RMSE of plant temperature prediction is 0.2-0.3 °C, which is acceptable for early diagnostics. Each HSI pixel was represented in training by a number (k) of channels (k ≤ K = 204 in our case). The number of channels used for training was minimized by a factor of 25-30, from 204 to eight or seven, while maintaining the RMSE value. The model is computationally efficient in training; the average training time was much less than one minute (Intel Core i3-8130U, 2.2 GHz, 4 cores, 4 GB). This XAI model can be considered a research-aimed model (R-XAI), which allows the transfer of knowledge about plants from the TIR domain to the HSI domain, with their contrasting onto only a few from hundreds of HSI channels.
Collapse
Affiliation(s)
- Maxim Lysov
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhny Novgorod, Russia
| | - Konstantin Pukhkiy
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhny Novgorod, Russia
| | - Evgeny Vasiliev
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhny Novgorod, Russia
| | - Alexandra Getmanskaya
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhny Novgorod, Russia
| | - Vadim Turlapov
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhny Novgorod, Russia
| |
Collapse
|
13
|
Li X, Lin Y, Xie C, Li Z, Chen M, Wang P, Zhou J. A Clustering Method Unifying Cell-Type Recognition and Subtype Identification for Tumor Heterogeneity Analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:822-832. [PMID: 36044493 DOI: 10.1109/tcbb.2022.3203185] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The rapid development of single-cell technology has opened up a whole new perspective for identifying cell types in multicellular organisms and understanding the relationships between them. Distinguishing different cell types and subtypes can identify the components of different immune cells and different tumor clones in the tumor microenvironment, which is the basic work of tumor heterogeneity analysis and can help researchers understand the mechanism of tumor immune escape. Existing algorithms treat both cell types and subtypes as populations of cells with specific gene expression patterns, which is not conducive to accurate cell typing. For that, we proposed a cell similarity metric that unifies cell type recognition and subtype identification (UCRSI), with the assumption that selectively expressed genes represent differences in underlying cell type with on/off manner, while differences in expression level represent different cell subtype with more/less manner. Our method calculates these two kinds of differences separately, and then combines them using a consensus adjacency matrix, and finally cell typing is completed using spectral clustering algorithm. The results show that UCRSI can reconstruct expert annotation of single-cell RNA sequencing datasets more robustly than existing methods. And, UCRSI is useful for analyzing tumor heterogeneity and improving visualization of large-scale cell clustering.
Collapse
|
14
|
Faure L, Soldatov R, Kharchenko PV, Adameyko I. scFates: a scalable python package for advanced pseudotime and bifurcation analysis from single-cell data. Bioinformatics 2023; 39:btac746. [PMID: 36394263 PMCID: PMC9805561 DOI: 10.1093/bioinformatics/btac746] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/30/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022] Open
Abstract
SUMMARY scFates provides an extensive toolset for the analysis of dynamic trajectories comprising tree learning, feature association testing, branch differential expression and with a focus on cell biasing and fate splits at the level of bifurcations. It is meant to be fully integrated into the scanpy ecosystem for seamless analysis of trajectories from single-cell data of various modalities (e.g. RNA and ATAC). AVAILABILITY AND IMPLEMENTATION scFates is released as open-source software under the BSD 3-Clause 'New' License and is available from the Python Package Index at https://pypi.org/project/scFates/. The source code is available on GitHub at https://github.com/LouisFaure/scFates/. Code reproduction and tutorials on published datasets are available on GitHub at https://github.com/LouisFaure/scFates_notebooks. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Louis Faure
- Department of Neuroimmunology, Center for Brain Research, Medical University Vienna, 1090 Vienna, Austria
| | - Ruslan Soldatov
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Peter V Kharchenko
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Altos Labs, San Diego, CA 92121, USA
| | - Igor Adameyko
- Department of Neuroimmunology, Center for Brain Research, Medical University Vienna, 1090 Vienna, Austria
| |
Collapse
|
15
|
Mirkes EM, Bac J, Fouché A, Stasenko SV, Zinovyev A, Gorban AN. Domain Adaptation Principal Component Analysis: Base Linear Method for Learning with Out-of-Distribution Data. ENTROPY (BASEL, SWITZERLAND) 2022; 25:33. [PMID: 36673174 PMCID: PMC9858254 DOI: 10.3390/e25010033] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 12/18/2022] [Accepted: 12/21/2022] [Indexed: 06/17/2023]
Abstract
Domain adaptation is a popular paradigm in modern machine learning which aims at tackling the problem of divergence (or shift) between the labeled training and validation datasets (source domain) and a potentially large unlabeled dataset (target domain). The task is to embed both datasets into a common space in which the source dataset is informative for training while the divergence between source and target is minimized. The most popular domain adaptation solutions are based on training neural networks that combine classification and adversarial learning modules, frequently making them both data-hungry and difficult to train. We present a method called Domain Adaptation Principal Component Analysis (DAPCA) that identifies a linear reduced data representation useful for solving the domain adaptation task. DAPCA algorithm introduces positive and negative weights between pairs of data points, and generalizes the supervised extension of principal component analysis. DAPCA is an iterative algorithm that solves a simple quadratic optimization problem at each iteration. The convergence of the algorithm is guaranteed, and the number of iterations is small in practice. We validate the suggested algorithm on previously proposed benchmarks for solving the domain adaptation task. We also show the benefit of using DAPCA in analyzing single-cell omics datasets in biomedical applications. Overall, DAPCA can serve as a practical preprocessing step in many machine learning applications leading to reduced dataset representations, taking into account possible divergence between source and target domains.
Collapse
Affiliation(s)
- Evgeny M. Mirkes
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
| | - Jonathan Bac
- Institut Curie, PSL Research University, 75005 Paris, France
- Institut National de la Santé et de la Recherche Médicale (INSERM), U900, 75012 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75005 Paris, France
| | - Aziz Fouché
- Institut Curie, PSL Research University, 75005 Paris, France
- Institut National de la Santé et de la Recherche Médicale (INSERM), U900, 75012 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75005 Paris, France
| | - Sergey V. Stasenko
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603000 Nizhniy Novgorod, Russia
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, 75005 Paris, France
- Institut National de la Santé et de la Recherche Médicale (INSERM), U900, 75012 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75005 Paris, France
| | - Alexander N. Gorban
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
| |
Collapse
|
16
|
Bastide S, Chomsky E, Saudemont B, Loe-Mie Y, Schmutz S, Novault S, Marlow H, Tanay A, Spitz F. TATTOO-seq delineates spatial and cell type-specific regulatory programs in the developing limb. SCIENCE ADVANCES 2022; 8:eadd0695. [PMID: 36516250 PMCID: PMC9750149 DOI: 10.1126/sciadv.add0695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 11/14/2022] [Indexed: 06/17/2023]
Abstract
The coordinated differentiation of progenitor cells into specialized cell types and their spatial organization into distinct domains is central to embryogenesis. Here, we developed and applied an unbiased spatially resolved single-cell transcriptomics method to identify the genetic programs underlying the emergence of specialized cell types during mouse limb development and their spatial integration. We identify multiple transcription factors whose expression patterns are predominantly associated with cell type specification or spatial position, suggesting two parallel yet highly interconnected regulatory systems. We demonstrate that the embryonic limb undergoes a complex multiscale reorganization upon perturbation of one of its spatial organizing centers, including the loss of specific cell populations, alterations of preexisting cell states' molecular identities, and changes in their relative spatial distribution. Our study shows how multidimensional single-cell, spatially resolved molecular atlases can allow the deconvolution of spatial identity and cell fate and reveal the interconnected genetic networks that regulate organogenesis and its reorganization upon genetic alterations.
Collapse
Affiliation(s)
- Sébastien Bastide
- (Epi)genomics of Animal Development, Department of Developmental and Stem Cell Biology, Institut Pasteur, Paris, France
- École Doctorale “Complexité du Vivant”, Sorbonne Université, 75005 Paris, France
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA
| | - Elad Chomsky
- Department of Computer Science and Applied Mathematics, Weizmann Institute, Rehovot, Israel
- Department of Biological Regulation, Weizmann Institute, Rehovot, Israel
| | - Baptiste Saudemont
- (Epi)genomics of Animal Development, Department of Developmental and Stem Cell Biology, Institut Pasteur, Paris, France
| | - Yann Loe-Mie
- (Epi)genomics of Animal Development, Department of Developmental and Stem Cell Biology, Institut Pasteur, Paris, France
- Hub de Bioinformatique et Biostatistique, Département Biologie Computationnelle, Institut Pasteur, Paris, France
| | - Sandrine Schmutz
- Cytometry and Biomarkers, Center for Technological Resources and Research, Institut Pasteur, Paris, France
| | - Sophie Novault
- Cytometry and Biomarkers, Center for Technological Resources and Research, Institut Pasteur, Paris, France
| | - Heather Marlow
- (Epi)genomics of Animal Development, Department of Developmental and Stem Cell Biology, Institut Pasteur, Paris, France
- Department of Organismal Biology and Anatomy, The University of Chicago, Chicago, IL, USA
| | - Amos Tanay
- Department of Computer Science and Applied Mathematics, Weizmann Institute, Rehovot, Israel
| | - François Spitz
- (Epi)genomics of Animal Development, Department of Developmental and Stem Cell Biology, Institut Pasteur, Paris, France
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA
| |
Collapse
|
17
|
Bonnaire T, Decelle A, Aghanim N. Regularization of Mixture Models for Robust Principal Graph Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:9119-9130. [PMID: 34757901 DOI: 10.1109/tpami.2021.3124973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
A regularized version of Mixture Models is proposed to learn a principal graph from a distribution of D-dimensional datapoints. In the particular case of manifold learning for ridge detection, we assume that the underlying structure can be modeled as a graph acting like a topological prior for the Gaussian clusters turning the problem into a maximum a posteriori estimation. Parameters of the model are iteratively estimated through an Expectation-Maximization procedure making the learning of the structure computationally efficient with guaranteed convergence for any graph prior in a polynomial time. We also embed in the formalism a natural way to make the algorithm robust to outliers of the pattern and heteroscedasticity of the manifold sampling coherently with the graph structure. The method uses a graph prior given by the minimum spanning tree that we extend using random sub-samplings of the dataset to take into account cycles that can be observed in the spatial distribution.
Collapse
|
18
|
Kirdin A, Sidorov S, Zolotykh N. Rosenblatt's First Theorem and Frugality of Deep Learning. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1635. [PMID: 36359726 PMCID: PMC9689667 DOI: 10.3390/e24111635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 11/02/2022] [Accepted: 11/06/2022] [Indexed: 06/16/2023]
Abstract
The Rosenblatt's first theorem about the omnipotence of shallow networks states that elementary perceptrons can solve any classification problem if there are no discrepancies in the training set. Minsky and Papert considered elementary perceptrons with restrictions on the neural inputs: a bounded number of connections or a relatively small diameter of the receptive field for each neuron at the hidden layer. They proved that under these constraints, an elementary perceptron cannot solve some problems, such as the connectivity of input images or the parity of pixels in them. In this note, we demonstrated Rosenblatt's first theorem at work, showed how an elementary perceptron can solve a version of the travel maze problem, and analysed the complexity of that solution. We also constructed a deep network algorithm for the same problem. It is much more efficient. The shallow network uses an exponentially large number of neurons on the hidden layer (Rosenblatt's A-elements), whereas for the deep network, the second-order polynomial complexity is sufficient. We demonstrated that for the same complex problem, the deep network can be much smaller and reveal a heuristic behind this effect.
Collapse
Affiliation(s)
- Alexander Kirdin
- Institute of Information Technologies, Mathematics and Mechanics, Lobachevsky State University, 603022 Nizhni Novgorod, Russia
- Institute for Computational Modelling, Russian Academy of Sciences, Siberian Branch, 660036 Krasnoyarsk, Russia
| | - Sergey Sidorov
- Institute of Information Technologies, Mathematics and Mechanics, Lobachevsky State University, 603022 Nizhni Novgorod, Russia
| | - Nikolai Zolotykh
- Institute of Information Technologies, Mathematics and Mechanics, Lobachevsky State University, 603022 Nizhni Novgorod, Russia
| |
Collapse
|
19
|
Lysov M, Maximova I, Vasiliev E, Getmanskaya A, Turlapov V. Entropy as a High-Level Feature for XAI-Based Early Plant Stress Detection. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1597. [PMID: 36359687 PMCID: PMC9689005 DOI: 10.3390/e24111597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 10/17/2022] [Accepted: 10/26/2022] [Indexed: 06/16/2023]
Abstract
This article is devoted to searching for high-level explainable features that can remain explainable for a wide class of objects or phenomena and become an integral part of explainable AI (XAI). The present study involved a 25-day experiment on early diagnosis of wheat stress using drought stress as an example. The state of the plants was periodically monitored via thermal infrared (TIR) and hyperspectral image (HSI) cameras. A single-layer perceptron (SLP)-based classifier was used as the main instrument in the XAI study. To provide explainability of the SLP input, the direct HSI was replaced by images of six popular vegetation indices and three HSI channels (R630, G550, and B480; referred to as indices), along with the TIR image. Furthermore, in the explainability analysis, each of the 10 images was replaced by its 6 statistical features: min, max, mean, std, max-min, and the entropy. For the SLP output explainability, seven output neurons corresponding to the key states of the plants were chosen. The inner layer of the SLP was constructed using 15 neurons, including 10 corresponding to the indices and 5 reserved neurons. The classification possibilities of all 60 features and 10 indices of the SLP classifier were studied. Study result: Entropy is the earliest high-level stress feature for all indices; entropy and an entropy-like feature (max-min) paired with one of the other statistical features can provide, for most indices, 100% accuracy (or near 100%), serving as an integral part of XAI.
Collapse
|
20
|
Cuevas-Diaz Duran R, González-Orozco JC, Velasco I, Wu JQ. Single-cell and single-nuclei RNA sequencing as powerful tools to decipher cellular heterogeneity and dysregulation in neurodegenerative diseases. Front Cell Dev Biol 2022; 10:884748. [PMID: 36353512 PMCID: PMC9637968 DOI: 10.3389/fcell.2022.884748] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Accepted: 10/06/2022] [Indexed: 08/10/2023] Open
Abstract
Neurodegenerative diseases affect millions of people worldwide and there are currently no cures. Two types of common neurodegenerative diseases are Alzheimer's (AD) and Parkinson's disease (PD). Single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq) have become powerful tools to elucidate the inherent complexity and dynamics of the central nervous system at cellular resolution. This technology has allowed the identification of cell types and states, providing new insights into cellular susceptibilities and molecular mechanisms underlying neurodegenerative conditions. Exciting research using high throughput scRNA-seq and snRNA-seq technologies to study AD and PD is emerging. Herein we review the recent progress in understanding these neurodegenerative diseases using these state-of-the-art technologies. We discuss the fundamental principles and implications of single-cell sequencing of the human brain. Moreover, we review some examples of the computational and analytical tools required to interpret the extensive amount of data generated from these assays. We conclude by highlighting challenges and limitations in the application of these technologies in the study of AD and PD.
Collapse
Affiliation(s)
| | | | - Iván Velasco
- Instituto de Fisiología Celular—Neurociencias, Universidad Nacional Autónoma de México, Mexico City, Mexico
- Laboratorio de Reprogramación Celular, Instituto Nacional de Neurología y Neurocirugía “Manuel Velasco Suárez”, Mexico City, Mexico
| | - Jia Qian Wu
- The Vivian L. Smith Department of Neurosurgery, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, United States
- Center for Stem Cell and Regenerative Medicine, UT Brown Foundation Institute of Molecular Medicine, Houston, TX, United States
- MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, United States
| |
Collapse
|
21
|
Lin PC, Tsai YS, Yeh YM, Shen MR. Cutting-Edge AI Technologies Meet Precision Medicine to Improve Cancer Care. Biomolecules 2022; 12:1133. [PMID: 36009026 PMCID: PMC9405970 DOI: 10.3390/biom12081133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 08/11/2022] [Accepted: 08/15/2022] [Indexed: 11/18/2022] Open
Abstract
To provide precision medicine for better cancer care, researchers must work on clinical patient data, such as electronic medical records, physiological measurements, biochemistry, computerized tomography scans, digital pathology, and the genetic landscape of cancer tissue. To interpret big biodata in cancer genomics, an operational flow based on artificial intelligence (AI) models and medical management platforms with high-performance computing must be set up for precision cancer genomics in clinical practice. To work in the fast-evolving fields of patient care, clinical diagnostics, and therapeutic services, clinicians must understand the fundamentals of the AI tool approach. Therefore, the present article covers the following four themes: (i) computational prediction of pathogenic variants of cancer susceptibility genes; (ii) AI model for mutational analysis; (iii) single-cell genomics and computational biology; (iv) text mining for identifying gene targets in cancer; and (v) the NVIDIA graphics processing units, DRAGEN field programmable gate arrays systems and AI medical cloud platforms in clinical next-generation sequencing laboratories. Based on AI medical platforms and visualization, large amounts of clinical biodata can be rapidly copied and understood using an AI pipeline. The use of innovative AI technologies can deliver more accurate and rapid cancer therapy targets.
Collapse
Affiliation(s)
- Peng-Chan Lin
- Department of Oncology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
- Department of Genomic Medicine, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
| | - Yi-Shan Tsai
- Department of Medical Imaging, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
| | - Yu-Min Yeh
- Department of Oncology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
| | - Meng-Ru Shen
- Institute of Clinical Medicine, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
- Department of Obstetrics and Gynecology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
- Department of Pharmacology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan 704, Taiwan
| |
Collapse
|
22
|
Petitpré C, Faure L, Uhl P, Fontanet P, Filova I, Pavlinkova G, Adameyko I, Hadjab S, Lallemend F. Single-cell RNA-sequencing analysis of the developing mouse inner ear identifies molecular logic of auditory neuron diversification. Nat Commun 2022; 13:3878. [PMID: 35790771 PMCID: PMC9256748 DOI: 10.1038/s41467-022-31580-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 06/21/2022] [Indexed: 11/08/2022] Open
Abstract
Different types of spiral ganglion neurons (SGNs) are essential for auditory perception by transmitting complex auditory information from hair cells (HCs) to the brain. Here, we use deep, single cell transcriptomics to study the molecular mechanisms that govern their identity and organization in mice. We identify a core set of temporally patterned genes and gene regulatory networks that may contribute to the diversification of SGNs through sequential binary decisions and demonstrate a role for NEUROD1 in driving specification of a Ic-SGN phenotype. We also find that each trajectory of the decision tree is defined by initial co-expression of alternative subtype molecular controls followed by gradual shifts toward cell fate resolution. Finally, analysis of both developing SGN and HC types reveals cell-cell signaling potentially playing a role in the differentiation of SGNs. Our results indicate that SGN identities are drafted prior to birth and reveal molecular principles that shape their differentiation and will facilitate studies of their development, physiology, and dysfunction.
Collapse
Affiliation(s)
- Charles Petitpré
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Louis Faure
- Department of Neuroimmunology, Center for Brain Research, Medical University Vienna, 1090, Vienna, Austria
| | - Phoebe Uhl
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Paula Fontanet
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Iva Filova
- Institute of Biotechnology CAS, 25250, Vestec, Czech Republic
| | | | - Igor Adameyko
- Department of Neuroimmunology, Center for Brain Research, Medical University Vienna, 1090, Vienna, Austria
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
| | - Saida Hadjab
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.
| | - Francois Lallemend
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.
- Ming-Wai Lau Centre for Reparative Medicine, Stockholm Node, Karolinska Institutet, Stockholm, Sweden.
| |
Collapse
|
23
|
Hu Y, Jiang Y, Behnan J, Ribeiro MM, Kalantzi C, Zhang MD, Lou D, Häring M, Sharma N, Okawa S, Del Sol A, Adameyko I, Svensson M, Persson O, Ernfors P. Neural network learning defines glioblastoma features to be of neural crest perivascular or radial glia lineages. SCIENCE ADVANCES 2022; 8:eabm6340. [PMID: 35675414 PMCID: PMC9177076 DOI: 10.1126/sciadv.abm6340] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Glioblastoma is believed to originate from nervous system cells; however, a putative origin from vessel-associated progenitor cells has not been considered. We deeply single-cell RNA-sequenced glioblastoma progenitor cells of 18 patients and integrated 710 bulk tumors and 73,495 glioma single cells of 100 patients to determine the relation of glioblastoma cells to normal brain cell types. A novel neural network-based projection of the developmental trajectory of normal brain cells uncovered two principal cell-lineage features of glioblastoma, neural crest perivascular and radial glia, carrying defining methylation patterns and survival differences. Consistently, introducing tumorigenic alterations in naïve human brain perivascular cells resulted in brain tumors. Thus, our results suggest that glioblastoma can arise from the brains' vasculature, and patients with such glioblastoma have a significantly poorer outcome.
Collapse
Affiliation(s)
- Yizhou Hu
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Yiwen Jiang
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Jinan Behnan
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Mariana Messias Ribeiro
- Computational Biology Group, Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 4362 Esch-sur-Alzette, Luxembourg
| | - Chrysoula Kalantzi
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Ming-Dong Zhang
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Daohua Lou
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Martin Häring
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Nilesh Sharma
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Satoshi Okawa
- Computational Biology Group, Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 4362 Esch-sur-Alzette, Luxembourg
| | - Antonio Del Sol
- Computational Biology Group, Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 4362 Esch-sur-Alzette, Luxembourg
- CIC bioGUNE, Bizkaia Technology Park, 48160 Derio, Spain
- IKERBASQUE, Basque Foundation for Science, 48013 Bilbao, Spain
| | - Igor Adameyko
- Department of Molecular Neurosciences, Center for Brain Research, Medical University Vienna, Vienna, Austria
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
| | - Mikael Svensson
- Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
- Department of Neurosurgery, Karolinska University Hospital, Stockholm, Sweden
| | - Oscar Persson
- Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
- Department of Neurosurgery, Karolinska University Hospital, Stockholm, Sweden
| | - Patrik Ernfors
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
- Corresponding author.
| |
Collapse
|
24
|
Zinovyev A, Sadovsky M, Calzone L, Fouché A, Groeneveld CS, Chervov A, Barillot E, Gorban AN. Modeling Progression of Single Cell Populations Through the Cell Cycle as a Sequence of Switches. Front Mol Biosci 2022; 8:793912. [PMID: 35178429 PMCID: PMC8846220 DOI: 10.3389/fmolb.2021.793912] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/15/2021] [Indexed: 11/13/2022] Open
Abstract
Cell cycle is a biological process underlying the existence and propagation of life in time and space. It has been an object for mathematical modeling for long, with several alternative mechanistic modeling principles suggested, describing in more or less details the known molecular mechanisms. Recently, cell cycle has been investigated at single cell level in snapshots of unsynchronized cell populations, exploiting the new methods for transcriptomic and proteomic molecular profiling. This raises a need for simplified semi-phenomenological cell cycle models, in order to formalize the processes underlying the cell cycle, at a higher abstracted level. Here we suggest a modeling framework, recapitulating the most important properties of the cell cycle as a limit trajectory of a dynamical process characterized by several internal states with switches between them. In the simplest form, this leads to a limit cycle trajectory, composed by linear segments in logarithmic coordinates describing some extensive (depending on system size) cell properties. We prove a theorem connecting the effective embedding dimensionality of the cell cycle trajectory with the number of its linear segments. We also develop a simplified kinetic model with piecewise-constant kinetic rates describing the dynamics of lumps of genes involved in S-phase and G2/M phases. We show how the developed cell cycle models can be applied to analyze the available single cell datasets and simulate certain properties of the observed cell cycle trajectories. Based on our model, we can predict with good accuracy the cell line doubling time from the length of cell cycle trajectory.
Collapse
Affiliation(s)
- Andrei Zinovyev
- Institut Curie, PSL Research University, Paris, France
- INSERM, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
- *Correspondence: Andrei Zinovyev,
| | - Michail Sadovsky
- Institute of Computational Modeling (RAS), Krasnoyarsk, Russia
- Laboratory of Medical Cybernetics, V.F.Voino-Yasenetsky Krasnoyarsk State Medical University, Krasnoyarsk, Russia
- Federal Research and Clinic Center of FMBA of Russia, Krasnoyarsk, Russia
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, Nizhniy Novgorod, Russia
| | - Laurence Calzone
- Institut Curie, PSL Research University, Paris, France
- INSERM, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Aziz Fouché
- Institut Curie, PSL Research University, Paris, France
- INSERM, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Clarice S. Groeneveld
- Cartes d’Identité des Tumeurs (CIT) Program, Ligue Nationale Contre le Cancer, Paris, France
- Oncologie Moleculaire, UMR144, Institut Curie, Paris, France
| | - Alexander Chervov
- Institut Curie, PSL Research University, Paris, France
- INSERM, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, Paris, France
- INSERM, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Alexander N. Gorban
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, Nizhniy Novgorod, Russia
- Department of Mathematics, University of Leicester, Leicester, United Kingdom
| |
Collapse
|
25
|
Smolander J, Junttila S, Venäläinen MS, Elo LL. scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data. Bioinformatics 2021; 38:1328-1335. [PMID: 34888622 PMCID: PMC8825760 DOI: 10.1093/bioinformatics/btab831] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 11/30/2021] [Accepted: 12/03/2021] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION Computational models are needed to infer a representation of the cells, i.e. a trajectory, from single-cell RNA-sequencing data that model cell differentiation during a dynamic process. Although many trajectory inference methods exist, their performance varies greatly depending on the dataset and hence there is a need to establish more accurate, better generalizable methods. RESULTS We introduce scShaper, a new trajectory inference method that enables accurate linear trajectory inference. The ensemble approach of scShaper generates a continuous smooth pseudotime based on a set of discrete pseudotimes. We demonstrate that scShaper is able to infer accurate trajectories for a variety of trigonometric trajectories, including many for which the commonly used principal curves method fails. A comprehensive benchmarking with state-of-the-art methods revealed that scShaper achieved superior accuracy of the cell ordering and, in particular, the differentially expressed genes. Moreover, scShaper is a fast method with few hyperparameters, making it a promising alternative to the principal curves method for linear pseudotemporal ordering. AVAILABILITY AND IMPLEMENTATION scShaper is available as an R package at https://github.com/elolab/scshaper. The test data are available at https://doi.org/10.5281/zenodo.5734488. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Johannes Smolander
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland
| | - Sini Junttila
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland
| | - Mikko S Venäläinen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland
| | - Laura L Elo
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland,Institute of Biomedicine, University of Turku, 20520 Turku, Finland,To whom correspondence should be addressed.
| |
Collapse
|
26
|
Gorban AN, Tyukina TA, Pokidysheva LI, Smirnova EV. It is useful to analyze correlation graphs: Reply to comments on "Dynamic and thermodynamic models of adaptation". Phys Life Rev 2021; 40:15-23. [PMID: 34836787 DOI: 10.1016/j.plrev.2021.10.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 10/25/2021] [Indexed: 12/22/2022]
Affiliation(s)
- A N Gorban
- Department of Mathematics, University of Leicester, Leicester, UK; Lobachevsky University, Nizhni Novgorod, Russia.
| | - T A Tyukina
- Department of Mathematics, University of Leicester, Leicester, UK; Lobachevsky University, Nizhni Novgorod, Russia.
| | | | - E V Smirnova
- Siberian Federal University, Krasnoyarsk, Russia.
| |
Collapse
|
27
|
Farrar EJ, Hiriart E, Mahmut A, Jagla B, Peal DS, Milan DJ, Butcher JT, Puceat M. OCT4-mediated inflammation induces cell reprogramming at the origin of cardiac valve development and calcification. SCIENCE ADVANCES 2021; 7:eabf7910. [PMID: 34739324 PMCID: PMC8570594 DOI: 10.1126/sciadv.abf7910] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Cell plasticity plays a key role in embryos by maintaining the differentiation potential of progenitors. Whether postnatal somatic cells revert to an embryonic-like naïve state regaining plasticity and redifferentiate into a cell type leading to a disease remains intriguing. Using genetic lineage tracing and single-cell RNA sequencing, we reveal that Oct4 is induced by nuclear factor κB (NFκB) at embyronic day 9.5 in a subset of mouse endocardial cells originating from the anterior heart forming field at the onset of endocardial-to-mesenchymal transition. These cells acquired a chondro-osteogenic fate. OCT4 in adult valvular aortic cells leads to calcification of mouse and human valves. These calcifying cells originate from the Oct4 embryonic lineage. Genetic deletion of Pou5f1 (Pit-Oct-Unc, OCT4) in the endocardial cell lineage prevents aortic stenosis and calcification of ApoE−/− mouse valve. We established previously unidentified self-cell reprogramming NFκB- and OCT4-mediated inflammatory pathway triggering a dose-dependent mechanism of valve calcification.
Collapse
Affiliation(s)
- Emily J. Farrar
- Nancy E. and Peter C. Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA
| | - Emilye Hiriart
- INSERM U1251, Aix-Marseille University, MMG, Marseille, France
| | - Ablajan Mahmut
- Nancy E. and Peter C. Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA
| | - Bernd Jagla
- Pasteur Institute, Cytometry and Biomarkers Unit of Technology and Service, C2RT, & Hub de Bioinformatique et Biostatistique–Département Biologie Computationnelle, Paris, France
| | - David S. Peal
- Cardiovascular Research Center, Cardiology Division, Massachusetts General Hospital Research Institute, Harvard Medical School, Boston, MA 02114, USA
| | - David J. Milan
- Cardiovascular Research Center, Cardiology Division, Massachusetts General Hospital Research Institute, Harvard Medical School, Boston, MA 02114, USA
| | - Jonathan T. Butcher
- Nancy E. and Peter C. Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA
- Corresponding author. (M.P.); (J.B.)
| | - Michel Puceat
- INSERM U1251, Aix-Marseille University, MMG, Marseille, France
- Corresponding author. (M.P.); (J.B.)
| |
Collapse
|
28
|
Stein DF, Chen H, Vinyard ME, Qin Q, Combs RD, Zhang Q, Pinello L. singlecellVR: Interactive Visualization of Single-Cell Data in Virtual Reality. Front Genet 2021; 12:764170. [PMID: 34777482 PMCID: PMC8582280 DOI: 10.3389/fgene.2021.764170] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 09/27/2021] [Indexed: 11/21/2022] Open
Abstract
Single-cell assays have transformed our ability to model heterogeneity within cell populations. As these assays have advanced in their ability to measure various aspects of molecular processes in cells, computational methods to analyze and meaningfully visualize such data have required matched innovation. Independently, Virtual Reality (VR) has recently emerged as a powerful technology to dynamically explore complex data and shows promise for adaptation to challenges in single-cell data visualization. However, adopting VR for single-cell data visualization has thus far been hindered by expensive prerequisite hardware or advanced data preprocessing skills. To address current shortcomings, we present singlecellVR, a user-friendly web application for visualizing single-cell data, designed for cheap and easily available virtual reality hardware (e.g., Google Cardboard, ∼$8). singlecellVR can visualize data from a variety of sequencing-based technologies including transcriptomic, epigenomic, and proteomic data as well as combinations thereof. Analysis modalities supported include approaches to clustering as well as trajectory inference and visualization of dynamical changes discovered through modelling RNA velocity. We provide a companion software package, scvr to streamline data conversion from the most widely-adopted single-cell analysis tools as well as a growing database of pre-analyzed datasets to which users can contribute.
Collapse
Affiliation(s)
- David F. Stein
- Graduate School of Biomedical Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, United States
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, United States
| | - Huidong Chen
- Molecular Pathology Unit, Massachusetts General Hospital, Charlestown, MA, United States
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Pathology, Harvard Medical School, Boston, MA, United States
- Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Michael E. Vinyard
- Molecular Pathology Unit, Massachusetts General Hospital, Charlestown, MA, United States
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Pathology, Harvard Medical School, Boston, MA, United States
- Broad Institute of Harvard and MIT, Cambridge, MA, United States
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, United States
| | - Qian Qin
- Molecular Pathology Unit, Massachusetts General Hospital, Charlestown, MA, United States
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Pathology, Harvard Medical School, Boston, MA, United States
- Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Rebecca D. Combs
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, United States
- Winsor School, Boston, MA, United States
| | - Qian Zhang
- Molecular Pathology Unit, Massachusetts General Hospital, Charlestown, MA, United States
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Pathology, Harvard Medical School, Boston, MA, United States
- Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Luca Pinello
- Molecular Pathology Unit, Massachusetts General Hospital, Charlestown, MA, United States
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Pathology, Harvard Medical School, Boston, MA, United States
- Broad Institute of Harvard and MIT, Cambridge, MA, United States
| |
Collapse
|
29
|
Bac J, Mirkes EM, Gorban AN, Tyukin I, Zinovyev A. Scikit-Dimension: A Python Package for Intrinsic Dimension Estimation. ENTROPY (BASEL, SWITZERLAND) 2021; 23:1368. [PMID: 34682092 PMCID: PMC8534554 DOI: 10.3390/e23101368] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 10/10/2021] [Accepted: 10/16/2021] [Indexed: 02/07/2023]
Abstract
Dealing with uncertainty in applications of machine learning to real-life data critically depends on the knowledge of intrinsic dimensionality (ID). A number of methods have been suggested for the purpose of estimating ID, but no standard package to easily apply them one by one or all at once has been implemented in Python. This technical note introduces scikit-dimension, an open-source Python package for intrinsic dimension estimation. The scikit-dimension package provides a uniform implementation of most of the known ID estimators based on the scikit-learn application programming interface to evaluate the global and local intrinsic dimension, as well as generators of synthetic toy and benchmark datasets widespread in the literature. The package is developed with tools assessing the code quality, coverage, unit testing and continuous integration. We briefly describe the package and demonstrate its use in a large-scale (more than 500 datasets) benchmarking of methods for ID estimation for real-life and synthetic data.
Collapse
Affiliation(s)
- Jonathan Bac
- Institut Curie, PSL Research University, 75248 Paris, France
- INSERM, U900, 75248 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75272 Paris, France
| | - Evgeny M. Mirkes
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (E.M.M.); (A.N.G.); (I.T.)
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhniy Novgorod, Russia
| | - Alexander N. Gorban
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (E.M.M.); (A.N.G.); (I.T.)
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhniy Novgorod, Russia
| | - Ivan Tyukin
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (E.M.M.); (A.N.G.); (I.T.)
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhniy Novgorod, Russia
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, 75248 Paris, France
- INSERM, U900, 75248 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75272 Paris, France
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhniy Novgorod, Russia
| |
Collapse
|
30
|
Gorban AN, Grechuk B, Mirkes EM, Stasenko SV, Tyukin IY. High-Dimensional Separability for One- and Few-Shot Learning. ENTROPY (BASEL, SWITZERLAND) 2021; 23:1090. [PMID: 34441230 PMCID: PMC8392747 DOI: 10.3390/e23081090] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 08/08/2021] [Accepted: 08/13/2021] [Indexed: 12/31/2022]
Abstract
This work is driven by a practical question: corrections of Artificial Intelligence (AI) errors. These corrections should be quick and non-iterative. To solve this problem without modification of a legacy AI system, we propose special 'external' devices, correctors. Elementary correctors consist of two parts, a classifier that separates the situations with high risk of error from the situations in which the legacy AI system works well and a new decision that should be recommended for situations with potential errors. Input signals for the correctors can be the inputs of the legacy AI system, its internal signals, and outputs. If the intrinsic dimensionality of data is high enough then the classifiers for correction of small number of errors can be very simple. According to the blessing of dimensionality effects, even simple and robust Fisher's discriminants can be used for one-shot learning of AI correctors. Stochastic separation theorems provide the mathematical basis for this one-short learning. However, as the number of correctors needed grows, the cluster structure of data becomes important and a new family of stochastic separation theorems is required. We refuse the classical hypothesis of the regularity of the data distribution and assume that the data can have a rich fine-grained structure with many clusters and corresponding peaks in the probability density. New stochastic separation theorems for data with fine-grained structure are formulated and proved. On the basis of these theorems, the multi-correctors for granular data are proposed. The advantages of the multi-corrector technology were demonstrated by examples of correcting errors and learning new classes of objects by a deep convolutional neural network on the CIFAR-10 dataset. The key problems of the non-classical high-dimensional data analysis are reviewed together with the basic preprocessing steps including the correlation transformation, supervised Principal Component Analysis (PCA), semi-supervised PCA, transfer component analysis, and new domain adaptation PCA.
Collapse
Affiliation(s)
- Alexander N. Gorban
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (B.G.); (E.M.M.); (I.Y.T.)
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhni Novgorod, Russia;
| | - Bogdan Grechuk
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (B.G.); (E.M.M.); (I.Y.T.)
| | - Evgeny M. Mirkes
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (B.G.); (E.M.M.); (I.Y.T.)
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhni Novgorod, Russia;
| | - Sergey V. Stasenko
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhni Novgorod, Russia;
| | - Ivan Y. Tyukin
- Department of Mathematics, University of Leicester, Leicester LE1 7RH, UK; (B.G.); (E.M.M.); (I.Y.T.)
- Laboratory of Advanced Methods for High-Dimensional Data Analysis, Lobachevsky University, 603105 Nizhni Novgorod, Russia;
- Department of Geoscience and Petroleum, Norwegian University of Science and Technology, 7491 Trondheim, Norway
| |
Collapse
|
31
|
Kuznetsov VV, Moskalenko VA, Gribanov DV, Zolotykh NY. Interpretable Feature Generation in ECG Using a Variational Autoencoder. Front Genet 2021; 12:638191. [PMID: 33868375 PMCID: PMC8049433 DOI: 10.3389/fgene.2021.638191] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Accepted: 03/01/2021] [Indexed: 12/22/2022] Open
Abstract
We propose a method for generating an electrocardiogram (ECG) signal for one cardiac cycle using a variational autoencoder. Our goal was to encode the original ECG signal using as few features as possible. Using this method we extracted a vector of new 25 features, which in many cases can be interpreted. The generated ECG has quite natural appearance. The low value of the Maximum Mean Discrepancy metric, 3.83 × 10−3, indicates good quality of ECG generation too. The extracted new features will help to improve the quality of automatic diagnostics of cardiovascular diseases. Generating new synthetic ECGs will allow us to solve the issue of the lack of labeled ECG for using them in supervised learning.
Collapse
Affiliation(s)
- V V Kuznetsov
- Institute of Information Technologies, Mathematics, and Mechanics, Lobachevsky State University of Nizhni Novgorod, Nizhni Novgorod, Russia
| | - V A Moskalenko
- Institute of Information Technologies, Mathematics, and Mechanics, Lobachevsky State University of Nizhni Novgorod, Nizhni Novgorod, Russia.,Mathematics of Future Technologies Center, Lobachevsky State University of Nizhni Novgorod, Nizhni Novgorod, Russia
| | - D V Gribanov
- Laboratory of Algorithms and Technologies for Networks Analysis, National Research University Higher School of Economics, Nizhni Novgorod, Russia
| | - Nikolai Yu Zolotykh
- Institute of Information Technologies, Mathematics, and Mechanics, Lobachevsky State University of Nizhni Novgorod, Nizhni Novgorod, Russia.,Mathematics of Future Technologies Center, Lobachevsky State University of Nizhni Novgorod, Nizhni Novgorod, Russia
| |
Collapse
|
32
|
Golovenkin SE, Bac J, Chervov A, Mirkes EM, Orlova YV, Barillot E, Gorban AN, Zinovyev A. Trajectories, bifurcations, and pseudo-time in large clinical datasets: applications to myocardial infarction and diabetes data. Gigascience 2020; 9:giaa128. [PMID: 33241287 PMCID: PMC7688475 DOI: 10.1093/gigascience/giaa128] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 09/30/2020] [Accepted: 10/22/2020] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Large observational clinical datasets are becoming increasingly available for mining associations between various disease traits and administered therapy. These datasets can be considered as representations of the landscape of all possible disease conditions, in which a concrete disease state develops through stereotypical routes, characterized by "points of no return" and "final states" (such as lethal or recovery states). Extracting this information directly from the data remains challenging, especially in the case of synchronic (with a short-term follow-up) observations. RESULTS Here we suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values, through modeling the geometrical data structure as a bouquet of bifurcating clinical trajectories. The methodology is based on application of elastic principal graphs, which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection, and quantifying the geodesic distances (pseudo-time) in partially ordered sequences of observations. The methodology allows a patient to be positioned on a particular clinical trajectory (pathological scenario) and the degree of progression along it to be characterized with a qualitative estimate of the uncertainty of the prognosis. We developed a tool ClinTrajan for clinical trajectory analysis implemented in the Python programming language. We test the methodology in 2 large publicly available datasets: myocardial infarction complications and readmission of diabetic patients data. CONCLUSIONS Our pseudo-time quantification-based approach makes it possible to apply the methods developed for dynamical disease phenotyping and illness trajectory analysis (diachronic data analysis) to synchronic observational data.
Collapse
Affiliation(s)
- Sergey E Golovenkin
- Prof. V.F. Voino-Yasenetsky Krasnoyarsk State Medical University, 660022 Krasnoyarsk, Russia
| | - Jonathan Bac
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| | - Alexander Chervov
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| | - Evgeny M Mirkes
- Centre for Artificial Intelligence, Data Analytics and Modelling, University of Leicester, LE1 7RH Leicester, UK
- Laboratory of advanced methods for high-dimensional data analysis, Lobachevsky University, 603000 Nizhny Novgorod, Russia
| | - Yuliya V Orlova
- Prof. V.F. Voino-Yasenetsky Krasnoyarsk State Medical University, 660022 Krasnoyarsk, Russia
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| | - Alexander N Gorban
- Centre for Artificial Intelligence, Data Analytics and Modelling, University of Leicester, LE1 7RH Leicester, UK
- Laboratory of advanced methods for high-dimensional data analysis, Lobachevsky University, 603000 Nizhny Novgorod, Russia
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| |
Collapse
|
33
|
Minimum Spanning vs. Principal Trees for Structured Approximations of Multi-Dimensional Datasets. ENTROPY 2020; 22:e22111274. [PMID: 33287042 PMCID: PMC7711596 DOI: 10.3390/e22111274] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 11/06/2020] [Accepted: 11/07/2020] [Indexed: 12/27/2022]
Abstract
Construction of graph-based approximations for multi-dimensional data point clouds is widely used in a variety of areas. Notable examples of applications of such approximators are cellular trajectory inference in single-cell data analysis, analysis of clinical trajectories from synchronic datasets, and skeletonization of images. Several methods have been proposed to construct such approximating graphs, with some based on computation of minimum spanning trees and some based on principal graphs generalizing principal curves. In this article we propose a methodology to compare and benchmark these two graph-based data approximation approaches, as well as to define their hyperparameters. The main idea is to avoid comparing graphs directly, but at first to induce clustering of the data point cloud from the graph approximation and, secondly, to use well-established methods to compare and score the data cloud partitioning induced by the graphs. In particular, mutual information-based approaches prove to be useful in this context. The induced clustering is based on decomposing a graph into non-branching segments, and then clustering the data point cloud by the nearest segment. Such a method allows efficient comparison of graph-based data approximations of arbitrary topology and complexity. The method is implemented in Python using the standard scikit-learn library which provides high speed and efficiency. As a demonstration of the methodology we analyse and compare graph-based data approximation methods using synthetic as well as real-life single cell datasets.
Collapse
|
34
|
Faure L, Wang Y, Kastriti ME, Fontanet P, Cheung KKY, Petitpré C, Wu H, Sun LL, Runge K, Croci L, Landy MA, Lai HC, Consalez GG, de Chevigny A, Lallemend F, Adameyko I, Hadjab S. Single cell RNA sequencing identifies early diversity of sensory neurons forming via bi-potential intermediates. Nat Commun 2020; 11:4175. [PMID: 32826903 PMCID: PMC7442800 DOI: 10.1038/s41467-020-17929-4] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 07/23/2020] [Indexed: 12/23/2022] Open
Abstract
Somatic sensation is defined by the existence of a diversity of primary sensory neurons with unique biological features and response profiles to external and internal stimuli. However, there is no coherent picture about how this diversity of cell states is transcriptionally generated. Here, we use deep single cell analysis to resolve fate splits and molecular biasing processes during sensory neurogenesis in mice. Our results identify a complex series of successive and specific transcriptional changes in post-mitotic neurons that delineate hierarchical regulatory states leading to the generation of the main sensory neuron classes. In addition, our analysis identifies previously undetected early gene modules expressed long before fate determination although being clearly associated with defined sensory subtypes. Overall, the early diversity of sensory neurons is generated through successive bi-potential intermediates in which synchronization of relevant gene modules and concurrent repression of competing fate programs precede cell fate stabilization and final commitment.
Collapse
Affiliation(s)
- Louis Faure
- Department of Molecular Neurosciences, Center for Brain Research, Medical University Vienna, 1090, Vienna, Austria
| | - Yiqiao Wang
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Maria Eleni Kastriti
- Department of Molecular Neurosciences, Center for Brain Research, Medical University Vienna, 1090, Vienna, Austria
| | - Paula Fontanet
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Kylie K Y Cheung
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Charles Petitpré
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Haohao Wu
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Lynn Linyu Sun
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Karen Runge
- INMED INSERM U1249, Aix-Marseille University, Marseille, France
| | - Laura Croci
- Università Vita-Salute San Raffaele, 20132, Milan, Italy
| | - Mark A Landy
- Department of Neuroscience, UT Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390, USA
| | - Helen C Lai
- Department of Neuroscience, UT Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390, USA
| | | | | | - François Lallemend
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
- Ming-Wai Lau Centre for Reparative Medicine, Stockholm node, Karolinska Institutet, Stockholm, Sweden
| | - Igor Adameyko
- Department of Molecular Neurosciences, Center for Brain Research, Medical University Vienna, 1090, Vienna, Austria
- Department of Physiology and Pharmacology, Karolinska Institutet, 17177, Stockholm, Sweden
| | - Saida Hadjab
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.
| |
Collapse
|
35
|
Katebi A, Kohar V, Lu M. Random Parametric Perturbations of Gene Regulatory Circuit Uncover State Transitions in Cell Cycle. iScience 2020; 23:101150. [PMID: 32450514 PMCID: PMC7251928 DOI: 10.1016/j.isci.2020.101150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Revised: 03/05/2020] [Accepted: 05/05/2020] [Indexed: 02/03/2023] Open
Abstract
Many biological processes involve precise cellular state transitions controlled by complex gene regulation. Here, we use budding yeast cell cycle as a model system and explore how a gene regulatory circuit encodes essential information of state transitions. We present a generalized random circuit perturbation method for circuits containing heterogeneous regulation types and its usage to analyze both steady and oscillatory states from an ensemble of circuit models with random kinetic parameters. The stable steady states form robust clusters with a circular structure that are associated with cell cycle phases. This circular structure in the clusters is consistent with single-cell RNA sequencing data. The oscillatory states specify the irreversible state transitions along cell cycle progression. Furthermore, we identify possible mechanisms to understand the irreversible state transitions from the steady states. We expect this approach to be robust and generally applicable to unbiasedly predict dynamical transitions of a gene regulatory circuit.
Collapse
Affiliation(s)
- Ataur Katebi
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Vivek Kohar
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Mingyang Lu
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA.
| |
Collapse
|