1
|
Schettini GP, Morozyuk M, Biase FH. Identification of novel cattle (Bos taurus) genes and biological insights of their function in pre-implantation embryo development. BMC Genomics 2024; 25:775. [PMID: 39118001 PMCID: PMC11313146 DOI: 10.1186/s12864-024-10685-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 08/02/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND Appropriate regulation of genes expressed in oocytes and embryos is essential for acquisition of developmental competence in mammals. Here, we hypothesized that several genes expressed in oocytes and pre-implantation embryos remain unknown. Our goal was to reconstruct the transcriptome of oocytes (germinal vesicle and metaphase II) and pre-implantation cattle embryos (blastocysts) using short-read and long-read sequences to identify putative new genes. RESULTS We identified 274,342 transcript sequences and 3,033 of those loci do not match a gene present in official annotations and thus are potential new genes. Notably, 63.67% (1,931/3,033) of potential novel genes exhibited coding potential. Also noteworthy, 97.92% of the putative novel genes overlapped annotation with transposable elements. Comparative analysis of transcript abundance identified that 1,840 novel genes (recently added to the annotation) or potential new genes were differentially expressed between developmental stages (FDR < 0.01). We also determined that 522 novel or potential new genes (448 and 34, respectively) were upregulated at eight-cell embryos compared to oocytes (FDR < 0.01). In eight-cell embryos, 102 novel or putative new genes were co-expressed (|r|> 0.85, P < 1 × 10-8) with several genes annotated with gene ontology biological processes related to pluripotency maintenance and embryo development. CRISPR-Cas9 genome editing confirmed that the disruption of one of the novel genes highly expressed in eight-cell embryos reduced blastocyst development (ENSBTAG00000068261, P = 1.55 × 10-7). CONCLUSIONS Our results revealed several putative new genes that need careful annotation. Many of the putative new genes have dynamic regulation during pre-implantation development and are important components of gene regulatory networks involved in pluripotency and blastocyst formation.
Collapse
Affiliation(s)
- Gustavo P Schettini
- School of Animal Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | - Michael Morozyuk
- School of Animal Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | - Fernando H Biase
- School of Animal Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA.
| |
Collapse
|
2
|
Li R, Guan J, Wang Z, Zhou S. A new and effective two-step clustering approach for single cell RNA sequencing data. BMC Genomics 2023; 23:864. [PMID: 37946133 PMCID: PMC10636845 DOI: 10.1186/s12864-023-09577-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 08/10/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND The rapid devolvement of single cell RNA sequencing (scRNA-seq) technology leads to huge amounts of scRNA-seq data, which greatly advance the research of many biomedical fields involving tissue heterogeneity, pathogenesis of disease and drug resistance etc. One major task in scRNA-seq data analysis is to cluster cells in terms of their expression characteristics. Up to now, a number of methods have been proposed to infer cell clusters, yet there is still much space to improve their performance. RESULTS In this paper, we develop a new two-step clustering approach to effectively cluster scRNA-seq data, which is called TSC - the abbreviation of Two-Step Clustering. Particularly, by dividing all cells into two types: core cells (those possibly lying around the centers of clusters) and non-core cells (those locating in the boundary areas of clusters), we first clusters the core cells by hierarchical clustering (the first step) and then assigns the non-core cells to the corresponding nearest clusters (the second step). Extensive experiments on 12 real scRNA-seq datasets show that TSC outperforms the state of the art methods. CONCLUSION TSC is an effective clustering method due to its two-steps clustering strategy, and it is a useful tool for scRNA-seq data analysis.
Collapse
Affiliation(s)
- Ruiyi Li
- Translational Medical Center for Stem Cell Therapy, Shanghai East Hospital, and School of Medicine, Tongji University, 1239 Siping Road, 200092, Shanghai, China
- Department of Computer Science and Technology, Tongji University, 4800 Caoan Road, 201804, Shanghai, China
| | - Jihong Guan
- Department of Computer Science and Technology, Tongji University, 4800 Caoan Road, 201804, Shanghai, China.
| | - Zhiye Wang
- Department of Computer Science and Technology, Tongji University, 4800 Caoan Road, 201804, Shanghai, China
| | - Shuigeng Zhou
- Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, 2005 Songhu Road, 200438, Shanghai, China.
| |
Collapse
|
3
|
Cotner M, Meng S, Jost T, Gardner A, De Santiago C, Brock A. Integration of quantitative methods and mathematical approaches for the modeling of cancer cell proliferation dynamics. Am J Physiol Cell Physiol 2023; 324:C247-C262. [PMID: 36503241 PMCID: PMC9886359 DOI: 10.1152/ajpcell.00185.2022] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 11/21/2022] [Accepted: 11/21/2022] [Indexed: 12/15/2022]
Abstract
Physiological processes rely on the control of cell proliferation, and the dysregulation of these processes underlies various pathological conditions, including cancer. Mathematical modeling can provide new insights into the complex regulation of cell proliferation dynamics. In this review, we first examine quantitative experimental approaches for measuring cell proliferation dynamics in vitro and compare the various types of data that can be obtained in these settings. We then explore the toolbox of common mathematical modeling frameworks that can describe cell behavior, dynamics, and interactions of proliferation. We discuss how these wet-laboratory studies may be integrated with different mathematical modeling approaches to aid the interpretation of the results and to enable the prediction of cell behaviors, specifically in the context of cancer.
Collapse
Affiliation(s)
- Michael Cotner
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas
| | - Sarah Meng
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas
| | - Tyler Jost
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas
| | - Andrea Gardner
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas
| | - Carolina De Santiago
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas
| | - Amy Brock
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas
| |
Collapse
|
4
|
Zhou L, Pan Q. Inverse weighting method with jackknife variance estimator for differential expression analysis of single-cell RNA sequencing data. Comput Biol Chem 2022; 100:107733. [DOI: 10.1016/j.compbiolchem.2022.107733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 07/12/2022] [Indexed: 11/03/2022]
|
5
|
Li Z, Seehawer M, Polyak K. Untangling the web of intratumour heterogeneity. Nat Cell Biol 2022; 24:1192-1201. [PMID: 35941364 DOI: 10.1038/s41556-022-00969-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 06/27/2022] [Indexed: 02/06/2023]
Abstract
Intratumour heterogeneity (ITH) is a hallmark of cancer that drives tumour evolution and disease progression. Technological and computational advances have enabled us to assess ITH at unprecedented depths, yet this accumulating knowledge has not had a substantial clinical impact. This is in part due to a limited understanding of the functional relevance of ITH and the inadequacy of preclinical experimental models to reproduce it. Here, we discuss progress made in these areas and illuminate future directions.
Collapse
Affiliation(s)
- Zheqi Li
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.,Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.,Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Marco Seehawer
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.,Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.,Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Kornelia Polyak
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA. .,Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA. .,Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
6
|
Li RY, Guan J, Zhou S. Boosting scRNA-seq data clustering by cluster-aware feature weighting. BMC Bioinformatics 2021; 22:130. [PMID: 34078287 PMCID: PMC8171019 DOI: 10.1186/s12859-021-04033-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 02/16/2021] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND The rapid development of single-cell RNA sequencing (scRNA-seq) enables the exploration of cell heterogeneity, which is usually done by scRNA-seq data clustering. The essence of scRNA-seq data clustering is to group cells by measuring the similarities among genes/transcripts of cells. And the selection of features for cell similarity evaluation is of great importance, which will significantly impact clustering effectiveness and efficiency. RESULTS In this paper, we propose a novel method called CaFew to select genes based on cluster-aware feature weighting. By optimizing the clustering objective function, CaFew obtains a feature weight matrix, which is further used for feature selection. The genes have large weights in at least one cluster or the genes whose weights vary greatly in different clusters are selected. Experiments on 8 real scRNA-seq datasets show that CaFew can obviously improve the clustering performance of existing scRNA-seq data clustering methods. Particularly, the combination of CaFew with SC3 achieves the state-of-art performance. Furthermore, CaFew also benefits the visualization of scRNA-seq data. CONCLUSION CaFew is an effective scRNA-seq data clustering method due to its gene selection mechanism based on cluster-aware feature weighting, and it is a useful tool for scRNA-seq data analysis.
Collapse
Affiliation(s)
- Rui-Yi Li
- Department of Computer Science and Technology, Tongji University, 4800 Caoan Road, Shanghai, 201804 China
| | - Jihong Guan
- Department of Computer Science and Technology, Tongji University, 4800 Caoan Road, Shanghai, 201804 China
| | - Shuigeng Zhou
- Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, 220 Handan Road, Shanghai, 200433 China
| |
Collapse
|
7
|
Coticchio G, Lagalla C, Sturmey R, Pennetta F, Borini A. The enigmatic morula: mechanisms of development, cell fate determination, self-correction and implications for ART. Hum Reprod Update 2020; 25:422-438. [PMID: 30855681 DOI: 10.1093/humupd/dmz008] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Revised: 01/20/2019] [Accepted: 02/11/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Assisted reproduction technology offers the opportunity to observe the very early stages of human development. However, due to practical constraints, for decades morphological examination of embryo development has been undertaken at a few isolated time points at the stages of fertilisation (Day 1), cleavage (Day 2-3) and blastocyst (Day 5-6). Rather surprisingly, the morula stage (Day 3-4) has been so far neglected, despite its involvement in crucial cellular processes and developmental decisions. OBJECTIVE AND RATIONALE The objective of this review is to collate novel and unsuspected insights into developmental processes occurring during formation of the morula, highlighting the key importance of this stage for a better understanding of preimplantation development and an improvement of ART. SEARCH METHODS PubMed was used to search the MEDLINE database for peer-reviewed English-language original articles and reviews concerning the morula stage in mammals. Searches were performed by adopting 'embryo', 'morula', 'compaction', 'cell fate' and 'IVF/assisted reproduction' as main terms, in association with other keywords expressing concepts relevant to the subject (e.g. cell polarity). The most relevant publications, i.e. those concerning major phenomena occurring during formation of the morula in established experimental models and the human species, were assessed and discussed critically. OUTCOMES Novel live cell imaging technologies and cell biology studies have extended our understanding of morula formation as a key stage for the development of the blastocyst and determination of the inner cell mass (ICM) and the trophectoderm (TE). Cellular processes, such as dynamic formation of filopodia and cytoskeleton-mediated zippering cell-to-cell interactions, intervene to allow cell compaction (a geometrical requisite essential for development) and formation of the blastocoel, respectively. At the same time, differential orientation of cleavage planes, cell polarity and cortical tensile forces interact and cooperate to position blastomeres either internally or externally, thereby influencing their cellular fate. Recent time lapse microscopy (TLM) observations also suggest that in the human the process of compaction may represent an important checkpoint for embryo viability, through which chromosomally abnormal blastomeres are sensed and eliminated by the embryo. WIDER IMPLICATIONS In clinical embryology, the morula stage has been always perceived as a 'black box' in the continuum of preimplantation development. This has dictated its virtual exclusion from mainstream ART procedures. Recent findings described in this review indicate that the morula, and the associated process of compaction, as a crucial stage not only for the formation of the blastocyst, but also for the health of the conceptus. This understanding may open new avenues for innovative approaches to embryo manipulation, assessment and treatment.
Collapse
Affiliation(s)
| | - Cristina Lagalla
- 9.Baby, Family and Fertility Center, Via Dante 15, Bologna, Italy
| | - Roger Sturmey
- Centre for Cardiovascular Metabolic Research, Hull York Medical School, University of Hull, Hull, United Kingdom
| | | | - Andrea Borini
- 9.Baby, Family and Fertility Center, Via Dante 15, Bologna, Italy
| |
Collapse
|
8
|
Keren-Shaul H, Kenigsberg E, Jaitin DA, David E, Paul F, Tanay A, Amit I. MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing. Nat Protoc 2019; 14:1841-1862. [PMID: 31101904 DOI: 10.1038/s41596-019-0164-4] [Citation(s) in RCA: 159] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Accepted: 03/12/2019] [Indexed: 11/09/2022]
Abstract
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4-9 orders of magnitude. Relying solely on unbiased sampling to characterize cellular niches becomes infeasible, as the marginal utility of collecting more cells diminishes quickly. Furthermore, in many clinical samples, the relevant cell types are scarce and efficient processing is critical. We developed an integrated pipeline for index sorting and massively parallel single-cell RNA sequencing (MARS-seq2.0) that builds on our previously published MARS-seq approach. MARS-seq2.0 is based on >1 million cells sequenced with this pipeline and allows identification of unique cell types across different tissues and diseases, as well as unique model systems and organisms. Here, we present a detailed step-by-step procedure for applying the method. In the improved procedure, we combine sub-microliter reaction volumes, optimization of enzymatic mixtures and an enhanced analytical pipeline to substantially lower the cost, improve reproducibility and reduce well-to-well contamination. Data analysis combines multiple layers of quality assessment and error detection and correction, graphically presenting key statistics for library complexity, noise distribution and sequencing saturation. Importantly, our combined FACS and single-cell RNA sequencing (scRNA-seq) workflow enables intuitive approaches for depletion or enrichment of cell populations in a data-driven manner that is essential to efficient sampling of complex tissues. The experimental protocol, from cell sorting to a ready-to-sequence library, takes 2-3 d. Sequencing and processing the data through the analytical pipeline take another 1-2 d.
Collapse
Affiliation(s)
- Hadas Keren-Shaul
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel.,Life Science Core Facility, Weizmann Institute of Science, Rehovot, Israel
| | - Ephraim Kenigsberg
- Precision Immunology Institute, Icahn Institute for Data Science and Genomic Technology, and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Eyal David
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
| | - Franziska Paul
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
| | - Amos Tanay
- Department of Computer Science and Applied Mathematics and Department of Biological Regulation, Weizmann Institute, Rehovot, Israel.
| | - Ido Amit
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
9
|
Single-cell transcriptomic analysis of mouse neocortical development. Nat Commun 2019; 10:134. [PMID: 30635555 PMCID: PMC6329831 DOI: 10.1038/s41467-018-08079-9] [Citation(s) in RCA: 161] [Impact Index Per Article: 32.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 12/14/2018] [Indexed: 01/28/2023] Open
Abstract
The development of the mammalian cerebral cortex depends on careful orchestration of proliferation, maturation, and migration events, ultimately giving rise to a wide variety of neuronal and non-neuronal cell types. To better understand cellular and molecular processes that unfold during late corticogenesis, we perform single-cell RNA-seq on the mouse cerebral cortex at a progenitor driven phase (embryonic day 14.5) and at birth-after neurons from all six cortical layers are born. We identify numerous classes of neurons, progenitors, and glia, their proliferative, migratory, and activation states, and their relatedness within and across age. Using the cell-type-specific expression patterns of genes mutated in neurological and psychiatric diseases, we identify putative disease subtypes that associate with clinical phenotypes. Our study reveals the cellular template of a complex neurodevelopmental process, and provides a window into the cellular origins of brain diseases.
Collapse
|