1
|
Jin YT, Tan Y, Gan ZH, Hao YD, Wang TY, Lin H, Tang B. Identification of DNase I hypersensitive sites in the human genome by multiple sequence descriptors. Methods 2024; 229:125-132. [PMID: 38964595 DOI: 10.1016/j.ymeth.2024.06.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 06/01/2024] [Accepted: 06/27/2024] [Indexed: 07/06/2024] Open
Abstract
DNase I hypersensitive sites (DHSs) are chromatin regions highly sensitive to DNase I enzymes. Studying DHSs is crucial for understanding complex transcriptional regulation mechanisms and localizing cis-regulatory elements (CREs). Numerous studies have indicated that disease-related loci are often enriched in DHSs regions, underscoring the importance of identifying DHSs. Although wet experiments exist for DHSs identification, they are often labor-intensive. Therefore, there is a strong need to develop computational methods for this purpose. In this study, we used experimental data to construct a benchmark dataset. Seven feature extraction methods were employed to capture information about human DHSs. The F-score was applied to filter the features. By comparing the prediction performance of various classification algorithms through five-fold cross-validation, random forest was proposed to perform the final model construction. The model could produce an overall prediction accuracy of 0.859 with an AUC value of 0.837. We hope that this model can assist scholars conducting DNase research in identifying these sites.
Collapse
Affiliation(s)
- Yan-Ting Jin
- School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China.
| | - Yang Tan
- School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Zhong-Hua Gan
- Department of Pathology, The Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Yu-Duo Hao
- School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China.
| | - Tian-Yu Wang
- School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China.
| | - Bo Tang
- Department of Pathology, The Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, Sichuan, China.
| |
Collapse
|
2
|
MacCarthy CM, Wu G, Malik V, Menuchin-Lasowski Y, Velychko T, Keshet G, Fan R, Bedzhov I, Church GM, Jauch R, Cojocaru V, Schöler HR, Velychko S. Highly cooperative chimeric super-SOX induces naive pluripotency across species. Cell Stem Cell 2024; 31:127-147.e9. [PMID: 38141611 DOI: 10.1016/j.stem.2023.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 09/02/2023] [Accepted: 11/20/2023] [Indexed: 12/25/2023]
Abstract
Our understanding of pluripotency remains limited: iPSC generation has only been established for a few model species, pluripotent stem cell lines exhibit inconsistent developmental potential, and germline transmission has only been demonstrated for mice and rats. By swapping structural elements between Sox2 and Sox17, we built a chimeric super-SOX factor, Sox2-17, that enhanced iPSC generation in five tested species: mouse, human, cynomolgus monkey, cow, and pig. A swap of alanine to valine at the interface between Sox2 and Oct4 delivered a gain of function by stabilizing Sox2/Oct4 dimerization on DNA, enabling generation of high-quality OSKM iPSCs capable of supporting the development of healthy all-iPSC mice. Sox2/Oct4 dimerization emerged as the core driver of naive pluripotency with its levels diminished upon priming. Transient overexpression of the SK cocktail (Sox+Klf4) restored the dimerization and boosted the developmental potential of pluripotent stem cells across species, providing a universal method for naive reset in mammals.
Collapse
Affiliation(s)
| | - Guangming Wu
- Max Planck Institute for Molecular Biomedicine, Münster, Germany; International Bio Island, Guangzhou, China; MingCeler Biotech, Guangzhou, China
| | - Vikas Malik
- Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
| | | | - Taras Velychko
- Max Planck Institute for Molecular Biomedicine, Münster, Germany; Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Gal Keshet
- Hebrew University of Jerusalem, Jerusalem, Israel
| | - Rui Fan
- Max Planck Institute for Molecular Biomedicine, Münster, Germany
| | - Ivan Bedzhov
- Max Planck Institute for Molecular Biomedicine, Münster, Germany
| | - George M Church
- Department of Genetics, Harvard Medical School, Boston, MA, USA; Wyss Institute, Harvard University, Boston, MA, USA
| | - Ralf Jauch
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Centre for Translational Stem Cell Biology, Hong Kong SAR, China
| | - Vlad Cojocaru
- Max Planck Institute for Molecular Biomedicine, Münster, Germany; University of Utrecht, Utrecht, the Netherlands; STAR-UBB Institute, Babeş-Bolyai University, Cluj-Napoca, Romania
| | - Hans R Schöler
- Max Planck Institute for Molecular Biomedicine, Münster, Germany.
| | - Sergiy Velychko
- Max Planck Institute for Molecular Biomedicine, Münster, Germany; Department of Genetics, Harvard Medical School, Boston, MA, USA; Wyss Institute, Harvard University, Boston, MA, USA.
| |
Collapse
|
3
|
Long C, Li H, Liang P, Chao L, Hong Y, Zhang J, Xi Q, Zuo Y. Deciphering the decisive factors driving fate bifurcations in somatic cell reprogramming. MOLECULAR THERAPY. NUCLEIC ACIDS 2023; 34:102044. [PMID: 37869261 PMCID: PMC10585637 DOI: 10.1016/j.omtn.2023.102044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 09/28/2023] [Indexed: 10/24/2023]
Abstract
Single-cell studies have demonstrated that somatic cell reprogramming is a continuous process of cell fates transition. Only partial reprogramming intermediates can overcome the molecular bottlenecks to acquire pluripotency. To decipher the underlying decisive factors driving cell fate, we identified induced pluripotent stem cells or stromal-like cells (iPSCs/SLCs) and iPSCs or trophoblast-like cells (iPSCs/TLCs) fate bifurcations by reconstructing cellular trajectory. The mesenchymal-epithelial transition and the activation of pluripotency networks are the main molecular series in successful reprogramming. Correspondingly, intermediates diverge into SLCs accompanied by the inhibition of cell cycle genes and the activation of extracellular matrix genes, whereas the TLCs fate is characterized by the up-regulation of placenta development genes. Combining putative gene regulatory networks, seven (Taf7, Ezh2, Klf2, etc.) and three key factors (Cdc5l, Klf4, and Nanog) were individually identified as drivers of the successful reprogramming by triggering downstream pluripotent networks during iPSCs/SLCs and iPSCs/TLCs fate bifurcation. Conversely, 11 factors (Cebpb, Sox4, Junb, etc.) and four factors (Gata2, Jund, Ctnnb1, etc.) drive SLCs fate and TLCs fate, respectively. Our study sheds new light on the understanding of decisive factors driving cell fate, which is helpful for improving reprogramming efficiency through manipulating cell fates to avoid alternative fates.
Collapse
Affiliation(s)
- Chunshen Long
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Hanshuang Li
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Pengfei Liang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Lemuge Chao
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Yan Hong
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Junping Zhang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
- School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China
| | - Qilemuge Xi
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| |
Collapse
|
4
|
Su D, Xiong Y, Wei H, Wang S, Ke J, Liang P, Zhang H, Yu Y, Zuo Y, Yang L. Integrated analysis of ovarian cancer patients from prospective transcription factor activity reveals subtypes of prognostic significance. Heliyon 2023; 9:e16147. [PMID: 37215759 PMCID: PMC10199194 DOI: 10.1016/j.heliyon.2023.e16147] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 05/04/2023] [Accepted: 05/07/2023] [Indexed: 05/24/2023] Open
Abstract
Transcription factors are protein molecules that act as regulators of gene expression. Aberrant protein activity of transcription factors can have a significant impact on tumor progression and metastasis in tumor patients. In this study, 868 immune-related transcription factors were identified from the transcription factor activity profile of 1823 ovarian cancer patients. The prognosis-related transcription factors were identified through univariate Cox analysis and random survival tree analysis, and two distinct clustering subtypes were subsequently derived based on these transcription factors. We assessed the clinical significance and genomics landscape of the two clustering subtypes and found statistically significant differences in prognosis, response to immunotherapy, and chemotherapy among ovarian cancer patients with different subtypes. Multi-scale Embedded Gene Co-expression Network Analysis was used to identify differential gene modules between the two clustering subtypes, which allowed us to conduct further analysis of biological pathways that exhibited significant differences between them. Finally, a ceRNA network was constructed to analyze lncRNA-miRNA-mRNA regulatory pairs with differential expression levels between two clustering subtypes. We expected that our study may provide some useful references for stratifying and treating patients with ovarian cancer.
Collapse
Affiliation(s)
- Dongqing Su
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yuqiang Xiong
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Haodong Wei
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Shiyuan Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Jiawei Ke
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Pengfei Liang
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Haoxin Zhang
- Department of Gastrointestinal Oncology, Harbin Medical University Cancer Hospital, Harbin 150081, China
| | - Yao Yu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yongchun Zuo
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot, 010010, China
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| |
Collapse
|
5
|
Li H, Long C, Hong Y, Luo L, Zuo Y. Characterizing Cellular Differentiation Potency and Waddington Landscape via Energy Indicator. RESEARCH (WASHINGTON, D.C.) 2023; 6:0118. [PMID: 37223479 PMCID: PMC10202187 DOI: 10.34133/research.0118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Accepted: 03/20/2023] [Indexed: 05/25/2023]
Abstract
The precise characterization of cellular differentiation potency remains an open question, which is fundamentally important for deciphering the dynamics mechanism related to cell fate transition. We quantitatively evaluated the differentiation potency of different stem cells based on the Hopfield neural network (HNN). The results emphasized that cellular differentiation potency can be approximated by Hopfield energy values. We then profiled the Waddington energy landscape of embryogenesis and cell reprogramming processes. The energy landscape at single-cell resolution further confirmed that cell fate decision is progressively specified in a continuous process. Moreover, the transition of cells from one steady state to another in embryogenesis and cell reprogramming processes was dynamically simulated on the energy ladder. These two processes can be metaphorized as the motion of descending and ascending ladders, respectively. We further deciphered the dynamics of the gene regulatory network (GRN) for driving cell fate transition. Our study proposes a new energy indicator to quantitatively characterize cellular differentiation potency without prior knowledge, facilitating the further exploration of the potential mechanism of cellular plasticity.
Collapse
Affiliation(s)
- Hanshuang Li
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences,
Inner Mongolia University, Hohhot 010070, China
| | - Chunshen Long
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences,
Inner Mongolia University, Hohhot 010070, China
| | - Yan Hong
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences,
Inner Mongolia University, Hohhot 010070, China
| | - Liaofu Luo
- Department of Physics,
Inner Mongolia University, Hohhot 010070, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences,
Inner Mongolia University, Hohhot 010070, China
| |
Collapse
|
6
|
Chao L, Yang S, Li H, Long C, Xi Q, Zuo Y. Competitive binding of TET1 and DNMT3A/B cooperates the DNA methylation pattern in human embryonic stem cells. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2022; 1865:194861. [PMID: 35998875 DOI: 10.1016/j.bbagrm.2022.194861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Revised: 08/06/2022] [Accepted: 08/12/2022] [Indexed: 06/15/2023]
Abstract
DNMT3A/B and TET1 play indispensable roles in regulating DNA methylation that undergoes extensive reprogramming during mammalian embryogenesis. Yet the competitive and cooperative relationships between TET1 and DNMT3A/B remain largely unknown in the human embryonic stem cells. Here, we revealed that the main DNA-binding domain of TET1 contains more positive charges by using charge reduction of amino acid alphabet, followed by DNMT3A and DNMT3B. The genome-wide binding profiles showed that TET1 prefers binding to the proximal promoters and CpG islands compared with DNMT3A/B. Moreover, the binding regions of these three transcription factors can be divided into specific and co-binding regions. And a stronger inhibitory effect of DNMT3A on TET1 demethylation was observed in co-binding regions. Furthermore, we integrated TET1 knockout data to further discuss the competitive binding patterns of TET1 and DNMT3A/B. The lack of TET1 increased the occupation of DNMT3A/B at the specific binding regions of TET1 causing focal hypermethylation. The knockout of TET1 was also accompanied by a reduction of DNMT3A/B binding in the co-binding regions, further confirming the cooperative binding function between TET1 and DNMT3A/B. In conclusion, our studies found that the competitive binding of TET1 and DNMT3A/B cooperatively shapes the global DNA methylation pattern in human embryonic stem cells.
Collapse
Affiliation(s)
- Lemuge Chao
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Siqi Yang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Hanshuang Li
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Chunshen Long
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Qilemuge Xi
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China..
| |
Collapse
|
7
|
The Cumulative Formation of R-loop Interacts with Histone Modifications to Shape Cell Reprogramming. Int J Mol Sci 2022; 23:ijms23031567. [PMID: 35163490 PMCID: PMC8835745 DOI: 10.3390/ijms23031567] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 01/26/2022] [Accepted: 01/26/2022] [Indexed: 12/27/2022] Open
Abstract
R-loop, a three-stranded RNA/DNA structure, plays important roles in modulating genome stability and gene expression, but the molecular mechanism of R-loops in cell reprogramming remains elusive. Here, we comprehensively profiled the genome-wide landscape of R-loops during cell reprogramming. The results showed that the R-loop formation on most different types of repetitive elements is stage-specific in cell reprogramming. We unveiled that the cumulative deposition of an R-loop subset is positively correlated with gene expression during reprogramming. More importantly, the dynamic turnover of this R-loop subset is accompanied by the activation of the pluripotent transcriptional regulatory network (TRN). Moreover, the large accumulation of the active histone marker H3K4me3 and the reduction in H3K27me3 were also observed in these R-loop regions. Finally, we characterized the dynamic network of R-loops that facilitates cell fate transitions in reprogramming. Together, our study provides a new clue for deciphering the interplay mechanism between R-loops and HMs to control cell reprogramming.
Collapse
|
8
|
Zhao Z, Yang W, Zhai Y, Liang Y, Zhao Y. Identify DNA-Binding Proteins Through the Extreme Gradient Boosting Algorithm. Front Genet 2022; 12:821996. [PMID: 35154264 PMCID: PMC8837382 DOI: 10.3389/fgene.2021.821996] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 12/07/2021] [Indexed: 12/13/2022] Open
Abstract
The exploration of DNA-binding proteins (DBPs) is an important aspect of studying biological life activities. Research on life activities requires the support of scientific research results on DBPs. The decline in many life activities is closely related to DBPs. Generally, the detection method for identifying DBPs is achieved through biochemical experiments. This method is inefficient and requires considerable manpower, material resources and time. At present, several computational approaches have been developed to detect DBPs, among which machine learning (ML) algorithm-based computational techniques have shown excellent performance. In our experiments, our method uses fewer features and simpler recognition methods than other methods and simultaneously obtains satisfactory results. First, we use six feature extraction methods to extract sequence features from the same group of DBPs. Then, this feature information is spliced together, and the data are standardized. Finally, the extreme gradient boosting (XGBoost) model is used to construct an effective predictive model. Compared with other excellent methods, our proposed method has achieved better results. The accuracy achieved by our method is 78.26% for PDB2272 and 85.48% for PDB186. The accuracy of the experimental results achieved by our strategy is similar to that of previous detection methods.
Collapse
Affiliation(s)
- Ziye Zhao
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Wen Yang
- International Medical Center, Shenzhen University General Hospital, Shenzhen, China
| | - Yixiao Zhai
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Yingjian Liang
- Department of Obstetrics and Gynecology, The First Affiliated Hospital of Harbin Medical University, Harbin, China
- *Correspondence: Yingjian Liang, ; Yuming Zhao,
| | - Yuming Zhao
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
- *Correspondence: Yingjian Liang, ; Yuming Zhao,
| |
Collapse
|
9
|
WGBS combined with RNA-seq analysis revealed that Dnmt1 affects the methylation modification and gene expression changes during mouse oocyte vitrification. Theriogenology 2022; 177:11-21. [PMID: 34653792 DOI: 10.1016/j.theriogenology.2021.09.032] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 09/28/2021] [Accepted: 09/29/2021] [Indexed: 01/01/2023]
Abstract
Understanding the molecular level changes of oocyte cryopreservation and the subsequent warming process is essential for improving the oocyte cryopreservation technologies. Here, we collected the mature metaphase II (MII) oocytes from mice and vitrified. After thawing, single-cell whole-genome bisulphite sequencing (scWGBS) and single-cell RNA sequencing (scRNA-seq) were used to investigate the molecular attributes of this process. Compared to the fresh oocytes, the vitrified oocytes had lower global methylation and gene expression levels, and 1426 genes up-regulated and 3321 genes down-regulated. The 1426 up-regulated differentially expressed genes (DEGs) in the vitrified oocytes were mainly associated with the histone ubiquitination, while the 3321 down-regulated genes were mainly enriched in the mitochondrion organisation and ATP metabolism processes. The differentially methylated regions (DMRs) were mainly located in promoter, intron and exon region of genes, and the length of DMRs in the vitrified oocytes were also significantly lower than that of the fresh oocytes. Notably, there were no significant difference in the expression levels of DNA demethylases (Tet1, Tet2 and Tet3) and methyltransferases (Dnmt3a and Dnmt3b) between two treatments of oocytes. However, Dnmt1 and kcnq1ot1, which are responsible for maintaining DNA methylation, were significantly down regulated in the vitrified oocytes. Gene regulatory network (GRN) analysis showed the Dnmt1 and kcnq1ot1 play a core role in regulating methylation and expression levels of downstream genes. Moreover, some genes associated with oocyte quality were significantly down-regulated in the vitrified oocytes. The present data provides a new perspective for understanding the impact of vitrification on oocytes.
Collapse
|
10
|
Long C, Li H, Li X, Yang W, Zuo Y. Nuclear Transfer Arrest Embryos Show Massive Dysregulation of Genes Involved in Transcription Pathways. Int J Mol Sci 2021; 22:ijms22158187. [PMID: 34360962 PMCID: PMC8347363 DOI: 10.3390/ijms22158187] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 07/27/2021] [Accepted: 07/28/2021] [Indexed: 11/28/2022] Open
Abstract
Somatic cell nuclear transfer (SCNT) technology can reprogram terminally differentiated cell nuclei into a totipotent state. However, the underlying molecular barriers of SCNT embryo development remain incompletely elucidated. Here, we observed that transcription-related pathways were incompletely activated in nuclear transfer arrest (NTA) embryos compared to normal SCNT embryos and in vivo fertilized (WT) embryos, which hinders the development of SCNT embryos. We further revealed the transcription pathway associated gene regulatory networks (GRNs) and found the aberrant transcription pathways can lead to the massive dysregulation of genes in NTA embryos. The predicted target genes of transcription pathways contain a series of crucial factors in WT embryos, which play an important role in catabolic process, pluripotency regulation, epigenetic modification and signal transduction. In NTA embryos, however, these genes were varying degrees of inhibition and show a defect in synergy. Overall, our research found that the incomplete activation of transcription pathways is another potential molecular barrier for SCNT embryos besides the incomplete reprogramming of epigenetic modifications, broadening the understanding of molecular mechanism of SCNT embryonic development.
Collapse
|
11
|
Min X, Lu F, Li C. Sequence-Based Deep Learning Frameworks on Enhancer-Promoter Interactions Prediction. Curr Pharm Des 2021; 27:1847-1855. [PMID: 33234095 DOI: 10.2174/1381612826666201124112710] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Revised: 07/29/2020] [Accepted: 08/06/2020] [Indexed: 11/22/2022]
Abstract
Enhancer-promoter interactions (EPIs) in the human genome are of great significance to transcriptional regulation, which tightly controls gene expression. Identification of EPIs can help us better decipher gene regulation and understand disease mechanisms. However, experimental methods to identify EPIs are constrained by funds, time, and manpower, while computational methods using DNA sequences and genomic features are viable alternatives. Deep learning methods have shown promising prospects in classification and efforts that have been utilized to identify EPIs. In this survey, we specifically focus on sequence-based deep learning methods and conduct a comprehensive review of the literature. First, we briefly introduce existing sequence- based frameworks on EPIs prediction and their technique details. After that, we elaborate on the dataset, pre-processing means, and evaluation strategies. Finally, we concluded with the challenges these methods are confronted with and suggest several future opportunities. We hope this review will provide a useful reference for further studies on enhancer-promoter interactions.
Collapse
Affiliation(s)
- Xiaoping Min
- School of Informatics, Xiamen University, Xiamen 361005, China
| | - Fengqing Lu
- School of Informatics, Xiamen University, Xiamen 361005, China
| | - Chunyan Li
- Graduate School, Yunnan Minzu University, Kunming 650504, China
| |
Collapse
|
12
|
Dao FY, Lv H, Su W, Sun ZJ, Huang QL, Lin H. iDHS-Deep: an integrated tool for predicting DNase I hypersensitive sites by deep neural network. Brief Bioinform 2021; 22:6158360. [PMID: 33751027 DOI: 10.1093/bib/bbab047] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2020] [Revised: 01/28/2021] [Accepted: 01/29/2021] [Indexed: 01/09/2023] Open
Abstract
DNase I hypersensitive site (DHS) refers to the hypersensitive region of chromatin for the DNase I enzyme. It is an important part of the noncoding region and contains a variety of regulatory elements, such as promoter, enhancer, and transcription factor-binding site, etc. Moreover, the related locus of disease (or trait) are usually enriched in the DHS regions. Therefore, the detection of DHS region is of great significance. In this study, we develop a deep learning-based algorithm to identify whether an unknown sequence region would be potential DHS. The proposed method showed high prediction performance on both training datasets and independent datasets in different cell types and developmental stages, demonstrating that the method has excellent superiority in the identification of DHSs. Furthermore, for the convenience of related wet-experimental researchers, the user-friendly web-server iDHS-Deep was established at http://lin-group.cn/server/iDHS-Deep/, by which users can easily distinguish DHS and non-DHS and obtain the corresponding developmental stage ofDHS.
Collapse
Affiliation(s)
- Fu-Ying Dao
- Informational Biology at University of Electronic Science and Technology of China, China
| | - Hao Lv
- Informational Biology at University of Electronic Science and Technology of China, China
| | - Wei Su
- Informational Biology at University of Electronic Science and Technology of China, China
| | - Zi-Jie Sun
- Informational Biology at University of Electronic Science and Technology of China, China
| | - Qin-Lai Huang
- Informational Biology at University of Electronic Science and Technology of China, China
| | - Hao Lin
- Informational Biology at University of Electronic Science and Technology of China, China
| |
Collapse
|
13
|
Wang Z, Liu D, Xu B, Tian R, Zuo Y. Modular arrangements of sequence motifs determine the functional diversity of KDM proteins. Brief Bioinform 2020; 22:5912575. [PMID: 32987405 DOI: 10.1093/bib/bbaa215] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Histone lysine demethylases (KDMs) play a vital role in regulating chromatin dynamics and transcription. KDM proteins are given modular activities by its sequence motifs with obvious roles division, which endow the complex and diverse functions. In our review, according to functional features, we classify sequence motifs into four classes: catalytic motifs, targeting motifs, regulatory motifs and potential motifs. JmjC, as the main catalytic motif, combines to Fe2+ and α-ketoglutarate by residues H-D/E-H and S-N-N/Y-K-N/Y-T/S. Targeting motifs make catalytic motifs recognize specific methylated lysines, such as PHD that helps KDM5 to demethylate H3K4me3. Regulatory motifs consist of a functional network. For example, NLS, Ser-rich, TPR and JmjN motifs regulate the nuclear localization. And interactions through the CW-type-C4H2C2-SWIRM are necessary to the demethylase activity of KDM1B. Additionally, many conservative domains that have potential functions but no deep exploration are reviewed for the first time. These conservative domains are usually amino acid-rich regions, which have great research value. The arrangements of four types of sequence motifs generate that KDM proteins diversify toward modular activities and biological functions. Finally, we draw a blueprint of functional mechanisms to discuss the modular activity of KDMs.
Collapse
Affiliation(s)
- Zerong Wang
- State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of life sciences, Inner Mongolia University
| | - Dongyang Liu
- State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of life sciences, Inner Mongolia University. He is now studying for a master's degree at the institute of botany of the Chinese Academy of Sciences. His research interests include bioinformatics and computational genomics
| | - Baofang Xu
- State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of life sciences, Inner Mongolia University
| | - Ruixia Tian
- State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of life sciences, Inner Mongolia University
| | - Yongchun Zuo
- State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of life sciences, Inner Mongolia University. His research interests include bioinformatics and integration analysis of multiomics in cell reprogramming
| |
Collapse
|
14
|
Cao P, Li H, Zuo Y, Nashun B. Characterization of DNA Methylation Patterns and Mining of Epigenetic Markers During Genomic Reprogramming in SCNT Embryos. Front Cell Dev Biol 2020; 8:570107. [PMID: 32984351 PMCID: PMC7492385 DOI: 10.3389/fcell.2020.570107] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Accepted: 08/13/2020] [Indexed: 12/15/2022] Open
Abstract
Somatic cell nuclear transfer (SCNT), also known as somatic cell cloning, is a commonly used technique to study epigenetic reprogramming. Although SCNT has the advantages of being safe and able to obtain pluripotent cells, early developmental arrest happens in most SCNT embryos. Overcoming epigenetic barriers is currently the primary strategy for improving reprogramming efficiency and improving developmental rate in SCNT embryos. In this study, we analyzed DNA methylation profiles of in vivo fertilized embryos and SCNT embryos with different developmental fates. Overall DNA methylation level was higher in SCNT embryos during global de-methylation process compared to in vivo fertilized embryos. In addition, promoter region, first intron and 3′UTR were found to be the major genomic regions that were hyper-methylated in SCNT embryos. Surprisingly, we found the length of re-methylated region was directly related to the change of methylation level. Furthermore, a number of genes including Dppa2 and Dppa4 which are important for early zygotic genome activation (ZGA) were not properly activated in SCNT embryos. This study comprehensively analyzed genome-wide DNA methylation patterns in SCNT embryos and provided candidate target genes for improving efficiency of genomic reprogramming in SCNT embryos. Since SCNT technology has been widely used in agricultural and pastoral production, protection of endangered animals, and therapeutic cloning, the findings of this study have significant importance for all these fields.
Collapse
Affiliation(s)
- Pengbo Cao
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Hanshuang Li
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Buhe Nashun
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot, China
| |
Collapse
|
15
|
Predicting Preference of Transcription Factors for Methylated DNA Using Sequence Information. MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 22:1043-1050. [PMID: 33294291 PMCID: PMC7691157 DOI: 10.1016/j.omtn.2020.07.035] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 07/28/2020] [Indexed: 12/12/2022]
Abstract
Transcription factors play key roles in cell-fate decisions by regulating 3D genome conformation and gene expression. The traditional view is that methylation of DNA hinders transcription factors binding to them, but recent research has shown that many transcription factors prefer to bind to methylated DNA. Therefore, identifying such transcription factors and understanding their functions is a stepping-stone for studying methylation-mediated biological processes. In this paper, a two-step discriminated method was proposed to recognize transcription factors and their preference for methylated DNA based only on sequences information. In the first step, the proposed model was used to discriminate transcription factors from non-transcription factors. The areas under the curve (AUCs) are 0.9183 and 0.9116, respectively, for the 5-fold cross-validation test and independent dataset test. Subsequently, for the classification of transcription factors that prefer methylated DNA and transcription factors that prefer non-methylated DNA, our model could produce the AUCs of 0.7744 and 0.7356, respectively, for the 5-fold cross-validation test and independent dataset test. Based on the proposed model, a user-friendly web server called TFPred was built, which can be freely accessed at http://lin-group.cn/server/TFPred/.
Collapse
|
16
|
Liang P, Yang W, Chen X, Long C, Zheng L, Li H, Zuo Y. Machine Learning of Single-Cell Transcriptome Highly Identifies mRNA Signature by Comparing F-Score Selection with DGE Analysis. MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 20:155-163. [PMID: 32169803 PMCID: PMC7066034 DOI: 10.1016/j.omtn.2020.02.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 12/27/2019] [Accepted: 02/05/2020] [Indexed: 12/21/2022]
Abstract
Human preimplantation development is a complex process involving dramatic changes in transcriptional architecture. For a better understanding of their time-spatial development, it is indispensable to identify key genes. Although the single-cell RNA sequencing (RNA-seq) techniques could provide detailed clustering signatures, the identification of decisive factors remains difficult. Additionally, it requires high experimental cost and a long experimental period. Thus, it is highly desired to develop computational methods for identifying effective genes of development signature. In this study, we first developed a predictor called EmPredictor to identify developmental stages of human preimplantation embryogenesis. First, we compared the F-score of feature selection algorithms with differential gene expression (DGE) analysis to find specific signatures of the development stage. In addition, by training the support vector machine (SVM), four types of signature subsets were comprehensively discussed. The prediction results showed that a feature subset with 1,881 genes from the F-score algorithm obtained the best predictive performance, which achieved the highest accuracy of 93.3% on the cross-validation set. Further function enrichment demonstrated that the gene set selected by the feature selection method was involved in more development-related pathways and cell fate determination biomarkers. This indicates that the F-score algorithm should be preferentially proposed for detecting key genes of multi-period data in mammalian early development.
Collapse
Affiliation(s)
- Pengfei Liang
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Wuritu Yang
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Xing Chen
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Chunshen Long
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Lei Zheng
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Hanshuang Li
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Yongchun Zuo
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China.
| |
Collapse
|
17
|
Dao FY, Lv H, Yang YH, Zulfiqar H, Gao H, Lin H. Computational identification of N6-methyladenosine sites in multiple tissues of mammals. Comput Struct Biotechnol J 2020; 18:1084-1091. [PMID: 32435427 PMCID: PMC7229270 DOI: 10.1016/j.csbj.2020.04.015] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2020] [Revised: 04/20/2020] [Accepted: 04/21/2020] [Indexed: 12/12/2022] Open
Abstract
N6-methyladenosine (m6A) is the methylation of the adenosine at the nitrogen-6 position, which is the most abundant RNA methylation modification and involves a series of important biological processes. Accurate identification of m6A sites in genome-wide is invaluable for better understanding their biological functions. In this work, an ensemble predictor named iRNA-m6A was established to identify m6A sites in multiple tissues of human, mouse and rat based on the data from high-throughput sequencing techniques. In the proposed predictor, RNA sequences were encoded by physical-chemical property matrix, mono-nucleotide binary encoding and nucleotide chemical property. Subsequently, these features were optimized by using minimum Redundancy Maximum Relevance (mRMR) feature selection method. Based on the optimal feature subset, the best m6A classification models were trained by Support Vector Machine (SVM) with 5-fold cross-validation test. Prediction results on independent dataset showed that our proposed method could produce the excellent generalization ability. We also established a user-friendly webserver called iRNA-m6A which can be freely accessible at http://lin-group.cn/server/iRNA-m6A. This tool will provide more convenience to users for studying m6A modification in different tissues.
Collapse
Affiliation(s)
| | | | - Yu-He Yang
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hasan Zulfiqar
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hui Gao
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hao Lin
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
18
|
Wang S, Zhang Q, Yu C, Cao Y, Zuo Y, Yang L. Immune cell infiltration-based signature for prognosis and immunogenomic analysis in breast cancer. Brief Bioinform 2020; 22:2020-2031. [PMID: 32141494 DOI: 10.1093/bib/bbaa026] [Citation(s) in RCA: 90] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Revised: 01/30/2020] [Accepted: 02/17/2020] [Indexed: 12/18/2022] Open
Abstract
Breast cancer is one of the most human malignant diseases and the leading cause of cancer-related death in the world. However, the prognostic and therapeutic benefits of breast cancer patients cannot be predicted accurately by the current stratifying system. In this study, an immune-related prognostic score was established in 22 breast cancer cohorts with a total of 6415 samples. An extensive immunogenomic analysis was conducted to explore the relationships between immune score, prognostic significance, infiltrating immune cells, cancer genotypes and potential immune escape mechanisms. Our analysis revealed that this immune score was a promising biomarker for estimating overall survival in breast cancer. This immune score was associated with important immunophenotypic factors, such as immune escape and mutation load. Further analysis revealed that patients with high immune scores exhibited therapeutic benefits from chemotherapy and immunotherapy. Based on these results, we can conclude that this immune score may be a useful tool for overall survival prediction and treatment guidance for patients with breast cancer.
Collapse
|
19
|
Li H, Song M, Yang W, Cao P, Zheng L, Zuo Y. A Comparative Analysis of Single-Cell Transcriptome Identifies Reprogramming Driver Factors for Efficiency Improvement. MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 19:1053-1064. [PMID: 32045876 PMCID: PMC7015826 DOI: 10.1016/j.omtn.2019.12.035] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Revised: 12/23/2019] [Accepted: 12/26/2019] [Indexed: 12/11/2022]
Abstract
Terminally differentiated somatic cells can be reprogrammed into a totipotent state through somatic cell nuclear transfer (SCNT). The incomplete reprogramming is the major reason for developmental arrest of SCNT embryos at early stages. In our studies, we found that pathways for autophagy, endocytosis, and apoptosis were incompletely activated in nuclear transfer (NT) 2-cell arrest embryos, whereas extensively inhibited pathways for stem cell pluripotency maintenance, DNA repair, cell cycle, and autophagy may result in NT 4-cell embryos arrest. As for NT normal embryos, a significant shift in expression of developmental transcription factors (TFs) Id1, Pou6f1, Cited1, and Zscan4c was observed. Compared with pluripotent gene Ascl2 being activated only in NT 2-cell, Nanog, Dppa2, and Sall4 had major expression waves in normal development of both NT 2-cell and 4-cell embryos. Additionally, Kdm4b/4d and Kdm5b had been confirmed as key markers in NT 2-cell and 4-cell embryos, respectively. Histone acetylases Kat8, Elp6, and Eid1 were co-activated in NT 2-cell and 4-cell embryos to facilitate normal development. Gadd45a as a key driver functions with Tet1 and Tet2 to improve the efficiency of NT reprogramming. Taken together, our findings provided an important theoretical basis for elucidating the potential molecular mechanisms and identified reprogramming driver factor to improve the efficiency of SCNT reprogramming.
Collapse
Affiliation(s)
- Hanshuang Li
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Mingmin Song
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Wuritu Yang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Pengbo Cao
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Lei Zheng
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China.
| |
Collapse
|