1
|
Huang Y, Wang Q, Liu X, Du W, Hao Z, Wang Y. Transcriptional Signatures of a Dynamic Epilepsy Process Reveal Potential Immune Regulation. Mol Neurobiol 2024; 61:3384-3396. [PMID: 37989981 PMCID: PMC11087345 DOI: 10.1007/s12035-023-03786-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 11/09/2023] [Indexed: 11/23/2023]
Abstract
Epilepsy is a progression of development and advancement over time. However, the molecular features of epilepsy were poorly studied from a dynamic developmental perspective. We intend to investigate the key mechanisms in the process of epilepsy by exploring the roles of stage-specifically expressed genes. By using time-course transcriptomic data of epileptic samples, we first analyzed the molecular features of epilepsy in different stages and divided it into progression and remission stages based on their transcriptomic features. 34 stage-specifically expressed genes were then identified by the Tau index and verified in other epileptic datasets. These genes were then enriched for immune-related biological functions. Furthermore, we found that the level of immune infiltration and mechanisms at different stages were different, which may result from different types of immune cells playing leading roles in distinct stages. Our findings indicated an essential role of immune regulation as the potential mechanism of epilepsy development.
Collapse
Affiliation(s)
- Yanruo Huang
- Department of Anesthesiology, Huashan Hospital, Fudan University, 12 Middle Wulumuqi Road, Shanghai, 200040, People's Republic of China
| | - Qihang Wang
- State Key Laboratory of Molecular Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, People's Republic of China
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
| | - Xiaoyin Liu
- Department of Neurosurgery, West China Medical School, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, People's Republic of China
| | - Wenjie Du
- Department of Anesthesiology, Huashan Hospital, Fudan University, 12 Middle Wulumuqi Road, Shanghai, 200040, People's Republic of China
| | - Zijian Hao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200433, People's Republic of China.
- MOE Frontiers Center for Brain Science, Fudan University, Shanghai, 200433, People's Republic of China.
| | - Yingwei Wang
- Department of Anesthesiology, Huashan Hospital, Fudan University, 12 Middle Wulumuqi Road, Shanghai, 200040, People's Republic of China.
| |
Collapse
|
2
|
Luo M, Miao YR, Ke YJ, Guo AY, Zhang Q. A comprehensive landscape of transcription profiles and data resources for human leukemia. Blood Adv 2023; 7:3435-3449. [PMID: 36595475 PMCID: PMC10362280 DOI: 10.1182/bloodadvances.2022008410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 12/13/2022] [Accepted: 12/15/2022] [Indexed: 01/04/2023] Open
Abstract
As a heterogeneous group of hematologic malignancies, leukemia has been widely studied at the transcriptome level. However, a comprehensive transcriptomic landscape and resources for different leukemia subtypes are lacking. Thus, in this study, we integrated the RNA sequencing data sets of >3000 samples from 14 leukemia subtypes and 53 related cell lines via a unified analysis pipeline. We depicted the corresponding transcriptomic landscape and developed a user-friendly data portal LeukemiaDB. LeukemiaDB was designed with 5 main modules: protein-coding gene, long noncoding RNA (lncRNA), circular RNA, alternative splicing, and fusion gene modules. In LeukemiaDB, users can search and browse the expression level, regulatory modules, and molecular information across leukemia subtypes or cell lines. In addition, a comprehensive analysis of data in LeukemiaDB demonstrates that (1) different leukemia subtypes or cell lines have similar expression distribution of the protein-coding gene and lncRNA; (2) some alternative splicing events are shared among nearly all leukemia subtypes, for example, MYL6 in A3SS, MYB in A5SS, HMBS in retained intron, GTPBP10 in mutually exclusive exons, and POLL in skipped exon; (3) some leukemia-specific protein-coding genes, for example, ABCA6, ARHGAP44, WNT3, and BLACE, and fusion genes, for example, BCR-ABL1 and KMT2A-AFF1 are involved in leukemogenesis; (4) some highly correlated regulatory modules were also identified in different leukemia subtypes, for example, the HOXA9 module in acute myeloid leukemia and the NOTCH1 module in T-cell acute lymphoblastic leukemia. In summary, the developed LeukemiaDB provides valuable insights into oncogenesis and progression of leukemia and, to the best of our knowledge, is the most comprehensive transcriptome resource of human leukemia available to the research community.
Collapse
Affiliation(s)
- Mei Luo
- Research Center of Clinical Medicine, Affiliated Hospital of Nantong University, Nantong, China
- Department of Dermatology, Xiangya Hospital, Central South University, Changsha, People’s Republic of China
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, People’s Republic of China
| | - Ya-Ru Miao
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Ya-Juan Ke
- Dian Diagnostics Group Co, Ltd, Hangzhou, China
- Key Laboratory of Digital Technology in Medical Diagnostics of Zhejiang Province, Hangzhou, China
| | - An-Yuan Guo
- Research Center of Clinical Medicine, Affiliated Hospital of Nantong University, Nantong, China
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Qiong Zhang
- Research Center of Clinical Medicine, Affiliated Hospital of Nantong University, Nantong, China
| |
Collapse
|
3
|
Hu Qian S, Shi MW, Wang DY, Fear JM, Chen L, Tu YX, Liu HS, Zhang Y, Zhang SJ, Yu SS, Oliver B, Chen ZX. Integrating massive RNA-seq data to elucidate transcriptome dynamics in Drosophila melanogaster. Brief Bioinform 2023; 24:bbad177. [PMID: 37232385 PMCID: PMC10505420 DOI: 10.1093/bib/bbad177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 04/19/2023] [Accepted: 04/20/2023] [Indexed: 05/27/2023] Open
Abstract
The volume of ribonucleic acid (RNA)-seq data has increased exponentially, providing numerous new insights into various biological processes. However, due to significant practical challenges, such as data heterogeneity, it is still difficult to ensure the quality of these data when integrated. Although some quality control methods have been developed, sample consistency is rarely considered and these methods are susceptible to artificial factors. Here, we developed MassiveQC, an unsupervised machine learning-based approach, to automatically download and filter large-scale high-throughput data. In addition to the read quality used in other tools, MassiveQC also uses the alignment and expression quality as model features. Meanwhile, it is user-friendly since the cutoff is generated from self-reporting and is applicable to multimodal data. To explore its value, we applied MassiveQC to Drosophila RNA-seq data and generated a comprehensive transcriptome atlas across 28 tissues from embryogenesis to adulthood. We systematically characterized fly gene expression dynamics and found that genes with high expression dynamics were likely to be evolutionarily young and expressed at late developmental stages, exhibiting high nonsynonymous substitution rates and low phenotypic severity, and they were involved in simple regulatory programs. We also discovered that human and Drosophila had strong positive correlations in gene expression in orthologous organs, revealing the great potential of the Drosophila system for studying human development and disease.
Collapse
Affiliation(s)
- Sheng Hu Qian
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Meng-Wei Shi
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Dan-Yang Wang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Justin M Fear
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lu Chen
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Yi-Xuan Tu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Hong-Shan Liu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Yuan Zhang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Shuai-Jie Zhang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Shan-Shan Yu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Brian Oliver
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Zhen-Xia Chen
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Interdisciplinary Sciences Institute, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Shenzhen 518000, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518000, China
| |
Collapse
|
4
|
Han B, Wu D, Zhang Y, Li DZ, Xu W, Liu A. Epigenetic regulation of seed-specific gene expression by DNA methylation valleys in castor bean. BMC Biol 2022; 20:57. [PMID: 35227267 PMCID: PMC8886767 DOI: 10.1186/s12915-022-01259-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 02/18/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Understanding the processes governing angiosperm seed growth and development is essential both for fundamental plant biology and for agronomic purposes. Master regulators of angiosperm seed development are expressed in a seed-specific manner. However, it is unclear how this seed specificity of transcription is established. In some vertebrates, DNA methylation valleys (DMVs) are highly conserved and strongly associated with key developmental genes, but comparable studies in plants are limited to Arabidopsis and soybean. Castor bean (Ricinus communis) is a valuable model system for the study of seed biology in dicots and source of economically important castor oil. Unlike other dicots such as Arabidopsis and soybean, castor bean seeds have a relatively large and persistent endosperm throughout seed development, representing substantial structural differences in mature seeds. Here, we performed an integrated analysis of RNA-seq, whole-genome bisulfite sequencing, and ChIP-seq for various histone marks in the castor bean. RESULTS We present a gene expression atlas covering 16 representative tissues and identified 1162 seed-specific genes in castor bean (Ricinus communis), a valuable model for the study of seed biology in dicots. Upon whole-genome DNA methylation analyses, we detected 32,567 DMVs across five tissues, covering ~33% of the castor bean genome. These DMVs are highly hypomethylated during development and conserved across plant species. We found that DMVs have the potential to activate transcription, especially that of tissue-specific genes. Focusing on seed development, we found that many key developmental regulators of seed/endosperm development, including AGL61, AGL62, LEC1, LEC2, ABI3, and WRI1, were located within DMVs. ChIP-seq for five histone modifications in leaves and seeds clearly showed that the vast majority of histone modification peaks were enriched within DMVs, and their remodeling within DMVs has a critical role in the regulation of seed-specific gene expression. Importantly, further experiment analysis revealed that distal DMVs may act as cis-regulatory elements, like enhancers, to activate downstream gene expression. CONCLUSIONS Our results point to the importance of DMVs and special distal DMVs behaving like enhancers, in the regulation of seed-specific genes, via the reprogramming of histone modifications within DMVs. Furthermore, these results provide a comprehensive understanding of the epigenetic regulator roles in seed development in castor bean and other important crops.
Collapse
Affiliation(s)
- Bing Han
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
| | - Di Wu
- Key Laboratory of Economic Plants and Biotechnology, Yunnan Key Laboratory for Wild Plant Resources, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yanyu Zhang
- Key Laboratory of Economic Plants and Biotechnology, Yunnan Key Laboratory for Wild Plant Resources, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
| | - Wei Xu
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China.
| | - Aizhong Liu
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, Southwest Forestry University, Kunming, 650224, China.
| |
Collapse
|
5
|
Xie G, Liu C, Miao Y, Xia M, Zhang Q, Guo A. A comprehensive platelet expression atlas (PEA) resource and platelet transcriptome landscape. Am J Hematol 2022; 97:E18-E21. [PMID: 34714959 DOI: 10.1002/ajh.26393] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 10/21/2021] [Accepted: 10/24/2021] [Indexed: 01/16/2023]
Affiliation(s)
- Gui‐Yan Xie
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology Huazhong University of Science and Technology Wuhan China
| | - Chun‐Jie Liu
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology Huazhong University of Science and Technology Wuhan China
| | - Ya‐Ru Miao
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology Huazhong University of Science and Technology Wuhan China
| | - Mengxuan Xia
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology Huazhong University of Science and Technology Wuhan China
| | - Qiong Zhang
- Research Center of Clinical Medicine Affiliated Hospital of Nantong University Nantong China
| | - An‐Yuan Guo
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology Huazhong University of Science and Technology Wuhan China
- Research Center of Clinical Medicine Affiliated Hospital of Nantong University Nantong China
| |
Collapse
|
6
|
Li J, Long J, Zhang Q, Shen H, Guo AY, Ma Z, Zhang G. Hypothalamic long noncoding RNA AK044061 is involved in the development of dietary obesity in mice. Int J Obes (Lond) 2021; 45:2638-2647. [PMID: 34446844 DOI: 10.1038/s41366-021-00945-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 08/16/2021] [Accepted: 08/16/2021] [Indexed: 02/07/2023]
Abstract
BACKGROUND Long noncoding RNAs (lncRNAs) have been implicated in various important biological processes, however, its role in energy balance and obesity remains largely unknown. METHODS Differentially expressed lncRNAs in the hypothalamus of diet-induced obesity (DIO) mice versus chow-fed mice were identified by RNA sequencing. Lentivirus-mediated overexpression and knockdown of a novel lncRNA, AK044061, were used to assess its role in energy balance and the development of DIO. RNA immunoprecipitation (RIP) and pull down assays were carried out to analyze the interaction between lncRNA AK044061 and RelA, an NF-κB subunit. RESULTS LncRNA AK044061 was upregulated in the hypothalamus of DIO mice. Acute intracerebroventricular (i.c.v.) infusion of glucose reduced the expression of lncRNA AK044061, whereas an overnight of fasting enhanced its expression. RNA in situ hybridization data showed that AK044061 was expressed in the neurons of the arcuate nucleus (ARC). Lentivirus-mediated overexpression of AK044061 in ARC cells, or in the neurons of the ARC nucleus led to an obesity-like phenotype and related metabolic disorders. Furthermore, knockdown of lncRNA AK044061 in Agouti-related peptide (AgRP)-expressing neurons mitigated DIO and its related metabolic dysregulations. In mechanism, we showed that lncRNA AK044061 was associated with RelA and could enhance the NF-κB reporter activity. The effect of lncRNA AK044061 on energy balance is mediated by NF-κB. CONCLUSIONS Our findings suggest that excessive lncRNA AK044061 in the ARC nucleus leads to energy imbalance and obesity. LncRNA AK044061 expressed in the AgRP neurons is important in the development of dietary obesity in mice.
Collapse
Affiliation(s)
- Juan Li
- Key Laboratory of Environmental Health, Ministry of Education, Department of Toxicology, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.,Institute for Brain Research, Collaborative Innovation Center for Brain Science, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Jinlie Long
- Key Laboratory of Environmental Health, Ministry of Education, Department of Toxicology, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Qiong Zhang
- Hubei Bioinformatics & Molecular Imaging Key Laboratory, Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Hongyuan Shen
- Singapore Nuclear Research and Safety Initiative, National University of Singapore, Singapore, Singapore
| | - An-Yuan Guo
- Hubei Bioinformatics & Molecular Imaging Key Laboratory, Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China.
| | - Zhaowu Ma
- School of Basic Medicine, Health Science Center, Yangtze University, Jingzhou, Hubei, China.
| | - Guo Zhang
- Key Laboratory of Environmental Health, Ministry of Education, Department of Toxicology, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China. .,Institute for Brain Research, Collaborative Innovation Center for Brain Science, Huazhong University of Science and Technology, Wuhan, Hubei, China.
| |
Collapse
|
7
|
Jiang T, Zhou W, Chang Z, Zou H, Bai J, Sun Q, Pan T, Xu J, Li Y, Li X. ImmReg: the regulon atlas of immune-related pathways across cancer types. Nucleic Acids Res 2021; 49:12106-12118. [PMID: 34755873 PMCID: PMC8643631 DOI: 10.1093/nar/gkab1041] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 10/10/2021] [Accepted: 10/14/2021] [Indexed: 01/05/2023] Open
Abstract
Immune system gene regulation perturbation has been found to be a major cause of the development of various types of cancer. Numbers of mechanisms contribute to gene expression regulation, thus, systematically identification of potential regulons of immune-related pathways is critical to cancer immunotherapy. Here, we comprehensively chart the landscape of transcription factors, microRNAs, RNA binding proteins and long noncoding RNAs regulation in 17 immune-related pathways across 33 cancers. The potential immunology regulons are likely to exhibit higher expressions in immune cells, show expression perturbations in cancer, and are significantly correlated with immune cell infiltrations. We also identify a panel of clinically relevant immunology regulons across cancers. Moreover, the regulon atlas of immune-related pathways helps prioritizing cancer-related genes (i.e. ETV7, miR-146a-5p, ZFP36 and HCP5). We further identified two molecular subtypes of glioma (cold and hot tumour phenotypes), which were characterized by differences in immune cell infiltrations, expression of checkpoints, and prognosis. Finally, we developed a user-friendly resource, ImmReg (http://bio-bigdata.hrbmu.edu.cn/ImmReg/), with multiple modules to visualize, browse, and download immunology regulation. Our study provides a comprehensive landscape of immunology regulons, which will shed light on future development of RNA-based cancer immunotherapies.
Collapse
Affiliation(s)
- Tiantongfei Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Weiwei Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Zhenghong Chang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Haozhe Zou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Jing Bai
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Qisen Sun
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Tao Pan
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Juan Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Yongsheng Li
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China.,Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| |
Collapse
|
8
|
Jin G, Ma PF, Wu X, Gu L, Long M, Zhang C, Li DZ. New Genes Interacted with Recent Whole Genome Duplicates in the Fast Stem Growth of Bamboos. Mol Biol Evol 2021; 38:5752-5768. [PMID: 34581782 PMCID: PMC8662795 DOI: 10.1093/molbev/msab288] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
As drivers of evolutionary innovations, new genes allow organisms to explore new niches. However, clear examples of this process remain scarce. Bamboos, the unique grass lineage diversifying into the forest, have evolved with a key innovation of fast growth of woody stem, reaching up to 1 m/day. Here, we identify 1,622 bamboo-specific orphan genes that appeared in recent 46 million years, and 19 of them evolved from noncoding ancestral sequences with entire de novo origination process reconstructed. The new genes evolved gradually in exon−intron structure, protein length, expression specificity, and evolutionary constraint. These new genes, whether or not from de novo origination, are dominantly expressed in the rapidly developing shoots, and make transcriptomes of shoots the youngest among various bamboo tissues, rather than reproductive tissue in other plants. Additionally, the particularity of bamboo shoots has also been shaped by recent whole-genome duplicates (WGDs), which evolved divergent expression patterns from ancestral states. New genes and WGDs have been evolutionarily recruited into coexpression networks to underline fast-growing trait of bamboo shoot. Our study highlights the importance of interactions between new genes and genome duplicates in generating morphological innovation.
Collapse
Affiliation(s)
- Guihua Jin
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| | - Peng-Fei Ma
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| | - Xiaopei Wu
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| | - Lianfeng Gu
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, 60637, USA
| | - Chengjun Zhang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| |
Collapse
|
9
|
Liu CJ, Xie GY, Miao YR, Xia M, Wang Y, Lei Q, Zhang Q, Guo AY. EVAtlas: a comprehensive database for ncRNA expression in human extracellular vesicles. Nucleic Acids Res 2021; 50:D111-D117. [PMID: 34387689 PMCID: PMC8728297 DOI: 10.1093/nar/gkab668] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 07/01/2021] [Accepted: 07/23/2021] [Indexed: 12/23/2022] Open
Abstract
Extracellular vesicles (EVs) packing various molecules play vital roles in intercellular communication. Non-coding RNAs (ncRNAs) are important functional molecules and biomarkers in EVs. A comprehensive investigation of ncRNAs expression in EVs under different conditions is a fundamental step for functional discovery and application of EVs. Here, we curated 2030 small RNA-seq datasets for human EVs (1506 sEV and 524 lEV) in 24 conditions and over 40 diseases. We performed a unified reads dynamic assignment algorithm (RDAA) considering mismatch and multi-mapping reads to quantify the expression profiles of seven ncRNA types (miRNA, snoRNA, piRNA, snRNA, rRNA, tRNA and Y RNA). We constructed EVAtlas (http://bioinfo.life.hust.edu.cn/EVAtlas), a comprehensive database for ncRNA expression in EVs with four functional modules: (i) browse and compare the distribution of ncRNAs in EVs from 24 conditions and eight sources (plasma, serum, saliva, urine, sperm, breast milk, primary cell and cell line); (ii) prioritize candidate ncRNAs in condition related tissues based on their expression; (iii) explore the specifically expressed ncRNAs in EVs from 24 conditions; (iv) investigate ncRNA functions, related drugs, target genes and EVs isolation methods. EVAtlas contains the most comprehensive ncRNA expression in EVs and will be a key resource in this field.
Collapse
Affiliation(s)
- Chun-Jie Liu
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China
| | - Gui-Yan Xie
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China
| | - Ya-Ru Miao
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China
| | - Mengxuan Xia
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China
| | - Yi Wang
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China
| | - Qian Lei
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China
| | - Qiong Zhang
- Research Center of Clinical Medicine, Affiliated Hospital of Nantong University, Nantong 226001, China
| | - An-Yuan Guo
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan 430074, China.,Research Center of Clinical Medicine, Affiliated Hospital of Nantong University, Nantong 226001, China
| |
Collapse
|
10
|
Hu H, Zhang Q, Hu FF, Liu CJ, Guo AY. A comprehensive survey for human transcription factors on expression, regulation, interaction, phenotype and cancer survival. Brief Bioinform 2021; 22:6124917. [PMID: 33517372 DOI: 10.1093/bib/bbab002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Revised: 12/30/2020] [Accepted: 01/02/2021] [Indexed: 11/13/2022] Open
Abstract
Transcription factors (TFs) act as key regulators in biological processes through controlling gene expression. Here, we conducted a systematic study for all human TFs on the expression, regulation, interaction, mutation, phenotype and cancer survival. We revealed that the average expression levels of TFs in normal tissues were lower than 50% expression of non-TFs, whereas TF expression was increased in cancers. TFs that are specifically expressed in an individual tissue or cancer may be potential marker genes. For instance, TGIF2LX/Y were preferentially expressed in testis and NEUROG1, PRDM14, SRY, ZNF705A and ZNF716 were specifically highly expressed in germ cell tumors. We found different distributions of target genes and TF co-regulations in different TF families. Some small TF families have huge protein interaction pairs, suggesting their central roles in transcriptional regulation. The bZIP family is a small family involving many signaling pathways. Survival analysis indicated that most TFs significantly affect survival of one or more cancers. Some survival-related TFs were also specifically highly expressed in the corresponding cancer types, which may be potential targets for cancer therapy. Finally, we identified 43 TFs whose mutations were closely correlated to survival, suggesting their cancer-driven roles. The systematic analysis of TFs provides useful clues for further investigation of TF regulatory mechanisms and the role of TFs in diseases.
Collapse
Affiliation(s)
- Hui Hu
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Qiong Zhang
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Fei-Fei Hu
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Chun-Jie Liu
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - An-Yuan Guo
- Center for Artificial Intelligence Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| |
Collapse
|
11
|
Liao Q, Du R, Gou J, Guo L, Shen H, Liu H, Nguyen JK, Ming R, Yin T, Huang S, Yan J. The genomic architecture of the sex-determining region and sex-related metabolic variation in Ginkgobiloba. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 104:1399-1409. [PMID: 33015884 DOI: 10.1111/tpj.15009] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 08/14/2020] [Accepted: 09/01/2020] [Indexed: 06/11/2023]
Abstract
Sex differences and evolutionary differences are critical biological issues. Ginkgo is an ancient lineage of dioecious gymnosperms with special value for studying the mechanism of sex determination in plants. However, the major genetic basic underlying sex chromosomes remains to be uncovered. In this study, we identify the sex-determining region of Ginkgo and locate it to the area from megabases 48 to 75 on chromosome 2. We find that the male sex-determining region of Ginkgo contains more than 200 genes, including four MADS-box genes, demonstrating that the Ginkgo sex determination system is of the XY type. We also find that genetic sex differences result in specialized flavonoid metabolism and regulation in each sex. These findings establish a foundation for revealing the molecular mechanism of sexual dimorphism and promoting the development of the Ginkgo industry.
Collapse
Affiliation(s)
- Qinggang Liao
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Ran Du
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Junbo Gou
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Lvjun Guo
- Center for Synthetic and Systems Biology, MOE Key Laboratory of Bioinformatics, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - He Shen
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Hailin Liu
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
- Key Laboratory for Cultivar Innovation and Germplasm Improvement for Salicaceae Species, College of Forestry, Nanjing Forestry University, Nanjing, 210037, China
| | - Julie K Nguyen
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Ray Ming
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Tongming Yin
- Key Laboratory for Cultivar Innovation and Germplasm Improvement for Salicaceae Species, College of Forestry, Nanjing Forestry University, Nanjing, 210037, China
| | - Sanwen Huang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Jianbin Yan
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| |
Collapse
|
12
|
Zhang Q, Liu W, Zhang HM, Xie GY, Miao YR, Xia M, Guo AY. hTFtarget: A Comprehensive Database for Regulations of Human Transcription Factors and Their Targets. GENOMICS PROTEOMICS & BIOINFORMATICS 2020; 18:120-128. [PMID: 32858223 PMCID: PMC7647694 DOI: 10.1016/j.gpb.2019.09.006] [Citation(s) in RCA: 190] [Impact Index Per Article: 47.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 08/17/2019] [Accepted: 10/23/2019] [Indexed: 01/07/2023]
Abstract
Transcription factors (TFs) as key regulators play crucial roles in biological processes. The identification of TF–target regulatory relationships is a key step for revealing functions of TFs and their regulations on gene expression. The accumulated data of chromatin immunoprecipitation sequencing (ChIP-seq) provide great opportunities to discover the TF–target regulations across different conditions. In this study, we constructed a database named hTFtarget, which integrated huge human TF target resources (7190 ChIP-seq samples of 659 TFs and high-confidence binding sites of 699 TFs) and epigenetic modification information to predict accurate TF–target regulations. hTFtarget offers the following functions for users to explore TF–target regulations: (1) browse or search general targets of a query TF across datasets; (2) browse TF–target regulations for a query TF in a specific dataset or tissue; (3) search potential TFs for a given target gene or non-coding RNA; (4) investigate co-association between TFs in cell lines; (5) explore potential co-regulations for given target genes or TFs; (6) predict candidate TF binding sites on given DNA sequences; (7) visualize ChIP-seq peaks for different TFs and conditions in a genome browser. hTFtarget provides a comprehensive, reliable and user-friendly resource for exploring human TF–target regulations, which will be very useful for a wide range of users in the TF and gene expression regulation community. hTFtarget is available at http://bioinfo.life.hust.edu.cn/hTFtarget.
Collapse
Affiliation(s)
- Qiong Zhang
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Wei Liu
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China; Clinical Center of Human Genomic Research, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Hong-Mei Zhang
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Gui-Yan Xie
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Ya-Ru Miao
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Mengxuan Xia
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - An-Yuan Guo
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
| |
Collapse
|
13
|
Zhang Q, Luo M, Liu CJ, Guo AY. CCLA: an accurate method and web server for cancer cell line authentication using gene expression profiles. Brief Bioinform 2020; 22:5854406. [PMID: 32510568 DOI: 10.1093/bib/bbaa093] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 04/26/2020] [Accepted: 04/28/2020] [Indexed: 01/28/2023] Open
Abstract
Cancer cell lines (CCLs) as important model systems play critical roles in cancer research. The misidentification and contamination of CCLs are serious problems, leading to unreliable results and waste of resources. Current methods for CCL authentication are mainly based on the CCL-specific genetic polymorphism, whereas no method is available for CCL authentication using gene expression profiles. Here, we developed a novel method and homonymic web server (CCLA, Cancer Cell Line Authentication, http://bioinfo.life.hust.edu.cn/web/CCLA/) to authenticate 1291 human CCLs of 28 tissues using gene expression profiles. CCLA showed an excellent speed advantage and high accuracy for CCL authentication, a top 1 accuracy of 96.58 or 92.15% (top 3 accuracy of 100 or 95.11%) for microarray or RNA-Seq validation data (719 samples, 461 CCLs), respectively. To the best of our knowledge, CCLA is the first approach to authenticate CCLs using gene expression data. Users can freely and conveniently authenticate CCLs using gene expression profiles or NCBI GEO accession on CCLA website.
Collapse
|
14
|
Liu T, Zhang Q, Zhang J, Li C, Miao YR, Lei Q, Li Q, Guo AY. EVmiRNA: a database of miRNA profiling in extracellular vesicles. Nucleic Acids Res 2020; 47:D89-D93. [PMID: 30335161 PMCID: PMC6323938 DOI: 10.1093/nar/gky985] [Citation(s) in RCA: 193] [Impact Index Per Article: 48.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 10/09/2018] [Indexed: 12/12/2022] Open
Abstract
Extracellular vesicles (EVs), such as exosomes and microvesicles, acted as cell-to-cell communication vectors and potential biomarkers for diseases. microRNAs (miRNAs) are the most well studied molecules in EVs, thus a comprehensive investigation of miRNA expression profiles in EVs will be helpful to explore their functions and biomarkers. We curated 462 small RNA sequencing samples of EVs from 17 sources/diseases and constructed the EVmiRNA database (http://bioinfo.life.hust.edu.cn/EVmiRNA) to show the miRNA expression profiles. We found >1000 miRNAs expressed in these EVs and detected specific miRNAs for EVs of each source/disease. EVmiRNA provides three functional modules: (i) the miRNA expression profiles and the sample information of EVs from different sources (such as blood, breast milk etc.); (ii) the specifically expressed miRNAs in different EVs that would be helpful for biomarker identification; (iii) the miRNA annotations including the miRNA expression in EVs and TCGA cancer types, miRNA pathway regulations as well as miRNA function and publications. EVmiRNA has a user-friendly web interface with powerful browse and search functions, as well as data downloading. It is the first database focusing on miRNA expression profiles in EVs and will be useful for the research and application community of EV biomarker, miRNA function and liquid biopsy.
Collapse
Affiliation(s)
- Teng Liu
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China.,Huazhong University of Science and Technology Ezhou Industrial Technology Research Institute, Ezhou, Hubei 436044, PR China
| | - Qiong Zhang
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Jiankun Zhang
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Chao Li
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Ya-Ru Miao
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China.,Huazhong University of Science and Technology Ezhou Industrial Technology Research Institute, Ezhou, Hubei 436044, PR China
| | - Qian Lei
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China.,Institute of Hematology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Qiubai Li
- Institute of Hematology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - An-Yuan Guo
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China.,Huazhong University of Science and Technology Ezhou Industrial Technology Research Institute, Ezhou, Hubei 436044, PR China
| |
Collapse
|
15
|
Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study. Int J Mol Sci 2020; 21:ijms21062181. [PMID: 32235704 PMCID: PMC7139673 DOI: 10.3390/ijms21062181] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 03/09/2020] [Accepted: 03/20/2020] [Indexed: 12/30/2022] Open
Abstract
With recent advances in single-cell RNA sequencing, enormous transcriptome datasets have been generated. These datasets have furthered our understanding of cellular heterogeneity and its underlying mechanisms in homogeneous populations. Single-cell RNA sequencing (scRNA-seq) data clustering can group cells belonging to the same cell type based on patterns embedded in gene expression. However, scRNA-seq data are high-dimensional, noisy, and sparse, owing to the limitation of existing scRNA-seq technologies. Traditional clustering methods are not effective and efficient for high-dimensional and sparse matrix computations. Therefore, several dimension reduction methods have been introduced. To validate a reliable and standard research routine, we conducted a comprehensive review and evaluation of four classical dimension reduction methods and five clustering models. Four experiments were progressively performed on two large scRNA-seq datasets using 20 models. Results showed that the feature selection method contributed positively to high-dimensional and sparse scRNA-seq data. Moreover, feature-extraction methods were able to promote clustering performance, although this was not eternally immutable. Independent component analysis (ICA) performed well in those small compressed feature spaces, whereas principal component analysis was steadier than all the other feature-extraction methods. In addition, ICA was not ideal for fuzzy C-means clustering in scRNA-seq data analysis. K-means clustering was combined with feature-extraction methods to achieve good results.
Collapse
|
16
|
Xie GY, Xia M, Miao YR, Luo M, Zhang Q, Guo AY. FFLtool: a web server for transcription factor and miRNA feed forward loop analysis in human. Bioinformatics 2019; 36:2605-2607. [DOI: 10.1093/bioinformatics/btz929] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 12/01/2019] [Accepted: 12/09/2019] [Indexed: 11/13/2022] Open
Abstract
Abstract
Summary
Transcription factors (TFs) and microRNAs (miRNAs) are two kinds of important regulators for transcriptional and post-transcriptional regulations. Understanding cross-talks between the two regulators and their targets is critical to reveal complex molecular regulatory mechanisms. Here, we developed FFLtool, a web server for detecting potential feed forward loop (FFL) of TF-miRNA-target regulation in human. In FFLtool, we integrated comprehensive regulations of TF-target and miRNA-target, and developed two functional modules: (i) The ‘FFL Analysis’ module can detect potential FFLs and internal regulatory networks in a user-defined gene set. FFLtool also provides three levels of evidence to illustrate the reliability for each FFL and enrichment functions for co-target genes of the same TF and miRNA; (ii) The ‘Browse FFLs’ module displays FFLs comprised of differentially or specifically expressed TFs and miRNAs and their target genes in cancers. FFLtool is a valuable resource for investigating gene expression regulation and mechanism study in biological processes and diseases.
Availability and implementation
FFLtool is available on http://bioinfo.life.hust.edu.cn/FFLtool/.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gui-Yan Xie
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Mengxuan Xia
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Ya-Ru Miao
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Mei Luo
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Qiong Zhang
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - An-Yuan Guo
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| |
Collapse
|
17
|
Lv Y, Lin SY, Hu FF, Ye Z, Zhang Q, Wang Y, Guo AY. Landscape of cancer diagnostic biomarkers from specifically expressed genes. Brief Bioinform 2019; 21:2175-2184. [PMID: 31814027 DOI: 10.1093/bib/bbz131] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 08/25/2019] [Accepted: 09/08/2019] [Indexed: 12/31/2022] Open
Abstract
Although there has been great progress in cancer treatment, cancer remains a serious health threat to humans because of the lack of biomarkers for diagnosis, especially for early-stage diagnosis. In this study, we comprehensively surveyed the specifically expressed genes (SEGs) using the SEGtool based on the big data of gene expression from the The Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) projects. In 15 solid tumors, we identified 233 cancer-specific SEGs (cSEGs), which were specifically expressed in only one cancer and showed great potential to be diagnostic biomarkers. Among them, three cSEGs (OGDH, MUDENG and ACO2) had a sample frequency >80% in kidney cancer, suggesting their high sensitivity. Furthermore, we identified 254 cSEGs as early-stage diagnostic biomarkers across 17 cancers. A two-gene combination strategy was applied to improve the sensitivity of diagnostic biomarkers, and hundreds of two-gene combinations were identified with high frequency. We also observed that 13 SEGs were targets of various drugs and nearly half of these drugs may be repurposed to treat cancers with SEGs as their targets. Several SEGs were regulated by specific transcription factors in the corresponding cancer, and 39 cSEGs were prognosis-related genes in 7 cancers. This work provides a survey of cancer biomarkers for diagnosis and early diagnosis and new insights to drug repurposing. These biomarkers may have great potential in cancer research and application.
Collapse
Affiliation(s)
- Yao Lv
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China
| | - Sheng-Yan Lin
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China
| | - Fei-Fei Hu
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China
| | - Zheng Ye
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China.,Tianjin Key Laboratory of Medical Epigenetics, Key Laboratory of Breast Cancer Prevention and Therapy (Ministry of Education), Tianjin Key Laboratory of Spine and Spinal Cord, Department of Biochemistry and Molecular Biology, Tianjin Medical University, Tianjin 300070, P. R. China
| | - Qiong Zhang
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China
| | - Yan Wang
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China
| | - An-Yuan Guo
- Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P. R. China
| |
Collapse
|