1
|
Vahab N, Bonu T, Kuhlmann L, Ramialison M, Tyagi S. Uncovering co-regulatory modules and gene regulatory networks in the heart through machine learning-based analysis of large-scale epigenomic data. Comput Biol Med 2024; 171:108068. [PMID: 38354497 DOI: 10.1016/j.compbiomed.2024.108068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Revised: 12/30/2023] [Accepted: 01/27/2024] [Indexed: 02/16/2024]
Abstract
The availability of large-scale epigenomic data from various cell types and conditions has yielded valuable insights for evaluating and learning features predicting the co-binding of transcription factors (TF). However, prior attempts to develop models predicting motif co-occurrence lacked scalability for globally analyzing any motif combination or making cross-species predictions. Moreover, mapping co-regulatory modules (CRM) to gene regulatory networks (GRN) is crucial for understanding underlying function. Currently, no comprehensive pipeline exists for large-scale, rapid, and accurate CRM and GRN identification. In this study, we analyzed and evaluated different TF binding characteristics facilitating biologically significant co-binding to identify all potential clusters of co-binding TFs. We curated the UniBind database, containing ChIP-Seq data from over 1983 samples and 232 TFs, and implemented two machine learning models to predict CRMs and the potential regulatory networks they operate on. Two machine learning models, Convolution Neural Networks (CNN) and Random Forest Classifier(RFC), used to predict co-binding between TFs, were compared using precision-recall Receiver Operating Characteristic (ROC) curves. CNN outperformed RFC (AUC 0.94 vs. 0.88) and achieved higher F1 scores (0.938 vs. 0.872). The CRMs generated by the clustering algorithm were validated against ChipAtlas and MCOT, revealing additional motifs forming CRMs. We predicted 200k CRMs for 50k+ human genes, validated against recent CRM prediction methods with 100% overlap. Further, we narrowed our focus to study heart-related regulatory motifs, filtering the generated CRMs to report 1784 Cardiac CRMs containing at least four cardiac TFs. Identified cardiac CRMs revealed potential novel regulators like ARID3A and RXRB for SCAD, including known TFs like PPARG for F11R. Our findings highlight the importance of the NKX family of transcription factors in cardiac development and provide potential targets for further investigation in cardiac disease.
Collapse
Affiliation(s)
- Naima Vahab
- School of Computational Technologies, RMIT University, Melbourne VIC 3000, Australia; Department of Infectious Diseases, Alfred Hospital, Prahran VIC 3008, Australia
| | - Tarun Bonu
- Faculty of Information Technology, Monash University, Clayton VIC 3800, Australia
| | - Levin Kuhlmann
- Faculty of Information Technology, Monash University, Clayton VIC 3800, Australia
| | | | - Sonika Tyagi
- School of Computational Technologies, RMIT University, Melbourne VIC 3000, Australia; Department of Infectious Diseases, Alfred Hospital, Prahran VIC 3008, Australia.
| |
Collapse
|
2
|
Islam MK, Mummadi ST, Liu S, Wei H. Regulation of regeneration in Arabidopsis thaliana. ABIOTECH 2023; 4:332-351. [PMID: 38106435 PMCID: PMC10721781 DOI: 10.1007/s42994-023-00121-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 10/06/2023] [Indexed: 12/19/2023]
Abstract
We employed several algorithms with high efficacy to analyze the public transcriptomic data, aiming to identify key transcription factors (TFs) that regulate regeneration in Arabidopsis thaliana. Initially, we utilized CollaborativeNet, also known as TF-Cluster, to construct a collaborative network of all TFs, which was subsequently decomposed into many subnetworks using the Triple-Link and Compound Spring Embedder (CoSE) algorithms. Functional analysis of these subnetworks led to the identification of nine subnetworks closely associated with regeneration. We further applied principal component analysis and gene ontology (GO) enrichment analysis to reduce the subnetworks from nine to three, namely subnetworks 1, 12, and 17. Searching for TF-binding sites in the promoters of the co-expressed and co-regulated (CCGs) genes of all TFs in these three subnetworks and Triple-Gene Mutual Interaction analysis of TFs in these three subnetworks with the CCGs involved in regeneration enabled us to rank the TFs in each subnetwork. Finally, six potential candidate TFs-WOX9A, LEC2, PGA37, WIP5, PEI1, and AIL1 from subnetwork 1-were identified, and their roles in somatic embryogenesis (GO:0010262) and regeneration (GO:0031099) were discussed, so were the TFs in Subnetwork 12 and 17 associated with regeneration. The TFs identified were also assessed using the CIS-BP database and Expression Atlas. Our analyses suggest some novel TFs that may have regulatory roles in regeneration and embryogenesis and provide valuable data and insights into the regulatory mechanisms related to regeneration. The tools and the procedures used here are instrumental for analyzing high-throughput transcriptomic data and advancing our understanding of the regulation of various biological processes of interest. Supplementary Information The online version contains supplementary material available at 10.1007/s42994-023-00121-9.
Collapse
Affiliation(s)
- Md Khairul Islam
- Computational Science and Engineering Program, Michigan Technological University, Houghton, MI 49931 USA
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI 49931 USA
| | - Sai Teja Mummadi
- Computer Science, Michigan Technological University, Houghton, MI 49931 USA
| | - Sanzhen Liu
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506 USA
| | - Hairong Wei
- Computational Science and Engineering Program, Michigan Technological University, Houghton, MI 49931 USA
- Computer Science, Michigan Technological University, Houghton, MI 49931 USA
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI 49931 USA
| |
Collapse
|
3
|
Borthakur D, Busov V, Cao XH, Du Q, Gailing O, Isik F, Ko JH, Li C, Li Q, Niu S, Qu G, Vu THG, Wang XR, Wei Z, Zhang L, Wei H. Current status and trends in forest genomics. FORESTRY RESEARCH 2022; 2:11. [PMID: 39525413 PMCID: PMC11524260 DOI: 10.48130/fr-2022-0011] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 08/19/2022] [Indexed: 11/16/2024]
Abstract
Forests are not only the most predominant of the Earth's terrestrial ecosystems, but are also the core supply for essential products for human use. However, global climate change and ongoing population explosion severely threatens the health of the forest ecosystem and aggravtes the deforestation and forest degradation. Forest genomics has great potential of increasing forest productivity and adaptation to the changing climate. In the last two decades, the field of forest genomics has advanced quickly owing to the advent of multiple high-throughput sequencing technologies, single cell RNA-seq, clustered regularly interspaced short palindromic repeats (CRISPR)-mediated genome editing, and spatial transcriptomes, as well as bioinformatics analysis technologies, which have led to the generation of multidimensional, multilayered, and spatiotemporal gene expression data. These technologies, together with basic technologies routinely used in plant biotechnology, enable us to tackle many important or unique issues in forest biology, and provide a panoramic view and an integrative elucidation of molecular regulatory mechanisms underlying phenotypic changes and variations. In this review, we recapitulated the advancement and current status of 12 research branches of forest genomics, and then provided future research directions and focuses for each area. Evidently, a shift from simple biotechnology-based research to advanced and integrative genomics research, and a setup for investigation and interpretation of many spatiotemporal development and differentiation issues in forest genomics have just begun to emerge.
Collapse
Affiliation(s)
- Dulal Borthakur
- Dulal Borthakur, Department of Molecular Biosciences and Bioengineering, University of Hawaii at Manoa, 1955 East-West Road, Honolulu, HI 96822, USA
| | - Victor Busov
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI 49931, USA
| | - Xuan Hieu Cao
- Forest Genetics and Forest Tree Breeding, Faculty for Forest Sciences and Forest Ecology, University of Göttingen, Büsgenweg 2, 37077 Göttingen, Germany
| | - Qingzhang Du
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, P.R. China
| | - Oliver Gailing
- Forest Genetics and Forest Tree Breeding, Faculty for Forest Sciences and Forest Ecology, University of Göttingen, Büsgenweg 2, 37077 Göttingen, Germany
| | - Fikret Isik
- Cooperative Tree Improvement Program, North Carolina State University, Raleigh, NC 27695, USA
| | - Jae-Heung Ko
- Department of Plant & Environmental New Resources, Kyung Hee University, 1732 Deogyeong-daero, Yongin 17104, Republic of Korea
| | - Chenghao Li
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin 150040, P.R. China
| | - Quanzi Li
- State Key Laboratory of Tree Genetics and Breeding, Chinese Academy of Forestry, Beijing 100093, P.R. China
| | - Shihui Niu
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, P.R. China
| | - Guanzheng Qu
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin 150040, P.R. China
| | - Thi Ha Giang Vu
- Forest Genetics and Forest Tree Breeding, Faculty for Forest Sciences and Forest Ecology, University of Göttingen, Büsgenweg 2, 37077 Göttingen, Germany
| | - Xiao-Ru Wang
- Department of Ecology and Environmental Science, Umeå Plant Science Centre, Umeå University, Umeå 90187, Sweden
| | - Zhigang Wei
- College of Life Sciences, Heilongjiang University, Harbin 150080, P. R. China
| | - Lin Zhang
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees, Ministry of Education, Central South University of Forestry and Technology, Changsha 410004, Hunan Province, P.R. China
| | - Hairong Wei
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI 49931, USA
| |
Collapse
|
4
|
Tao S, Zhang W. Network and epigenetic characterization of subsets of genes specifically expressed in maize bundle sheath cells. Comput Struct Biotechnol J 2022; 20:3581-3590. [PMID: 35860403 PMCID: PMC9287181 DOI: 10.1016/j.csbj.2022.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 07/02/2022] [Accepted: 07/02/2022] [Indexed: 11/21/2022] Open
Abstract
Bundle sheath (BS) cells exhibit dramatically structural differences and functional variations at physiological, biochemical and epigenetic levels as compared to mesophyll (M) cells in maize. The regulatory mechanisms controlling functional divergences between M and BS have been extensively investigated. However, BS cell-related regulatory networks are still not completely characterized. To address this, we herein conducted WGCNA-related co-expression assays using bulk M and BS cell RNA-seq data sets and identified a module containing 384 genes highly expressed in BS cells (including 20 hub TFs) instead of M cells. According to the hub TF centered regulatory network, we found that Dof22 and Dof30 might act as key regulators in the regulation of expression of BS-specific genes, and several MYB TFs exhibited a high collaboration with Dof TFs. By comparing the enrichment levels of histone modifications, we found that genes in the aforementioned module were more enriched with histone acetylation as compared to other BS-enriched DEGs with similar expression levels. Moreover, we found that a subset of genes functioning in photosynthesis, protein auto processing and enzymatic activities were significantly enriched with broad H3K4me3. Thus, our study provides evidence showing that regulatory network and histone modifications may play vital roles in the regulation of a subset of genes with important functions in BS cells.
Collapse
Affiliation(s)
- Shentong Tao
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Collaborative Innovation Center for Modern Crop Production co-sponsored by Province and Ministry (CIC-MCP), Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, PR China
| | - Wenli Zhang
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Collaborative Innovation Center for Modern Crop Production co-sponsored by Province and Ministry (CIC-MCP), Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, PR China
| |
Collapse
|
5
|
Wang Y, Wang L, Chen S, Chen S. A study of RNA-editing in Populus trichocarpa nuclei revealed acquisition of RNA-editing on the endosymbiont-derived genes, and a preference for intracellular remodeling genes in adaptation to endosymbiosis. FORESTRY RESEARCH 2021; 1:20. [PMID: 39524518 PMCID: PMC11524294 DOI: 10.48130/fr-2021-0020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 11/01/2021] [Indexed: 11/16/2024]
Abstract
RNA-editing is a post-transcriptional modification that can diversify genome-encoded information by modifying individual RNA bases. In contrast to the well-studied RNA-editing in organelles, little is known about nuclear RNA-editing in higher plants. We performed a genome-wide study of RNA-editing in Populus trichocarpa nuclei using the RNA-seq data generated from the sequenced poplar genotype, 'Nisqually-1'. A total of 24,653 nuclear RNA-editing sites present in 8,603 transcripts were identified. Notably, RNA-editing in P. trichocarpa nuclei tended to occur on endosymbiont-derived genes. We then scrutinized RNA-editing in a cyanobacterial strain closely related to chloroplast. No RNA-editing sites were identified therein, implying that RNA-editing of these endosymbiont-derived genes was acquired after endosymbiosis. Gene ontology enrichment analysis of all the edited genes in P. trichocarpa nuclei demonstrated that nuclear RNA-editing was primarily focused on genes involved in intracellular remodeling processes, which suggests that RNA-editing plays contributing roles in organellar establishment during endosymbiosis. We built a coexpression network using all C-to-U edited genes and then decomposed it to obtain 18 clusters, six of which contained a conserved core motif, A/G-C-A/G. Such a short core motif not only attracted the RNA-editing machinery but also enabled large numbers of sites to be targeted though further study is necessary to verify this finding.
Collapse
Affiliation(s)
- Yiran Wang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin 150040, China
| | - Lihu Wang
- School of Landscape and Ecological Engineering, Hebei University of Engineering, Handan 056000, China
| | - Su Chen
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin 150040, China
| | - Song Chen
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin 150040, China
| |
Collapse
|
6
|
Ma JJ, Chen X, Song YT, Zhang GF, Zhou XQ, Que SP, Mao F, Pervaiz T, Lin JX, Li Y, Li W, Wu HX, Niu SH. MADS-box transcription factors MADS11 and DAL1 interact to mediate the vegetative-to-reproductive transition in pine. PLANT PHYSIOLOGY 2021; 187:247-262. [PMID: 34618133 PMCID: PMC8418398 DOI: 10.1093/plphys/kiab250] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 05/12/2021] [Indexed: 06/13/2023]
Abstract
The reproductive transition is an important event that is crucial for plant survival and reproduction. Relative to the thorough understanding of the vegetative phase transition in angiosperms, a little is known about this process in perennial conifers. To gain insight into the molecular basis of the regulatory mechanism in conifers, we used temporal dynamic transcriptome analysis with samples from seven different ages of Pinus tabuliformis to identify a gene module substantially associated with aging. The results first demonstrated that the phase change in P. tabuliformis occurred as an unexpectedly rapid transition rather than a slow, gradual progression. The age-related gene module contains 33 transcription factors and was enriched in genes that belong to the MADS (MCMl, AGAMOUS, DEFICIENS, SRF)-box family, including six SOC1-like genes and DAL1 and DAL10. Expression analysis in P. tabuliformis and a late-cone-setting P. bungeana mutant showed a tight association between PtMADS11 and reproductive competence. We then confirmed that MADS11 and DAL1 coordinate the aging pathway through physical interaction. Overexpression of PtMADS11 and PtDAL1 partially rescued the flowering of 35S::miR156A and spl1,2,3,4,5,6 mutants in Arabidopsis (Arabidopsis thaliana), but only PtMADS11 could rescue the flowering of the ft-10 mutant, suggesting PtMADS11 and PtDAL1 play different roles in flowering regulatory networks in Arabidopsis. The PtMADS11 could not alter the flowering phenotype of soc1-1-2, indicating it may function differently from AtSOC1 in Arabidopsis. In this study, we identified the MADS11 gene in pine as a regulatory mediator of the juvenile-to-adult transition with functions differentiated from the angiosperm SOC1.
Collapse
Affiliation(s)
- Jing-Jing Ma
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Xi Chen
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Yi-Tong Song
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Gui-Fang Zhang
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Xian-Qing Zhou
- Qigou State-Owned Forest Farm, Pingquan, Hebei Province 067509, PR China
| | - Shu-Peng Que
- Beijing Ming Tombs Forest Farm, Beijing 102200, PR China, Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre, Swedish University of Agricultural Sciences, Umeå SE-901 83, Sweden
| | - Fei Mao
- Beijing Ming Tombs Forest Farm, Beijing 102200, PR China, Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre, Swedish University of Agricultural Sciences, Umeå SE-901 83, Sweden
| | - Tariq Pervaiz
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Jin-Xing Lin
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Yue Li
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Wei Li
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Harry X. Wu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Shi-Hui Niu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| |
Collapse
|
7
|
Wang L, Xie J, Du Q, Song F, Xiao L, Quan M, Zhang D. Transcription factors involved in the regulatory networks governing the Calvin-Benson-Bassham cycle. TREE PHYSIOLOGY 2019; 39:1159-1172. [PMID: 30941430 DOI: 10.1093/treephys/tpz025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 02/18/2019] [Accepted: 03/31/2019] [Indexed: 06/09/2023]
Abstract
Transcription factors (TFs) play crucial roles in the regulation of photosynthesis; elucidating these roles will facilitate our understanding of photosynthesis and thus accelerate its improvement for enhancing crop yield. Promoter analysis of 52 nuclear-encoded Populus tomentosa Carr. genes involved in the Calvin-Benson-Bassham (CBB) cycle revealed 706 motifs and 326 potentially interacting TFs. A backward elimination random forest (BWERF) algorithm reduced the number of TFs to 40, involved in a three-layer gene regulatory network (GRN) including 46 photosynthesis genes (bottom layer), 25 TFs (second layer) and 15 TFs (top layer). Phenotype-genotype association identified 248 single-nucleotide polymorphisms (SNPs) within 72 genes associated with 11 photosynthesis traits. Of the regulatory pairs identified by the BWERF (202 pairs), 77 TF-target combinations harbored SNPs associated with the same trait, supporting similar mechanisms of phenotype modulation. We used expression quantitative trait nucleotide (eQTN) analysis to identify causal SNPs affecting gene expression, identifying 1851 eQTN signals for 50 eGenes (genes whose expressions are regulated by eQTNs). Distribution patterns identified 14 eQTNs from seven TFs associated with eight expression levels of their downstream targets (defined in the GRN), whereas seven TF-target pairs were also identified by phenotype-genotype associations. To further validate the roles of TFs at the metabolic level, we selected 6764 SNPs from 55 genes (identified by GRN-association or GRN-eQTN pairs or both) for metabolic association, identifying variants within 10 TFs affecting metabolic processes underlying the CBB cycle. Our study provides new insights into the photosynthesis pathway in poplar and may facilitate understanding of processes underlying photosynthesis improvement.
Collapse
Affiliation(s)
- Longxin Wang
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Jianbo Xie
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Qingzhang Du
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Fangyuan Song
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Liang Xiao
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Mingyang Quan
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Deqiang Zhang
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| |
Collapse
|
8
|
Niu SH, Liu SW, Ma JJ, Han FX, Li Y, Li W. The transcriptional activity of a temperature-sensitive transcription factor module is associated with pollen shedding time in pine. TREE PHYSIOLOGY 2019; 39:1173-1186. [PMID: 31073594 DOI: 10.1093/treephys/tpz023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Revised: 02/07/2019] [Accepted: 02/18/2019] [Indexed: 06/09/2023]
Abstract
It has long been known that the pollen shedding time in pine trees is correlated with temperature, but the molecular basis for this has remained largely unknown. To better understand the mechanisms driving temperature response and to identify the hub regulators of pollen shedding time regulation in Pinus tabuliformis Carr., we identified a set of temperature-sensitive genes by carrying out a comparative transcriptome analysis using six early pollen shedding trees (EPs) and six late pollen shedding trees (LPs) during mid-winter and at three consecutive time points in early spring. We carried out a weighted gene co-expression network analysis and constructed a transcription factor (TF) collaborative network, merging the common but differentially expressed TFs of the EPs and LPs into a joint network. We found five hub genes in the core TF module whose expression was rapidly induced by low temperatures. The transcriptional activity of this TF module was strongly associated with pollen shedding time, and likely to produce the fine balance between cold hardiness and growth activity in early spring. We confirmed the key role of temperature in regulating flowering time and identified a transcription factor module associated with pollen shedding time in P. tabuliformis. This suggests that repression of growth activity by repressors is the main mechanism balancing growth and cold hardiness in pine trees in early spring. Our results provide new insights into the molecular mechanisms regulating seasonal flowering time in pines.
Collapse
Affiliation(s)
- Shi-Hui Niu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Forest Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Shuang-Wei Liu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Forest Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Jing-Jing Ma
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Forest Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Fang-Xu Han
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Forest Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Yue Li
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Forest Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| | - Wei Li
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Forest Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| |
Collapse
|
9
|
Alberto D, Couée I, Pateyron S, Sulmon C, Gouesbet G. Low doses of triazine xenobiotics mobilize ABA and cytokinin regulations in a stress- and low-energy-dependent manner. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2018; 274:8-22. [PMID: 30080643 DOI: 10.1016/j.plantsci.2018.04.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Revised: 04/25/2018] [Accepted: 04/28/2018] [Indexed: 06/08/2023]
Abstract
The extent of residual contaminations of pesticides through drift, run-off and leaching is a potential threat to non-target plant communities. Arabidopsis thaliana responds to low doses of the herbicide atrazine, and of its degradation products, desethylatrazine and hydroxyatrazine, not only in the long term, but also under conditions of short-term exposure. In order to investigate underlying molecular mechanisms of low-dose responses and to decipher commonalities and specificities between different chemical treatments, parallel transcriptomic studies of the early effects of the atrazine-desethylatrazine-hydroxyatrazine chemical series were undertaken using whole-genome microarrays. All of the triazines under study produced coordinated and specific changes in gene expression. Hydroxyatrazine-responsive genes were mainly linked to root development, whereas atrazine and desethylatrazine mostly affected molecular signaling networks implicated in stress and hormone responses. Analysis of signaling-related genes, promoter sites and shared-function interaction networks highlighted the involvement of energy-, stress-, abscisic acid- and cytokinin-regulated processes, and emphasized the importance of cold-, heat- and drought-related signaling in the perception of low doses of triazines. These links between low-dose xenobiotic impacts and stress-hormone crosstalk pathways give novel insights into plant-pesticide interactions and plant-pollution interactions that are essential for toxicity evaluation in the context of environmental risk assessment.
Collapse
Affiliation(s)
- Diana Alberto
- Université de Rennes 1 / Centre National de la Recherche Scientifique, UMR 6553 ECOBIO, Rennes, F-35000, France
| | - Ivan Couée
- Université de Rennes 1 / Centre National de la Recherche Scientifique, UMR 6553 ECOBIO, Rennes, F-35000, France
| | - Stéphanie Pateyron
- Institute of Plant Sciences Paris Saclay IPS2, CNRS, INRA, Université Paris-Sud, Université Evry, Université Paris-Saclay, Orsay, France; Institute of Plant Sciences Paris-Saclay IPS2, Paris Diderot, Sorbonne Paris-Cité, Orsay, France
| | - Cécile Sulmon
- Université de Rennes 1 / Centre National de la Recherche Scientifique, UMR 6553 ECOBIO, Rennes, F-35000, France
| | - Gwenola Gouesbet
- Université de Rennes 1 / Centre National de la Recherche Scientifique, UMR 6553 ECOBIO, Rennes, F-35000, France.
| |
Collapse
|
10
|
Gonzalez-Dominguez J, Martin MJ. MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1732-1737. [PMID: 29028205 DOI: 10.1109/tcbb.2017.2761340] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
In this work, we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. The source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.
Collapse
|
11
|
Ji X, Chen S, Li JC, Deng W, Wei Z, Wei H. SSGA and MSGA: two seed-growing algorithms for constructing collaborative subnetworks. Sci Rep 2017; 7:1446. [PMID: 28469138 PMCID: PMC5431152 DOI: 10.1038/s41598-017-01556-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 03/30/2017] [Indexed: 11/09/2022] Open
Abstract
The establishment of a collaborative network of transcription factors (TFs) followed by decomposition and then construction of subnetworks is an effective way to obtain sets of collaborative TFs; each set controls a biological process or a complex trait. We previously developed eight gene association methods for genome-wide coexpression analysis between each TF and all other genomic genes and then constructing collaborative networks of TFs but only one algorithm, called Triple-Link Algorithm, for building collaborative subnetworks. In this study, we developed two more algorithms, Single Seed-Growing Algorithm (SSGA) and Multi-Seed Growing Algorithm (MSGA), for building collaborative subnetworks of TFs by identifying the fully-linked triple-node seeds from a decomposed collaborative network and then growing them into subnetworks with two different strategies. The subnetworks built from the three algorithms described above were comparatively appraised in terms of both functional cohesion and intra-subnetwork association strengths versus inter-subnetwork association strengths. We concluded that SSGA and MSGA, which performed more systemic comparisons and analyses of edge weights and network connectivity during subnetwork construction processes, yielded more functional and cohesive subnetworks than Triple-Link Algorithm. Together, these three algorithms provide alternate approaches for acquiring subnetworks of collaborative TFs. We also presented a framework to outline how to use these three algorithms to obtain collaborative TF sets governing biological processes or complex traits.
Collapse
Affiliation(s)
- Xiaohui Ji
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, Heilongjiang, 150040, P.R. China.,State Key Lab of Forest Genetics and Breeding, Northeast Forestry University, Harbin, Heilongjiang, 150040, P.R. China
| | - Su Chen
- State Key Lab of Forest Genetics and Breeding, Northeast Forestry University, Harbin, Heilongjiang, 150040, P.R. China
| | - Jun Cheng Li
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangzhou, 510642, P.R. China
| | - Wenping Deng
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA
| | - Zhigang Wei
- State Key Lab of Forest Genetics and Breeding, Northeast Forestry University, Harbin, Heilongjiang, 150040, P.R. China
| | - Hairong Wei
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA. .,Department of Computer Science, Michigan Technological University, Houghton, MI, 49931, USA. .,Life Science and Technology Institute, Michigan Technological University, Houghton, MI, 49931, USA.
| |
Collapse
|
12
|
Hermosilla VE, Hepp MI, Escobar D, Farkas C, Riffo EN, Castro AF, Pincheira R. Developmental SALL2 transcription factor: a new player in cancer. Carcinogenesis 2017; 38:680-690. [DOI: 10.1093/carcin/bgx036] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2016] [Accepted: 04/11/2017] [Indexed: 11/12/2022] Open
|
13
|
Kumari S, Deng W, Gunasekara C, Chiang V, Chen HS, Ma H, Davis X, Wei H. Bottom-up GGM algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways or processes. BMC Bioinformatics 2016; 17:132. [PMID: 26993098 PMCID: PMC4797117 DOI: 10.1186/s12859-016-0981-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Accepted: 03/09/2016] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Multilayered hierarchical gene regulatory networks (ML-hGRNs) are very important for understanding genetics regulation of biological pathways. However, there are currently no computational algorithms available for directly building ML-hGRNs that regulate biological pathways. RESULTS A bottom-up graphic Gaussian model (GGM) algorithm was developed for constructing ML-hGRN operating above a biological pathway using small- to medium-sized microarray or RNA-seq data sets. The algorithm first placed genes of a pathway at the bottom layer and began to construct a ML-hGRN by evaluating all combined triple genes: two pathway genes and one regulatory gene. The algorithm retained all triple genes where a regulatory gene significantly interfered two paired pathway genes. The regulatory genes with highest interference frequency were kept as the second layer and the number kept is based on an optimization function. Thereafter, the algorithm was used recursively to build a ML-hGRN in layer-by-layer fashion until the defined number of layers was obtained or terminated automatically. CONCLUSIONS We validated the algorithm and demonstrated its high efficiency in constructing ML-hGRNs governing biological pathways. The algorithm is instrumental for biologists to learn the hierarchical regulators associated with a given biological pathway from even small-sized microarray or RNA-seq data sets.
Collapse
Affiliation(s)
- Sapna Kumari
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA
| | - Wenping Deng
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA
| | - Chathura Gunasekara
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA
| | - Vincent Chiang
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA
| | - Huann-Sheng Chen
- Statistical Methodology and Applications Branch, Division of Cancer Control and Population Sciences, National Cancer Institute, National Institutes of Health, Rockville, MD, 20850, USA
| | - Hao Ma
- NCCWA, USDA ARS, Kearneysville, WV, 25430, USA
| | - Xin Davis
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA
| | - Hairong Wei
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA.
| |
Collapse
|
14
|
Clustering and Differential Alignment Algorithm: Identification of Early Stage Regulators in the Arabidopsis thaliana Iron Deficiency Response. PLoS One 2015; 10:e0136591. [PMID: 26317202 PMCID: PMC4552565 DOI: 10.1371/journal.pone.0136591] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2015] [Accepted: 08/05/2015] [Indexed: 11/25/2022] Open
Abstract
Time course transcriptome datasets are commonly used to predict key gene regulators associated with stress responses and to explore gene functionality. Techniques developed to extract causal relationships between genes from high throughput time course expression data are limited by low signal levels coupled with noise and sparseness in time points. We deal with these limitations by proposing the Cluster and Differential Alignment Algorithm (CDAA). This algorithm was designed to process transcriptome data by first grouping genes based on stages of activity and then using similarities in gene expression to predict influential connections between individual genes. Regulatory relationships are assigned based on pairwise alignment scores generated using the expression patterns of two genes and some inferred delay between the regulator and the observed activity of the target. We applied the CDAA to an iron deficiency time course microarray dataset to identify regulators that influence 7 target transcription factors known to participate in the Arabidopsis thaliana iron deficiency response. The algorithm predicted that 7 regulators previously unlinked to iron homeostasis influence the expression of these known transcription factors. We validated over half of predicted influential relationships using qRT-PCR expression analysis in mutant backgrounds. One predicted regulator-target relationship was shown to be a direct binding interaction according to yeast one-hybrid (Y1H) analysis. These results serve as a proof of concept emphasizing the utility of the CDAA for identifying unknown or missing nodes in regulatory cascades, providing the fundamental knowledge needed for constructing predictive gene regulatory networks. We propose that this tool can be used successfully for similar time course datasets to extract additional information and infer reliable regulatory connections for individual genes.
Collapse
|
15
|
Elucidating functional context within microarray data by integrated transcription factor-focused gene-interaction and regulatory network analysis. Eur Cytokine Netw 2014; 24:75-90. [PMID: 23822978 DOI: 10.1684/ecn.2013.0336] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Microarrays do not yield direct evidence for functional connections between genes. However, transcription factors (TFs) and their binding sites (TFBSs) in promoters are important for inducing and coordinating changes in RNA levels, and thus represent the first layer of functional interaction. Similar to genes, TFs act only in context, which is why a TF/TFBS-based promoter analysis of genes needs to be done in the form of gene(TF)-gene networks, not individual TFs or TFBSs. In addition, integration of the literature and various databases (e.g. GO, MeSH, etc) allows the adding of genes relevant for the functional context of the data even if they were initially missed by the microarray as their RNA levels did not change significantly. Here, we outline a TF-TFBSs network-based strategy to assess the involvement of transcription factors in agonist signaling and demonstrate its utility in deciphering the response of human microvascular endothelial cells (HMEC-1) to leukemia inhibitory factor (LIF). Our strategy identified a central core of eight TFs, of which only STAT3 had previously been definitively linked to LIF in endothelial cells. We also found potential molecular mechanisms of gene regulation in HMEC-1 upon stimulation with LIF that allow for the prediction of changes of genes not used in the analysis. Our approach, which is readily applicable to a wide variety of expression microarray and next generation sequencing RNA-seq results, illustrates the power of a TF-gene networking approach for elucidation of the underlying biology.
Collapse
|
16
|
DeGNServer: deciphering genome-scale gene networks through high performance reverse engineering analysis. BIOMED RESEARCH INTERNATIONAL 2013; 2013:856325. [PMID: 24328032 PMCID: PMC3847961 DOI: 10.1155/2013/856325] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Accepted: 10/01/2013] [Indexed: 12/23/2022]
Abstract
Analysis of genome-scale gene networks (GNs) using large-scale gene expression data provides unprecedented opportunities to uncover gene interactions and regulatory networks involved in various biological processes and developmental programs, leading to accelerated discovery of novel knowledge of various biological processes, pathways and systems. The widely used context likelihood of relatedness (CLR) method based on the mutual information (MI) for scoring the similarity of gene pairs is one of the accurate methods currently available for inferring GNs. However, the MI-based reverse engineering method can achieve satisfactory performance only when sample size exceeds one hundred. This in turn limits their applications for GN construction from expression data set with small sample size. We developed a high performance web server, DeGNServer, to reverse engineering and decipher genome-scale networks. It extended the CLR method by integration of different correlation methods that are suitable for analyzing data sets ranging from moderate to large scale such as expression profiles with tens to hundreds of microarray hybridizations, and implemented all analysis algorithms using parallel computing techniques to infer gene-gene association at extraordinary speed. In addition, we integrated the SNBuilder and GeNa algorithms for subnetwork extraction and functional module discovery. DeGNServer is publicly and freely available online.
Collapse
|
17
|
Li J, Wei H, Liu T, Zhao PX. GPLEXUS: enabling genome-scale gene association network reconstruction and analysis for very large-scale expression data. Nucleic Acids Res 2013; 42:e32. [PMID: 24178033 PMCID: PMC3950724 DOI: 10.1093/nar/gkt983] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The accurate construction and interpretation of gene association networks (GANs) is challenging, but crucial, to the understanding of gene function, interaction and cellular behavior at the genome level. Most current state-of-the-art computational methods for genome-wide GAN reconstruction require high-performance computational resources. However, even high-performance computing cannot fully address the complexity involved with constructing GANs from very large-scale expression profile datasets, especially for the organisms with medium to large size of genomes, such as those of most plant species. Here, we present a new approach, GPLEXUS (http://plantgrn.noble.org/GPLEXUS/), which integrates a series of novel algorithms in a parallel-computing environment to construct and analyze genome-wide GANs. GPLEXUS adopts an ultra-fast estimation for pairwise mutual information computing that is similar in accuracy and sensitivity to the Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNE) method and runs ∼1000 times faster. GPLEXUS integrates Markov Clustering Algorithm to effectively identify functional subnetworks. Furthermore, GPLEXUS includes a novel ‘condition-removing’ method to identify the major experimental conditions in which each subnetwork operates from very large-scale gene expression datasets across several experimental conditions, which allows users to annotate the various subnetworks with experiment-specific conditions. We demonstrate GPLEXUS’s capabilities by construing global GANs and analyzing subnetworks related to defense against biotic and abiotic stress, cell cycle growth and division in Arabidopsis thaliana.
Collapse
Affiliation(s)
- Jun Li
- Plant Biology Division, the Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA and School of Forest Resources and Environmental Science, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA
| | | | | | | |
Collapse
|
18
|
Wei H, Yordanov YS, Georgieva T, Li X, Busov V. Nitrogen deprivation promotes Populus root growth through global transcriptome reprogramming and activation of hierarchical genetic networks. THE NEW PHYTOLOGIST 2013; 200:483-497. [PMID: 23795675 DOI: 10.1111/nph.12375] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 05/20/2013] [Indexed: 05/07/2023]
Abstract
We show a distinct and previously poorly characterized response of poplar (Populus tremula × Populus alba) roots to low nitrogen (LN), which involves activation of root growth and significant transcriptome reprogramming. Analysis of the temporal patterns of enriched ontologies among the differentially expressed genes revealed an ordered assembly of functionally cohesive biological events that aligned well with growth and morphological responses. A core set of 28 biological processes was significantly enriched across the whole studied period and 21 of these were also enriched in the roots of Arabidopsis thaliana during the LN response. More than half (15) of the 28 processes belong to gene ontology (GO) terms associated with signaling and signal transduction pathways, suggesting the presence of conserved signaling mechanisms triggered by LN. A reconstruction of genetic regulatory network analysis revealed a sub-network centered on a PtaNAC1 (P. tremula × alba NAM, ATAF, CUC 1) transcription factor. PtaNAC1 root-specific up-regulation increased root biomass and significantly changed the expression of the connected hub genes specifically under LN. Our results provide evidence that the root response to LN involves hierarchically structured genetic networks centered on key regulatory factors. Targeting these factors via genetic engineering or breeding approaches can allow dynamic adjustment of root architecture in response to variable nitrogen availabilities in the soil.
Collapse
Affiliation(s)
- Hairong Wei
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931-1295, USA
- Biotechnology Research Center, Michigan Technological University, 1400 Townsend Drive, Houghton, MI, 49931, USA
- Computer Science, Michigan Technological University, 1400 Townsend Drive, Houghton, MI, 49931, USA
| | - Yordan S Yordanov
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931-1295, USA
| | - Tatyana Georgieva
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931-1295, USA
| | - Xiang Li
- Computer Science, Michigan Technological University, 1400 Townsend Drive, Houghton, MI, 49931, USA
| | - Victor Busov
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931-1295, USA
- Biotechnology Research Center, Michigan Technological University, 1400 Townsend Drive, Houghton, MI, 49931, USA
| |
Collapse
|
19
|
Kumari S, Nie J, Chen HS, Ma H, Stewart R, Li X, Lu MZ, Taylor WM, Wei H. Evaluation of gene association methods for coexpression network construction and biological knowledge discovery. PLoS One 2012; 7:e50411. [PMID: 23226279 PMCID: PMC3511551 DOI: 10.1371/journal.pone.0050411] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2012] [Accepted: 10/18/2012] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS In this study, we compared eight gene association methods - Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson - and focused on their true knowledge discovery rates in associating pathway genes and construction coordination networks of regulatory genes. We also examined the behaviors of different methods to microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some analyses we did clearly show Pearson and Distance Covariance methods have distinct behaviors as compared to all other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates the pre-analysis to identify the best performing method for gene association and coexpression network construction.
Collapse
Affiliation(s)
- Sapna Kumari
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, United States of America
| | - Jeff Nie
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Huann-Sheng Chen
- Statistical Methodology and Applications Branch, Division of Cancer Control and Population Sciences, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Hao Ma
- Division of Animal and Nutritional Sciences, West Virginia University, Morgantown, West Virginia, United States of America
| | - Ron Stewart
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Xiang Li
- Department of Computer Science, Michigan Technological University, Houghton, Michigan, United States of America
| | - Meng-Zhu Lu
- State Key Laboratory of Tree Genetics and Breeding, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, P.R. China
| | - William M. Taylor
- Department of Computer Science, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Hairong Wei
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, United States of America
- Department of Computer Science, Michigan Technological University, Houghton, Michigan, United States of America
- Biotechnology Research Center, Michigan Technological University, Houghton, Michigan, United States of America
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, Michigan, United States of America
| |
Collapse
|
20
|
Mu HZ, Liu ZJ, Lin L, Li HY, Jiang J, Liu GF. Transcriptomic analysis of phenotypic changes in birch (Betula platyphylla) autotetraploids. Int J Mol Sci 2012. [PMID: 23202935 PMCID: PMC3497309 DOI: 10.3390/ijms131013012] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Plant breeders have focused much attention on polyploid trees because of their importance to forestry. To evaluate the impact of intraspecies genome duplication on the transcriptome, a series of Betula platyphylla autotetraploids and diploids were generated from four full-sib families. The phenotypes and transcriptomes of these autotetraploid individuals were compared with those of diploid trees. Autotetraploids were generally superior in breast-height diameter, volume, leaf, fruit and stoma and were generally inferior in height compared to diploids. Transcriptome data revealed numerous changes in gene expression attributable to autotetraploidization, which resulted in the upregulation of 7052 unigenes and the downregulation of 3658 unigenes. Pathway analysis revealed that the biosynthesis and signal transduction of indoleacetate (IAA) and ethylene were altered after genome duplication, which may have contributed to phenotypic changes. These results shed light on variations in birch autotetraploidization and help identify important genes for the genetic engineering of birch trees.
Collapse
Affiliation(s)
- Huai-Zhi Mu
- State Key Laboratory of Forest Genetics and Tree Breeding, Northeast Forestry University, 26 Hexing Road, Harbin 150040, China.
| | | | | | | | | | | |
Collapse
|