1
|
Wang G, Alias SB, Sun Z, Wang F, Fan A, Hu H. Influential nodes identification method based on adaptive adjustment of voting ability. Heliyon 2023; 9:e16112. [PMID: 37215850 PMCID: PMC10196995 DOI: 10.1016/j.heliyon.2023.e16112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 04/30/2023] [Accepted: 05/05/2023] [Indexed: 05/24/2023] Open
Abstract
Influential nodes identification technology is one of the important topics which has been widely applied to logistics node location, social information dissemination, transportation network carrying, biological virus dissemination, power network anti-destruction, etc. At present, a large number of influential nodes identification methods have been studied, but the algorithms that are simple to execute, have high accuracy and can be better applied to real networks are still the focus of research. Therefore, due to the advantages of simple to execute in voting mechanism, a novel algorithm based on adaptive adjustment of voting ability (AAVA) to identify the influential nodes is presented by considering the local attributes of node and the voting contribution of its neighbor nodes, to solve the problem of low accuracy and discrimination of the existing algorithms. This proposed algorithm uses the similarity between the voting node and the voted node to dynamically adjust its voting ability without setting any parameters, so that a node can contribute different voting abilities to different neighbor nodes. To verify the performance of AAVA algorithm, the running results of 13 algorithms are analyzed and compared on 10 different networks with the SIR model as a reference. The experimental results show that the influential nodes identified by AAVA have high consistency with SIR model in Top-10 nodes and Kendall correlation, and have better infection effect of the network. Therefore, it is proved that AAV algorithm has high accuracy and effectiveness, and can be applied to real complex networks of different types and sizes.
Collapse
Affiliation(s)
- Guan Wang
- School of Information Engineering, Pingdingshan University, Pingdingshan, Henan, China
- Faculty of Engineering, Built Environment & Information Technology, SEGI University, Malaysia
| | - Syazwina Binti Alias
- Faculty of Engineering, Built Environment & Information Technology, SEGI University, Malaysia
| | - Zejun Sun
- School of Information Engineering, Pingdingshan University, Pingdingshan, Henan, China
| | - Feifei Wang
- School of Information Engineering, Pingdingshan University, Pingdingshan, Henan, China
| | - Aiwan Fan
- School of Information Engineering, Pingdingshan University, Pingdingshan, Henan, China
| | - Haifeng Hu
- School of Information Engineering, Pingdingshan University, Pingdingshan, Henan, China
| |
Collapse
|
2
|
Tang R, Miao Z, Jiang S, Chen X, Wang H, Wang W. Interlayer Link Prediction in Multiplex Social Networks Based on Multiple Types of Consistency Between Embedding Vectors. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:2426-2439. [PMID: 34735350 DOI: 10.1109/tcyb.2021.3120134] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Online users are typically active on multiple social media networks (SMNs), which constitute a multiplex social network. With improvements in cybersecurity awareness, users increasingly choose different usernames and provide different profiles on different SMNs. Thus, it is becoming increasingly challenging to determine whether given accounts on different SMNs belong to the same user; this can be expressed as an interlayer link prediction problem in a multiplex network. To address the challenge of predicting interlayer links, feature or structure information is leveraged. Existing methods that use network embedding techniques to address this problem focus on learning a mapping function to unify all nodes into a common latent representation space for prediction; positional relationships between unmatched nodes and their common matched neighbors (CMNs) are not utilized. Furthermore, the layers are often modeled as unweighted graphs, ignoring the strengths of the relationships between nodes. To address these limitations, we propose a framework based on multiple types of consistency between embedding vectors (MulCEVs). In MulCEV, the traditional embedding-based method is applied to obtain the degree of consistency between the vectors representing the unmatched nodes, and a proposed distance consistency index based on the positions of nodes in each latent space provides additional clues for prediction. By associating these two types of consistency, the effective information in the latent spaces is fully utilized. In addition, MulCEV models the layers as weighted graphs to obtain representation. In this way, the higher the strength of the relationship between nodes, the more similar their embedding vectors in the latent representation space will be. The results of our experiments on several real-world and synthetic datasets demonstrate that the proposed MulCEV framework markedly outperforms current embedding-based methods, especially when the number of training iterations is small.
Collapse
|
3
|
Su H, Chen D, Pan GJ, Zeng Z. Identification of Network Topology Variations Based on Spectral Entropy. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:10468-10478. [PMID: 33878010 DOI: 10.1109/tcyb.2021.3070080] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Based on the fact that the traditional probability distribution entropy describing a local feature of the system cannot effectively capture the global topology variations of the network, some indicators constructed by the network adjacency matrix and Laplacian matrix come into being. Specifically, these measures are based on the eigenvalues of the scaled Laplace matrix, the eigenvalues of the network communicability matrix, and the spectral entropy based on information diffusion that has been proposed recently, respectively. In this article, we systematically study the dependence of these measures on the topological structure of the network. We prove from various aspects that spectral entropy has a better ability to identify the global topology than the traditional distribution entropy. Furthermore, the indicator based on the eigenvalues of the network communicability matrix achieves good results in some aspects while, overall, the spectral entropy is able to identify network topology variations from a global perspective.
Collapse
|
4
|
Wang P, Wang D. Gene Differential Co-Expression Networks Based on RNA-Seq: Construction and Its Applications. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2829-2841. [PMID: 34383649 DOI: 10.1109/tcbb.2021.3103280] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Gene co-expression network (GCN) becomes an increasingly important tool in omics data analysis. A great challenge for GCN construction is that the sample size is far lower than the number of genes. Traditional methods rely on considerable samples. Moreover, association signals are likely weak, nonlinear and stochastic, which are difficult to be identified among thousands of candidates. In this paper, the gray correlation coefficient (GCC) is introduced, and a novel method to construct gene differential co-expression networks (GDCNs) is proposed. Based on the GDCNs, three measures are proposed to explore informative genes. The proposed method can make full use of the information provided by a handful of samples and overcome the shortages of GCNs, which can evaluate the changes of co-expression relationships that are possibly triggered by treatments. Based on RNA-seq data of Brassica napus, GDCNs under multiple experimental conditions are constructed and investigated. It is found that the GCC-based method is very robust to data processing. The GDCNs facilitate the inference of gene functions and the identification of informative genes that are responsible for stress responsiveness. The GDCN-based approaches integrate the 'guilt by association' and the 'guilt by rewiring' rules, which provide alternative tools for omics data analysis.
Collapse
|
5
|
Li C, Gao Z, Su B, Xu G, Lin X. Data analysis methods for defining biomarkers from omics data. Anal Bioanal Chem 2021; 414:235-250. [PMID: 34951658 DOI: 10.1007/s00216-021-03813-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 11/26/2021] [Accepted: 11/29/2021] [Indexed: 02/01/2023]
Abstract
Omics mainly includes genomics, epigenomics, transcriptomics, proteomics and metabolomics. The rapid development of omics technology has opened up new ways to study disease diagnosis and prognosis and to define prospective information of complex diseases. Since omics data are usually large and complex, the method used to analyze the data and to define important information is crucial in omics study. In this review, we focus on advances in biomarker discovery methods based on omics data in the last decade, and categorize them as individual feature analysis, combinatorial feature analysis and network analysis. We also discuss the challenges and perspectives in this field.
Collapse
Affiliation(s)
- Chao Li
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, Liaoning, China
| | - Zhenbo Gao
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China
| | - Benzhe Su
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, Liaoning, China
| | - Xiaohui Lin
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China.
| |
Collapse
|
6
|
Wang P. Statistical Identification of Important Nodes in Biological Systems. JOURNAL OF SYSTEMS SCIENCE AND COMPLEXITY 2021; 34:1454-1470. [PMID: 34393461 PMCID: PMC8353063 DOI: 10.1007/s11424-020-0013-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 05/23/2020] [Indexed: 06/13/2023]
Abstract
Biological systems can be modeled and described by biological networks. Biological networks are typical complex networks with widely real-world applications. Many problems arising in biological systems can be boiled down to the identification of important nodes. For example, biomedical researchers frequently need to identify important genes that potentially leaded to disease phenotypes in animal and explore crucial genes that were responsible for stress responsiveness in plants. To facilitate the identification of important nodes in biological systems, one needs to know network structures or behavioral data of nodes (such as gene expression data). If network topology was known, various centrality measures can be developed to solve the problem; while if only behavioral data of nodes were given, some sophisticated statistical methods can be employed. This paper reviewed some of the recent works on statistical identification of important nodes in biological systems from three aspects, that is, 1) in general complex networks based on complex networks theory and epidemic dynamic models; 2) in biological networks based on network motifs; and 3) in plants based on RNA-seq data. The identification of important nodes in a complex system can be seen as a mapping from the system to the ranking score vector of nodes, such mapping is not necessarily with explicit form. The three aspects reflected three typical approaches on ranking nodes in biological systems and can be integrated into one general framework. This paper also proposed some challenges and future works on the related topics. The associated investigations have potential real-world applications in the control of biological systems, network medicine and new variety cultivation of crops.
Collapse
Affiliation(s)
- Pei Wang
- School of Mathematics and Statistics, Institute of Applied Mathematics, Laboratory of Data Analysis Technology, Henan University, Kaifeng, 475004 China
| |
Collapse
|
7
|
Zhao TF, Chen WN, Kwong S, Gu TL, Yuan HQ, Zhang J, Zhang J. Evolutionary Divide-and-Conquer Algorithm for Virus Spreading Control Over Networks. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:3752-3766. [PMID: 32175884 DOI: 10.1109/tcyb.2020.2975530] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The control of virus spreading over complex networks with a limited budget has attracted much attention but remains challenging. This article aims at addressing the combinatorial, discrete resource allocation problems (RAPs) in virus spreading control. To meet the challenges of increasing network scales and improve the solving efficiency, an evolutionary divide-and-conquer algorithm is proposed, namely, a coevolutionary algorithm with network-community-based decomposition (NCD-CEA). It is characterized by the community-based dividing technique and cooperative coevolution conquering thought. First, to reduce the time complexity, NCD-CEA divides a network into multiple communities by a modified community detection method such that the most relevant variables in the solution space are clustered together. The problem and the global swarm are subsequently decomposed into subproblems and subswarms with low-dimensional embeddings. Second, to obtain high-quality solutions, an alternative evolutionary approach is designed by promoting the evolution of subswarms and the global swarm, in turn, with subsolutions evaluated by local fitness functions and global solutions evaluated by a global fitness function. Extensive experiments on different networks show that NCD-CEA has a competitive performance in solving RAPs. This article advances toward controlling virus spreading over large-scale networks.
Collapse
|
8
|
Wang Z, Xia C, Chen Z, Chen G. Epidemic Propagation With Positive and Negative Preventive Information in Multiplex Networks. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:1454-1462. [PMID: 31940584 DOI: 10.1109/tcyb.2019.2960605] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
We propose a novel epidemic model based on two-layered multiplex networks to explore the influence of positive and negative preventive information on epidemic propagation. In the model, one layer represents a social network with positive and negative preventive information spreading competitively, while the other one denotes the physical contact network with epidemic propagation. The individuals who are aware of positive prevention will take more effective measures to avoid being infected than those who are aware of negative prevention. Taking the microscopic Markov chain (MMC) approach, we analytically derive the expression of the epidemic threshold for the proposed epidemic model, which indicates that the diffusion of positive and negative prevention information, as well as the topology of the physical contact network have a significant impact on the epidemic threshold. By comparing the results obtained with MMC and those with the Monte Carlo (MC) simulations, it is found that they are in good agreement, but MMC can well describe the dynamics of the proposed model. Meanwhile, through extensive simulations, we demonstrate the impact of positive and negative preventive information on the epidemic threshold, as well as the prevalence of infectious diseases. We also find that the epidemic prevalence and the epidemic outbreaks can be suppressed by the diffusion of positive preventive information and be promoted by the diffusion of negative preventive information.
Collapse
|
9
|
Wang P. Statistical Identification of Important Nodes in Biological Systems. JOURNAL OF SYSTEMS SCIENCE AND COMPLEXITY 2021:1-17. [PMID: 33456274 PMCID: PMC7801784 DOI: 10.1007/s11424-021-0001-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 05/23/2020] [Indexed: 05/08/2023]
Abstract
Biological systems can be modeled and described by biological networks. Biological networks are typical complex networks with widely real-world applications. Many problems arising in biological systems can be boiled down to the identification of important nodes. For example, biomedical researchers frequently need to identify important genes that potentially leaded to disease phenotypes in animal and explore crucial genes that were responsible for stress responsiveness in plants. To facilitate the identification of important nodes in biological systems, one needs to know network structures or behavioral data of nodes (such as gene expression data). If network topology was known, various centrality measures can be developed to solve the problem; while if only behavioral data of nodes were given, some sophisticated statistical methods can be employed. This paper reviewed some of the recent works on statistical identification of important nodes in biological systems from three aspects, that is, 1) in general complex networks based on complex networks theory and epidemic dynamic models; 2) in biological networks based on network motifs; and 3) in plants based on RNA-seq data. The identification of important nodes in a complex system can be seen as a mapping from the system to the ranking score vector of nodes, such mapping is not necessarily with explicit form. The three aspects reflected three typical approaches on ranking nodes in biological systems and can be integrated into one general framework. This paper also proposed some challenges and future works on the related topics. The associated investigations have potential real-world applications in the control of biological systems, network medicine and new variety cultivation of crops.
Collapse
Affiliation(s)
- Pei Wang
- School of Mathematics and Statistics, Institute of Applied Mathematics, Laboratory of Data Analysis Technology, Henan University, Kaifeng, 475004 China
| |
Collapse
|
10
|
Xu S, Zhang C, Wang P, Zhang J. Variational Bayesian weighted complex network reconstruction. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.02.050] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
11
|
Wang P, Lu JA, Jin Y, Zhu M, Wang L, Chen S. Statistical and network analysis of 1212 COVID-19 patients in Henan, China. Int J Infect Dis 2020; 95:391-398. [PMID: 32339715 PMCID: PMC7180361 DOI: 10.1016/j.ijid.2020.04.051] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 04/15/2020] [Accepted: 04/18/2020] [Indexed: 11/24/2022] Open
Abstract
Almost all currently infected COVID-19 patients in Henan province were analyzed. COVID-19 patients in Henan province show gender and age preferences, migrant workers or college students are at high risk. The incubation period was statistically estimated. The state transition diagram can reveal the time-phased nature of the COVID-19 epidemic. Network analysis reveals the aggregate outbreak phenomena of COVID-19.
Background COVID-19 is spreading quickly all over the world. Publicly released data for 1212 COVID-19 patients in Henan of China were analyzed in this paper. Methods Various statistical and network analysis methods were employed. Results We found that COVID-19 patients show gender (55% vs 45%) and age (81% aged between 21 and 60) preferences; possible causes were explored. The estimated average, mode and median incubation periods are 7.4, 4 and 7 days. Incubation periods of 92% of patients were no more than 14 days. The epidemic in Henan has undergone three stages and has shown high correlations with the numbers of patients recently returned from Wuhan. Network analysis revealed that 208 cases were clustering infected, and various People's Hospitals are the main force in treating COVID-19. Conclusions The incubation period was statistically estimated, and the proposed state transition diagram can explore the epidemic stages of emerging infectious disease. We suggest that although the quarantine measures are gradually working, strong measures still might be needed for a period of time, since ∼7.45% of patients may have very long incubation periods. Migrant workers or college students are at high risk. State transition diagrams can help us to recognize the time-phased nature of the epidemic. Our investigations have implications for the prevention and control of COVID-19 in other regions of the world.
Collapse
Affiliation(s)
- Pei Wang
- School of Mathematics and Statistics, Henan University, Kaifeng, 475004, China; Institute of Applied Mathematics, Henan University, Kaifeng, 475004, China; Laboratory of Data Analysis Technology, Henan University, 475004, Kaifeng, China.
| | - Jun-An Lu
- School of Mathematics and Statistics, Wuhan University, Wuhan, 430070, China.
| | - Yanyu Jin
- School of Mathematics and Statistics, Henan University, Kaifeng, 475004, China
| | - Mengfan Zhu
- School of Mathematics and Statistics, Zhongnan University of Economics and Law, Wuhan, 430073, China
| | - Lingling Wang
- School of Mathematics and Statistics, Henan University, Kaifeng, 475004, China
| | - Shunjie Chen
- School of Mathematics and Statistics, Henan University, Kaifeng, 475004, China
| |
Collapse
|
12
|
Liu C, Wu X, Niu R, Wu X, Fan R. A new SAIR model on complex networks for analysing the 2019 novel coronavirus (COVID-19). NONLINEAR DYNAMICS 2020; 101:1777-1787. [PMID: 32836802 PMCID: PMC7299147 DOI: 10.1007/s11071-020-05704-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Accepted: 05/17/2020] [Indexed: 05/04/2023]
Abstract
Nowadays, the novel coronavirus (COVID-19) is spreading around the world and has attracted extremely wide public attention. From the beginning of the outbreak to now, there have been many mathematical models proposed to describe the spread of the pandemic, and most of them are established with the assumption that people contact with each other in a homogeneous pattern. However, owing to the difference of individuals in reality, social contact is usually heterogeneous, and the models on homogeneous networks cannot accurately describe the outbreak. Thus, we propose a susceptible-asymptomatic-infected-removed (SAIR) model on social networks to describe the spread of COVID-19 and analyse the outbreak based on the epidemic data of Wuhan from January 24 to March 2. Then, according to the results of the simulations, we discover that the measures that can curb the spread of COVID-19 include increasing the recovery rate and the removed rate, cutting off connections between symptomatically infected individuals and their neighbours, and cutting off connections between hub nodes and their neighbours. The feasible measures proposed in the paper are in fair agreement with the measures that the government took to suppress the outbreak. Furthermore, effective measures should be carried out immediately, otherwise the pandemic would spread more rapidly and last longer. In addition, we use the epidemic data of Wuhan from January 24 to March 2 to analyse the outbreak in the city and explain why the number of the infected rose in the early stage of the outbreak though a total lockdown was implemented. Moreover, besides the above measures, a feasible way to curb the spread of COVID-19 is to reduce the density of social networks, such as restricting mobility and decreasing in-person social contacts. This work provides a series of effective measures, which can facilitate the selection of appropriate approaches for controlling the spread of the COVID-19 pandemic to mitigate its adverse impact on people's livelihood, societies and economies.
Collapse
Affiliation(s)
- Congying Liu
- School of Mathematics and Statistics, Wuhan University, Hubei, 430072 China
| | - Xiaoqun Wu
- School of Mathematics and Statistics, Wuhan University, Hubei, 430072 China
- Hubei Key Laboratory of Computational Science, Wuhan University, Hubei, 430072 China
| | - Riuwu Niu
- College of Mathematics and Statistics, Shenzhen University, Shenzhen, 518060 China
| | - Xiuqi Wu
- School of Mathematics and Statistics, Wuhan University, Hubei, 430072 China
| | - Ruguo Fan
- School of Economics and Management, Wuhan University, Hubei, 430072 China
| |
Collapse
|
13
|
Abstract
Identifying critical components are of great significance to the overall reliability of service-oriented systems (SOSs). As the size of the SOS increases, identifying critical components in the process of predicting the SOS reliability can reduce the number of components that need to be predicted and shorten the prediction time. Moreover, predicting the reliability of critical components can also ensure the stability of the SOS. Therefore, we transform the relationships among service components of the SOS into a service dependency graph. Then, an improved weighted LeaderRank algorithm (IW-LeaderRank) is proposed to measure the importance of components and obtain the sequence of critical components. Through experimental analysis, the method can accurately and efficiently identify critical components in SOSs.
Collapse
|
14
|
Wang P, Yang C, Chen H, Luo L, Leng Q, Li S, Han Z, Li X, Song C, Zhang X, Wang D. Exploring transcription factors reveals crucial members and regulatory networks involved in different abiotic stresses in Brassica napus L. BMC PLANT BIOLOGY 2018; 18:202. [PMID: 30231862 PMCID: PMC6146658 DOI: 10.1186/s12870-018-1417-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 09/05/2018] [Indexed: 05/04/2023]
Abstract
BACKGROUND Brassica napus (B. napus) encompasses diverse transcription factors (TFs), but thorough identification and characterization of TF families, as well as their transcriptional responsiveness to multifarious stresses are still not clear. RESULTS Totally 2167 TFs belonging to five families were genome-widely identified in B. napus, including 518 BnAP2/EREBPs, 252 BnbZIPs, 721 BnMYBs, 398 BnNACs and 278 BnWRKYs, which contained some novel members in comparison with existing results. Sub-genome distributions of BnAP2/EREBPs and BnMYBs indicated that the two families might have suffered from duplication and divergence during evolution. Synteny analysis revealed strong co-linearity between B. napus and its two ancestors, although chromosomal rearrangements have occurred and 85 TFs were lost. About 7.6% and 9.4% TFs of the five families in B. napus were novel genes and conserved genes, which both showed preference on the C sub-genome. RNA-Seq revealed that more than 80% TFs were abiotic stress inducible and 315 crucial differentially expressed genes (DEGs) were screened out. Network analysis revealed that the 315 DEGs are highly co-expressed. The homologous gene network in A. thaliana revealed that a considerable amount of TFs could trigger the differential expression of targeted genes, resulting in a complex clustered network with clusters of genes responsible for targeted stress responsiveness. CONCLUSIONS We identified and characterized five TF families in B. napus. Some crucial members and regulatory networks involved in different abiotic stresses have been explored. The investigations deepen our understanding of TFs for stress tolerance in B. napus.
Collapse
Affiliation(s)
- Pei Wang
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Cuiling Yang
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Hao Chen
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Longhai Luo
- Beijing igeneCode Biotech Co.,Ltd, Changping District Xisanqi Center for the Olympic Century, Beijing, 100096 China
| | - Qiuli Leng
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Shicong Li
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Zujing Han
- Beijing igeneCode Biotech Co.,Ltd, Changping District Xisanqi Center for the Olympic Century, Beijing, 100096 China
| | - Xinchun Li
- Beijing igeneCode Biotech Co.,Ltd, Changping District Xisanqi Center for the Olympic Century, Beijing, 100096 China
| | - Chunpeng Song
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Xiao Zhang
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| | - Daojie Wang
- Key Laboratory of Plant Stress Biology; School of Mathematics and Statistics; State Key Laboratory of Cotton Biology; College of Life Sciences; Institute of Applied Mathematics; Laboratory of Data Analysis Technology; Henan University, Kaifeng, Henan, 475004, China, Jinming avenue, Kaifeng, 475004 China
| |
Collapse
|