1
Chen D, Ma Y, Xiao H, Yan Z. Development trends of etiological research contents and methods of noncommunicable diseases. Health Care Science 2023; 2:352-357. [PMID: 38938587 PMCID: PMC11080801 DOI: 10.1002/hcs2.69] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2023] [Accepted: 07/26/2023] [Indexed: 06/29/2024]
Affiliation(s)
- Dafang Chen
  - Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
  - Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Peking University, Beijing, China
- Yujia Ma
  - Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
  - Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Peking University, Beijing, China
- Han Xiao
  - Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
  - Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Peking University, Beijing, China
- Zeyu Yan
  - Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
  - Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Peking University, Beijing, China
2
Jin X, Zhang L, Ji J, Ju T, Zhao J, Yuan Z. Network regression analysis in transcriptome-wide association studies. BMC Genomics 2022; 23:562. [PMID: 35933330 PMCID: PMC9356418 DOI: 10.1186/s12864-022-08809-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/12/2022] [Accepted: 08/02/2022] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND: Transcriptome-wide association studies (TWASs) have shown great promise in interpreting the findings from genome-wide association studies (GWASs) and exploring disease mechanisms by integrating GWAS and eQTL mapping studies. Almost all TWAS methods focus on one gene at a time; the only two published multiple-gene methods fail to account for the inter-dependence and the network structure among multiple genes. This may lead to power loss in TWAS analysis, as complex diseases often result from multiple genes that interact with each other as a biological network. We therefore developed a Network Regression method in a two-stage TWAS framework (NeRiT) to detect whether a given network is associated with the traits of interest. NeRiT adopts the flexible Bayesian Dirichlet process regression to obtain the gene expression prediction weights in the first stage, uses pointwise mutual information to represent the general between-node correlation in the second stage, and can effectively take the network structure among different gene nodes into account. RESULTS: Comprehensive and realistic simulations indicated that NeRiT had calibrated type I error control for testing both the node effect and the edge effect, and yielded higher power than existing methods, especially in testing the edge effect. The results were consistent regardless of the GWAS sample size, the gene expression prediction model in the first step of TWAS, the network structure, and the correlation pattern among different gene nodes. Real data applications analyzing systolic blood pressure and diastolic blood pressure from UK Biobank showed that NeRiT can simultaneously identify the trait-related nodes as well as the trait-related edges. CONCLUSIONS: NeRiT is a powerful and efficient network regression method in TWAS.
Affiliation(s)
- Xiuyuan Jin
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China
  - Institute for Medical Dataology, Shandong University, Jinan, 250003, Shandong, China
- Liye Zhang
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China
  - Institute for Medical Dataology, Shandong University, Jinan, 250003, Shandong, China
- Jiadong Ji
  - Institute for Financial Studies, Shandong University, Jinan, 250100, Shandong, China
- Tao Ju
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China
  - Institute for Medical Dataology, Shandong University, Jinan, 250003, Shandong, China
- Jinghua Zhao
  - Department of Public Health and Primary Care, Cardiovascular Epidemiology Unit, University of Cambridge, Cambridge, UK
- Zhongshang Yuan
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China
  - Institute for Medical Dataology, Shandong University, Jinan, 250003, Shandong, China
3
Chen H, Guo Y, He Y, Ji J, Liu L, Shi Y, Wang Y, Yu L, Zhang X. Simultaneous differential network analysis and classification for matrix-variate data with application to brain connectivity. Biostatistics 2022; 23:967-989. [PMID: 33769450 PMCID: PMC9295187 DOI: 10.1093/biostatistics/kxab007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/04/2020] [Revised: 02/20/2021] [Accepted: 02/22/2021] [Indexed: 01/03/2023] Open
Abstract
Growing evidence has shown that the brain connectivity network experiences alterations for complex diseases such as Alzheimer's disease (AD). Network comparison, also known as differential network analysis, is thus particularly powerful to reveal the disease pathologies and identify clinical biomarkers for medical diagnoses (classification). Data from neurophysiological measurements are multidimensional and in matrix form. The naive vectorization method is not sufficient, as it ignores the structural information within the matrix. In this article, we adopt the Kronecker product covariance matrices framework to capture both spatial and temporal correlations of the matrix-variate data, while the temporal covariance matrix is treated as a nuisance parameter. By recognizing that the strengths of network connections may vary across subjects, we develop an ensemble-learning procedure, which identifies the differential interaction patterns of brain regions between the case group and the control group and conducts medical diagnosis (classification) of the disease simultaneously. Simulation studies are conducted to assess the performance of the proposed method. We apply the proposed procedure to the functional connectivity analysis of a functional magnetic resonance imaging study on AD. The hub nodes and differential interaction patterns identified are consistent with existing experimental studies, and satisfactory out-of-sample classification performance is achieved for medical diagnosis of AD.
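As a rough illustration of the Kronecker product covariance idea named in the abstract above, the sketch below builds the covariance of a vectorized regions-by-time matrix from a spatial and a temporal covariance. It assumes a separable covariance and column-major vectorization; the function names and the numeric values are hypothetical and are not taken from the paper.

```python
import numpy as np

def kronecker_covariance(sigma_spatial, sigma_temporal):
    """Covariance of vec(X) for a p x q matrix X (regions x time points).

    Assumes column-major vec(X) and a separable covariance, so that
    Cov(vec(X)) = sigma_temporal (Kronecker product) sigma_spatial.
    """
    return np.kron(sigma_temporal, sigma_spatial)

def entry_covariance(full_cov, p, i, s, j, t):
    """Read Cov(X[i, s], X[j, t]) from the covariance of column-major vec(X)."""
    return full_cov[s * p + i, t * p + j]

# Illustrative values only (hypothetical, not from the paper).
sigma_spatial = np.array([[1.0, 0.3],
                          [0.3, 2.0]])            # p = 2 brain regions
sigma_temporal = np.array([[1.0, 0.5, 0.25],
                           [0.5, 1.0, 0.5],
                           [0.25, 0.5, 1.0]])     # q = 3 time points
full_cov = kronecker_covariance(sigma_spatial, sigma_temporal)
```

Under this assumption each entry covariance factorizes as the product of one spatial and one temporal term, which is the structure that plain vectorization would ignore.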
Affiliation(s)
- Hao Chen
  - School of Statistics, Shandong University of Finance and Economics, Jinan, 250014, China
- Ying Guo
  - Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA
- Yong He
  - Institute for Financial Studies, Shandong University, Jinan, 250100, China
- Jiadong Ji
  - Institute for Financial Studies, Shandong University, Jinan, 250100, China
- Lei Liu
  - Division of Biostatistics, Washington University in St. Louis, St. Louis, MO 63110, USA
- Yufeng Shi
  - Institute for Financial Studies, Shandong University, Jinan, 250100, China
- Yikai Wang
  - Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA
- Long Yu
  - Department of Statistics, School of Management, Fudan University, Shanghai, 200433, China
- Xinsheng Zhang
  - Department of Statistics, School of Management, Fudan University, Shanghai, 200433, China
4
Fan Y, Kao C, Yang F, Wang F, Yin G, Wang Y, He Y, Ji J, Liu L. Integrated Multi-Omics Analysis Model to Identify Biomarkers Associated With Prognosis of Breast Cancer. Front Oncol 2022; 12:899900. [PMID: 35761863 PMCID: PMC9232398 DOI: 10.3389/fonc.2022.899900] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 03/19/2022] [Accepted: 05/12/2022] [Indexed: 12/13/2022] Open
Abstract
Background: With the rapid development and wide application of high-throughput sequencing technology, biomedical research has entered the era of large-scale omics data. We aim to identify genes associated with breast cancer prognosis by integrating multi-omics data. Method: Gene-gene interactions were taken into account, and we applied two differential network methods, JDINAC and LGCDG, to identify differential genes. The patients were divided into case and control groups according to their survival time. The TCGA and METABRIC databases were used as the training and validation sets, respectively. Result: In the TCGA dataset, C11orf1, OLA1, RPL31, SPDL1 and IL33 were identified to be associated with prognosis of breast cancer. In the METABRIC database, ZNF273, ZBTB37, TRIM52, TSGA10, ZNF727, TRAF2, TSPAN17, USP28 and ZNF519 were identified as hub genes. In addition, RPL31, TMEM163 and ZNF273 were screened out in both datasets. GO enrichment analysis shows that most of these hub genes were involved in zinc ion binding. Conclusion: In this study, a total of 15 hub genes associated with long-term survival of breast cancer were identified, which can promote understanding of the molecular mechanism of breast cancer and provide new insight into clinical research and treatment.
Affiliation(s)
- Yeye Fan
  - School of Mathematics, Shandong University, Jinan, China
- Chunyu Kao
  - Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, China
- Fu Yang
  - Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, China
- Fei Wang
  - Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
  - Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, China
- Gengshen Yin
  - Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
  - Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, China
- Yongjiu Wang
  - Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
  - Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, China
- Yong He
  - School of Mathematics, Shandong University, Jinan, China
  - Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, China
- Jiadong Ji
  - Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, China
- Liyuan Liu
  - School of Mathematics, Shandong University, Jinan, China
  - Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
  - Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, China
5
Ji J, He Y, Liu L, Xie L. Brain connectivity alteration detection via matrix-variate differential network model. Biometrics 2021; 77:1409-1421. [PMID: 32829503 PMCID: PMC7900256 DOI: 10.1111/biom.13359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/17/2019] [Revised: 08/10/2020] [Accepted: 08/14/2020] [Indexed: 10/23/2022]
Abstract
Brain functional connectivity reveals the synchronization of brain systems through correlations in neurophysiological measures of brain activities. Growing evidence now suggests that the brain connectivity network experiences alterations with the presence of numerous neurological disorders, thus differential brain network analysis may provide new insights into disease pathologies. The data from neurophysiological measurement are often multidimensional and in a matrix form, posing a challenge in brain connectivity analysis. Existing graphical model estimation methods either assume a vector normal distribution that in essence requires the columns of the matrix data to be independent or fail to address the estimation of differential networks across different populations. To tackle these issues, we propose an innovative matrix-variate differential network (MVDN) model. We exploit the D-trace loss function and a Lasso-type penalty to directly estimate the spatial differential partial correlation matrix and use an alternating direction method of multipliers algorithm for the optimization problem. Theoretical and simulation studies demonstrate that MVDN significantly outperforms other state-of-the-art methods in dynamic differential network analysis. We illustrate with a functional connectivity analysis of an attention deficit hyperactivity disorder dataset. The hub nodes and differential interaction patterns identified are consistent with existing experimental studies.
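The D-trace loss with a Lasso-type penalty mentioned in the abstract above can be sketched in its generic form for a difference of two precision matrices. The abstract does not give the MVDN formulation, so the code below is an assumption based on how the loss is commonly written: L(Δ) = ½ tr(Δ Σ1 Δ Σ2) − tr(Δ (Σ1 − Σ2)), whose unpenalized minimizer is Δ = Σ2⁻¹ − Σ1⁻¹. The function names are hypothetical, and the ADMM solver used in the paper is not implemented here; only the loss, its gradient, and the soft-thresholding step associated with an L1 penalty are shown.

```python
import numpy as np

def d_trace_loss(delta, sigma1, sigma2):
    """Generic D-trace loss for a symmetric difference matrix delta (assumed form)."""
    return 0.5 * np.trace(delta @ sigma1 @ delta @ sigma2) - np.trace(delta @ (sigma1 - sigma2))

def d_trace_gradient(delta, sigma1, sigma2):
    """Gradient of the loss above with respect to a symmetric delta."""
    return 0.5 * (sigma1 @ delta @ sigma2 + sigma2 @ delta @ sigma1) - (sigma1 - sigma2)

def soft_threshold(a, lam):
    """Elementwise soft-thresholding, the proximal operator of lam * ||a||_1."""
    return np.sign(a) * np.maximum(np.abs(a) - lam, 0.0)

# Illustrative covariance matrices (hypothetical values).
sigma1 = np.array([[2.0, 0.5], [0.5, 1.0]])
sigma2 = np.array([[1.0, 0.2], [0.2, 1.5]])

# Without a penalty, the gradient vanishes at the difference of precision matrices.
delta_star = np.linalg.inv(sigma2) - np.linalg.inv(sigma1)
gradient_at_optimum = d_trace_gradient(delta_star, sigma1, sigma2)
```

The appeal of this kind of loss is that the difference matrix is estimated directly, so the two individual precision matrices never need to be estimated or assumed sparse.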
Affiliation(s)
- Jiadong Ji
  - School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Yong He
  - Institute for Financial Studies, Shandong University, Jinan, China
- Lei Liu
  - Division of Biostatistics, Washington University in St. Louis, U.S.A
- Lei Xie
  - The Graduate Center, The City University of New York, New York, 10016, U.S.A
  - Department of Computer Science, Hunter College, The City University of New York, New York, 10065, U.S.A
6
Lin W, Ji J, Zhu Y, Li M, Zhao J, Xue F, Yuan Z. PMINR: Pointwise Mutual Information-Based Network Regression - With Application to Studies of Lung Cancer and Alzheimer's Disease. Front Genet 2020; 11:556259. [PMID: 33193633 PMCID: PMC7594515 DOI: 10.3389/fgene.2020.556259] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 04/27/2020] [Accepted: 08/12/2020] [Indexed: 11/13/2022] Open
Abstract
Complex diseases are believed to be the consequence of intracellular network(s) involving a range of factors. An improved understanding of a disease-predisposing biological network could lead to better identification of genes and pathways that confer disease risk and therefore inform drug development. The group difference in biological networks, as is often characterized by graphs of nodes and edges, is attributable to effects of these nodes and edges. Here we introduced pointwise mutual information (PMI) as a measure of the connection between a pair of nodes with either a linear relationship or nonlinear dependence. We then proposed a PMI-based network regression (PMINR) model to differentiate patterns of network changes (in node or edge) linking a disease outcome. Through simulation studies with various sample sizes and inter-node correlation structures, we showed that PMINR can accurately identify these changes with higher power than current methods and be robust to the network topology. Finally, we illustrated, with publicly available data on lung cancer and gene methylation data on aging and Alzheimer’s disease, an evaluation of the practical performance of PMINR. We concluded that PMI is able to capture the generic inter-node correlation pattern in biological networks, and PMINR is a powerful and efficient approach for biological network analysis.
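To make the pointwise mutual information measure in the abstract above concrete, here is a minimal sketch that computes a per-sample PMI, log p(x, y) / (p(x) p(y)), for a pair of nodes. The abstract does not say how the densities are estimated, so a simple two-dimensional histogram is assumed here; the function name and the `bins` parameter are hypothetical.

```python
import numpy as np

def pointwise_mutual_information(x, y, bins=4):
    """Per-sample PMI between two nodes, using a 2-D histogram density estimate.

    The histogram estimator is an assumption made for illustration only.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    joint, x_edges, y_edges = np.histogram2d(x, y, bins=bins)
    joint = joint / joint.sum()      # joint probabilities p(x, y) per cell
    px = joint.sum(axis=1)           # marginal probabilities of x bins
    py = joint.sum(axis=0)           # marginal probabilities of y bins
    # Bin index of every sample, using the interior edges of the histogram.
    xi = np.clip(np.digitize(x, x_edges[1:-1]), 0, bins - 1)
    yi = np.clip(np.digitize(y, y_edges[1:-1]), 0, bins - 1)
    return np.log(joint[xi, yi] / (px[xi] * py[yi]))
```

Averaging the per-sample values gives the plug-in mutual information for the same histogram, which reflects linear as well as nonlinear dependence between the two nodes.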
Affiliation(s)
- Weiqiang Lin
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, China
- Jiadong Ji
  - Department of Data Science, School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Yuchen Zhu
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, China
- Mingzhuo Li
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, China
- Jinghua Zhao
  - Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- Fuzhong Xue
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, China
- Zhongshang Yuan
  - Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, China
7
Chen H, He Y, Ji J, Shi Y. A Machine Learning Method for Identifying Critical Interactions Between Gene Pairs in Alzheimer's Disease Prediction. Front Neurol 2019; 10:1162. [PMID: 31736866 PMCID: PMC6834789 DOI: 10.3389/fneur.2019.01162] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 08/29/2019] [Accepted: 10/15/2019] [Indexed: 12/26/2022] Open
Abstract
Background: Alzheimer's disease (AD) is the most common type of dementia. Scientists have discovered that the causes of AD may include a combination of genetic, lifestyle, and environmental factors, but the exact cause has not yet been elucidated. Effective strategies to prevent and treat AD therefore remain elusive. The identified genetic causes of AD mainly focus on individual genes, but growing evidence has shown that complex diseases are usually affected by the interaction of genes in a network. Few studies have focused on the interactions and correlations between genes and how they are gradually destroyed or disappear during AD progression. Differential network analysis has been recognized as an essential tool for identifying the underlying pathogenic mechanisms and significant genes for prediction analysis. We therefore aim to conduct a differential network analysis to reveal potential networks involved in the neuropathogenesis of AD and identify genes for AD prediction. Methods: In this paper, we selected 365 samples from the Religious Orders Study and the Rush Memory and Aging Project, including 193 clinically and neuropathologically confirmed AD subjects and 172 no cognitive impairment (NCI) controls. Then, we selected 158 genes belonging to the AD pathway (hsa05010) of the Kyoto Encyclopedia of Genes and Genomes. We employed a machine learning method, namely, joint density-based non-parametric differential interaction network analysis and classification (JDINAC), in the analysis of gene expression data (RNA-seq data). We searched for the differential networks in the RNA-seq data with a pathological diagnosis of AD. Finally, an optimal prediction model was built through cross-validation, which showed good discrimination and calibration for AD prediction. Results: We used JDINAC to derive a gene co-expression network and to explore the relationship between the interaction of gene pairs and AD, and the top 10 differential gene pairs were identified. We then compared the prediction performance of JDINAC with that of prediction methods based on individual genes. JDINAC provides better accuracy of classification than the latest methods, such as random forest and penalized logistic regression. Conclusions: The interaction between gene pairs is related to AD and can provide more insight than the individual genes in AD prediction.
Affiliation(s)
- Hao Chen
  - School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Yong He
  - School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Jiadong Ji
  - School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Yufeng Shi
  - School of Statistics, Shandong University of Finance and Economics, Jinan, China
  - Institute for Financial Studies and School of Mathematics, Shandong University, Jinan, China
8
He Y, Ji J, Xie L, Zhang X, Xue F. A new insight into underlying disease mechanism through semi-parametric latent differential network model. BMC Bioinformatics 2018; 19:493. [PMID: 30591011 PMCID: PMC6309076 DOI: 10.1186/s12859-018-2461-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND: In genomic studies, investigating how the structure of a genetic network differs between two experimental conditions is a very interesting but challenging problem, especially in the high-dimensional setting. The existing literature mostly focuses on differential network modelling for continuous data. However, in real applications, we may encounter discrete data or mixed data, which motivates a unified differential network model for various data types. RESULTS: We propose a unified latent Gaussian copula differential network model which provides a deeper understanding of the unknown mechanism than a model of the observed variables alone. Adaptive rank-based estimation approaches are proposed with the assumption that the true differential network is sparse. The adaptive estimation approaches do not require the precision matrices to be sparse, and thus can allow the individual networks to contain hub nodes. Theoretical analysis shows that the proposed methods achieve the same parametric convergence rate for both the estimation of the difference of the precision matrices and differential structure recovery, which means that the extra modeling flexibility comes at almost no cost of statistical efficiency. Besides theoretical analysis, thorough numerical simulations are conducted to compare the empirical performance of the proposed methods with some other state-of-the-art methods. The results show that the proposed methods work quite well for various data types. The proposed method is then applied to gene expression data associated with lung cancer to illustrate its empirical usefulness. CONCLUSIONS: The proposed latent variable differential network models allow for various data types and thus are more flexible; they also provide a deeper understanding of the unknown mechanism than a model of the observed variables alone. Theoretical analysis, numerical simulation and real application all demonstrate the great advantages of latent differential network modelling, which is thus highly recommended.
Affiliation(s)
- Yong He
  - School of Statistics, Shandong University of Finance and Economics, Jinan, 250014 China
- Jiadong Ji
  - School of Statistics, Shandong University of Finance and Economics, Jinan, 250014 China
- Lei Xie
  - Department of Computer Science, Hunter College, The City University of New York, New York, 10065 USA
  - Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, 10016 USA
- Xinsheng Zhang
  - School of Management, Fudan University, Shanghai, 200433 China
- Fuzhong Xue
  - School of Public Health, Shandong University, Jinan, 250012 China
9
Ji J, He D, Feng Y, He Y, Xue F, Xie L. JDINAC: joint density-based non-parametric differential interaction network analysis and classification using high-dimensional sparse omics data. Bioinformatics 2018; 33:3080-3087. [PMID: 28582486 DOI: 10.1093/bioinformatics/btx360] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 09/14/2016] [Accepted: 06/01/2017] [Indexed: 12/26/2022] Open
Abstract
Motivation: A complex disease is usually driven by a number of genes interwoven into networks, rather than a single gene product. Network comparison or differential network analysis has become an important means of revealing the underlying mechanism of pathogenesis and identifying clinical biomarkers for disease classification. Most studies, however, are limited to network correlations that mainly capture the linear relationship among genes, or rely on the assumption of a parametric probability distribution of gene measurements. They are restrictive in real application. Results: We propose a new Joint density based non-parametric Differential Interaction Network Analysis and Classification (JDINAC) method to identify differential interaction patterns of network activation between two groups. At the same time, JDINAC uses the network biomarkers to build a classification model. The novelty of JDINAC lies in its potential to capture non-linear relations between molecular interactions using high-dimensional sparse data as well as to adjust confounding factors, without the need of the assumption of a parametric probability distribution of gene measurements. Simulation studies demonstrate that JDINAC provides more accurate differential network estimation and lower classification error than that achieved by other state-of-the-art methods. We apply JDINAC to a Breast Invasive Carcinoma dataset, which includes 114 patients who have both tumor and matched normal samples. The hub genes and differential interaction patterns identified were consistent with existing experimental studies. Furthermore, JDINAC discriminated the tumor and normal sample with high accuracy by virtue of the identified biomarkers. JDINAC provides a general framework for feature selection and classification using high-dimensional sparse omics data. Availability and implementation: R scripts available at https://github.com/jijiadong/JDINAC. Contact: lxie@iscb.org. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Jiadong Ji
  - Department of Mathematical Statistics, School of Statistics, Shandong University of Finance and Economics, Jinan 250014, China
- Di He
  - Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, NY 10016, USA
- Yang Feng
  - Department of Statistics, Columbia University, New York, NY 10027, USA
- Yong He
  - Department of Mathematical Statistics, School of Statistics, Shandong University of Finance and Economics, Jinan 250014, China
- Fuzhong Xue
  - Department of Biostatistics, School of Public Health, Shandong University, Jinan 250012, China
- Lei Xie
  - Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, NY 10016, USA
  - Department of Computer Science, Hunter College, The City University of New York, NY 10065, USA
10
Will T, Helms V. Rewiring of the inferred protein interactome during blood development studied with the tool PPICompare. BMC Systems Biology 2017; 11:44. [PMID: 28376810 PMCID: PMC5379774 DOI: 10.1186/s12918-017-0400-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 10/05/2016] [Accepted: 01/26/2017] [Indexed: 12/24/2022]
Abstract
BACKGROUND: Differential analysis of cellular conditions is a key approach towards understanding the consequences and driving causes behind biological processes such as developmental transitions or diseases. The progress of whole-genome expression profiling has made it possible to conveniently capture the state of a cell's transcriptome and to detect the characteristic features that distinguish cells in specific conditions. In contrast, mapping the physical protein interactome for many samples is experimentally infeasible at the moment. For the understanding of the whole system, however, it is equally important how the interactions of proteins are rewired between cellular states. To overcome this deficiency, we recently showed how condition-specific protein interaction networks that even consider alternative splicing can be inferred from transcript expression data. Here, we present the differential network analysis tool PPICompare that was specifically designed for isoform-sensitive protein interaction networks. RESULTS: Besides detecting significant rewiring events between the interactomes of grouped samples, PPICompare infers which alterations to the transcriptome caused each rewiring event and what the minimal set of alterations necessary to explain all between-group changes is. When applied to the development of blood cells, we verified that a reasonable number of rewiring events was reported by the tool and found that differential gene expression was the major determinant of cellular adjustments to the interactome. Alternative splicing events were consistently necessary in each developmental step to explain all significant alterations and were especially important for rewiring in the context of transcriptional control. CONCLUSIONS: Applying PPICompare enabled us to investigate the dynamics of the human protein interactome during developmental transitions. A platform-independent implementation of the tool PPICompare is available at https://sourceforge.net/projects/ppicompare/.
Affiliation(s)
- Thorsten Will
  - Center for Bioinformatics, Saarland University, Campus E2.1, Saarbrücken, 66123 Germany
  - Graduate School of Computer Science, Saarland University, Campus E1.3, Saarbrücken, 66123 Germany
- Volkhard Helms
  - Center for Bioinformatics, Saarland University, Campus E2.1, Saarbrücken, 66123 Germany
11
A powerful weighted statistic for detecting group differences of directed biological networks. Sci Rep 2016; 6:34159. [PMID: 27686331 PMCID: PMC5054825 DOI: 10.1038/srep34159] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Received: 05/04/2016] [Accepted: 09/08/2016] [Indexed: 12/15/2022] Open
Abstract
Complex disease is largely determined by a number of biomolecules interwoven into networks, rather than a single biomolecule. Different physiological conditions such as cases and controls may manifest as different networks. Statistical comparison between biological networks can provide not only new insight into the disease mechanism but also statistical guidance for drug development. However, the methods developed in previous studies are inadequate to capture the changes in both the nodes and edges, and often ignore the network structure. In this study, we present a powerful weighted statistical test for group differences of directed biological networks, which is independent of the network attributes and can capture the changes in both the nodes and edges, while simultaneously accounting for the network structure by putting more weight on differences at nodes located in relatively more important positions. Simulation studies illustrate that this method performs better than previous ones under various sample sizes and network structures. One application to GWAS of leprosy successfully identifies the specific gene interaction network contributing to leprosy. Another real data analysis identifies a new biological network significantly related to acute myeloid leukemia. One potential network responsible for lung cancer has also been significantly detected. The source R code is available on our website.