Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Neto EC, Keller MP, Attie AD, Yandell BS. CAUSAL GRAPHICAL MODELS IN SYSTEMS GENETICS: A UNIFIED FRAMEWORK FOR JOINT INFERENCE OF CAUSAL NETWORK AND GENETIC ARCHITECTURE FOR CORRELATED PHENOTYPES. Ann Appl Stat 2010;4:320-339. [PMID: 21218138 PMCID: PMC3017382 DOI: 10.1214/09-aoas288] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

For:	Neto EC, Keller MP, Attie AD, Yandell BS. CAUSAL GRAPHICAL MODELS IN SYSTEMS GENETICS: A UNIFIED FRAMEWORK FOR JOINT INFERENCE OF CAUSAL NETWORK AND GENETIC ARCHITECTURE FOR CORRELATED PHENOTYPES. Ann Appl Stat 2010;4:320-339. [PMID: 21218138 PMCID: PMC3017382 DOI: 10.1214/09-aoas288] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Number

Cited by Other Article(s)

Fei Y, Yu H, Wu Y, Gong S. The causal relationship between immune cells and ankylosing spondylitis: a bidirectional Mendelian randomization study. Arthritis Res Ther 2024;26:24. [PMID: 38229175 PMCID: PMC10790477 DOI: 10.1186/s13075-024-03266-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 01/09/2024] [Indexed: 01/18/2024] Open

Abstract

BACKGROUND

Ankylosing spondylitis (AS) is one of several disorders known as seronegative spinal arthritis (SpA), the origin of which is unknown. Existing epidemiological data show that inflammatory and immunological factors are important in the development of AS. Previous research on the connection between immunological inflammation and AS, however, has shown inconclusive results.

METHODS

To evaluate the causal association between immunological characteristics and AS, a bidirectional, two-sample Mendelian randomization (MR) approach was performed in this study. We investigated the causal connection between 731 immunological feature characteristic cells and AS risk using large, publically available genome-wide association studies.

RESULTS

After FDR correction, two immunophenotypes were found to be significantly associated with AS risk: CD14 - CD16 + monocyte (OR, 0.669; 95% CI, 0.544 ~ 0.823; P = 1.46 × 10-4; PFDR = 0.043), CD33dim HLA DR + CD11b + (OR, 0.589; 95% CI = 0.446 ~ 0.780; P = 2.12 × 10-4; PFDR = 0.043). AS had statistically significant effects on six immune traits: CD8 on HLA DR + CD8 + T cell (OR, 1.029; 95% CI, 1.015 ~ 1.043; P = 4.46 × 10-5; PFDR = 0.014), IgD on IgD + CD24 + B cell (OR, 0.973; 95% CI, 0.960 ~ 0.987; P = 1.2 × 10-4; PFDR = 0.021), IgD on IgD + CD38 - unswitched memory B cell (OR, 0.962; 95% CI, 0.945 ~ 0.980; P = 3.02 × 10-5; PFDR = 0.014), CD8 + natural killer T %lymphocyte (OR, 0.973; 95% CI, 0.959 ~ 0.987; P = 1.92 × 10-4; PFDR = 0.021), CD8 + natural killer T %T cell (OR, 0.973; 95% CI, 0.959 ~ 0.987; P = 1.65 × 10-4; PFDR = 0.021).

CONCLUSION

Our findings extend genetic research into the intimate link between immune cells and AS, which can help guide future clinical and basic research.

Collapse

Lovis C, Zhang K, Li C, Jiang X, Kim Y. Scalable Causal Structure Learning: Scoping Review of Traditional and Deep Learning Algorithms and New Opportunities in Biomedicine. JMIR Med Inform 2023;11:e38266. [PMID: 36649070 PMCID: PMC9890349 DOI: 10.2196/38266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 08/30/2022] [Accepted: 09/18/2022] [Indexed: 02/04/2023] Open

Abstract

BACKGROUND

Causal structure learning refers to a process of identifying causal structures from observational data, and it can have multiple applications in biomedicine and health care.

OBJECTIVE

This paper provides a practical review and tutorial on scalable causal structure learning models with examples of real-world data to help health care audiences understand and apply them.

METHODS

We reviewed traditional (combinatorial and score-based) methods for causal structure discovery and machine learning-based schemes. Various traditional approaches have been studied to tackle this problem, the most important among these being the Peter Spirtes and Clark Glymour algorithms. This was followed by analyzing the literature on score-based methods, which are computationally faster. Owing to the continuous constraint on acyclicity, there are new deep learning approaches to the problem in addition to traditional and score-based methods. Such methods can also offer scalability, particularly when there is a large amount of data involving multiple variables. Using our own evaluation metrics and experiments on linear, nonlinear, and benchmark Sachs data, we aimed to highlight the various advantages and disadvantages associated with these methods for the health care community. We also highlighted recent developments in biomedicine where causal structure learning can be applied to discover structures such as gene networks, brain connectivity networks, and those in cancer epidemiology.

RESULTS

We also compared the performance of traditional and machine learning-based algorithms for causal discovery over some benchmark data sets. Directed Acyclic Graph-Graph Neural Network has the lowest structural hamming distance (19) and false positive rate (0.13) based on the Sachs data set, whereas Greedy Equivalence Search and Max-Min Hill Climbing have the best false discovery rate (0.68) and true positive rate (0.56), respectively.

CONCLUSIONS

Machine learning-based approaches, including deep learning, have many advantages over traditional approaches, such as scalability, including a greater number of variables, and potentially being applied in a wide range of biomedical applications, such as genetics, if sufficient data are available. Furthermore, these models are more flexible than traditional models and are poised to positively affect many applications in the future.

Collapse

Fan Z, Kernan KF, Sriram A, Benos PV, Canna SW, Carcillo JA, Kim S, Park HJ. Deep neural networks with knockoff features identify nonlinear causal relations and estimate effect sizes in complex biological systems. Gigascience 2022;12:giad044. [PMID: 37395630 PMCID: PMC10316696 DOI: 10.1093/gigascience/giad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 01/31/2023] [Accepted: 05/29/2023] [Indexed: 07/04/2023] Open

Bankier S, Michoel T. eQTLs as causal instruments for the reconstruction of hormone linked gene networks. Front Endocrinol (Lausanne) 2022;13:949061. [PMID: 36060942 PMCID: PMC9428692 DOI: 10.3389/fendo.2022.949061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 07/25/2022] [Indexed: 11/17/2022] Open

Li L, Huang L, Yang A, Feng X, Mo Z, Zhang H, Yang X. Causal Relationship Between Complement C3, C4, and Nonalcoholic Fatty Liver Disease: Bidirectional Mendelian Randomization Analysis. PHENOMICS (CHAM, SWITZERLAND) 2021;1:211-221. [PMID: 36939807 PMCID: PMC9590569 DOI: 10.1007/s43657-021-00023-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 08/07/2021] [Accepted: 08/18/2021] [Indexed: 02/07/2023]

Abstract

The complement system is activated during the development of nonalcoholic fatty liver disease (NAFLD). We aimed to evaluate the causal relationship between serum C3 and C4 levels and NAFLD. After exclusion criteria, a total of 1600 Chinese Han men from the Fangchenggang Area Male Health and Examination Survey cohort were enrolled in cross-sectional analysis, while 572 participants were included in the longitudinal analysis (average follow-up of 4 years). We performed a bidirectional Mendelian randomization (MR) analysis using two C3-related, eight C4-related and three NAFLD-related gene loci as instrumental variables to evaluate the causal associations between C3, C4, and NAFLD risk in cross-sectional analysis. Per SD increase in C3 levels was significantly associated with higher risk of NAFLD (OR = 1.65, 95% CI 1.40, 1.94) in cross-sectional analysis while C4 was not (OR = 1.04, 95% CI 0.89, 1.21). Longitudinal analysis produced similar results (HR_C3 = 1.20, 95% CI 1.02, 1.42; HR_C4 = 1.10, 95% CI 0.94, 1.28). In MR analysis, there were no causal relationships for genetically determined C3 levels and NAFLD risk using unweighted or weighted GRS_C3 (β_{E_unweighted} = -0.019, 95% CI -0.019, -0.019, p = 0.202; β_{E_weighted} = -0.019, 95% CI -0.019, -0.019, p = 0.322). Conversely, serum C3 levels were significantly effected by the genetically determined NAFLD (β_{E_unweighted} = 0.020, 95% CI 0.020, 0.020, p = 0.004; β_{E_weighted} = 0.021, 95% CI 0.020, 0.021, p = 0.004). Neither the direction from C4 to NAFLD nor the one from NAFLD to C4 showed significant association. Our results support that the change in serum C3 levels but not C4 levels might be caused by NAFLD in Chinese Han men.

Supplementary Information

The online version contains supplementary material available at 10.1007/s43657-021-00023-0.

Collapse

Affiliation(s)

Longman Li grid.256607.00000 0004 1798 2653Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021 Guangxi China Nanhu Zhuxi Community Healthcare Center, Qingxiu District, Nanning, 530021 Guangxi China grid.412594.fDepartment of Urology, Institute of Urology and Nephrology, The First Affiliated Hospital of Guangxi Medical University, Nanning, 530021 Guangxi China
Lulu Huang grid.256607.00000 0004 1798 2653Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021 Guangxi China
Aimin Yang grid.194645.b0000000121742757School of Public Health, The University of Hong Kong, Hong Kong SAR, 999077 China
Xiuming Feng grid.256607.00000 0004 1798 2653Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021 Guangxi China grid.256607.00000 0004 1798 2653Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, 530021 Guangxi China
Zengnan Mo grid.256607.00000 0004 1798 2653Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021 Guangxi China grid.412594.fDepartment of Urology, Institute of Urology and Nephrology, The First Affiliated Hospital of Guangxi Medical University, Nanning, 530021 Guangxi China
Haiying Zhang grid.256607.00000 0004 1798 2653Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021 Guangxi China grid.256607.00000 0004 1798 2653Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, 530021 Guangxi China
Xiaobo Yang grid.256607.00000 0004 1798 2653Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021 Guangxi China grid.256607.00000 0004 1798 2653Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, 530021 Guangxi China grid.440719.f0000 0004 1800 187XDepartment of Public Health, School of Medicine, Guangxi University of Science and Technology, Liuzhou, 545006 Guangxi China

Collapse

Ha MJ, Sun W. Estimation of high-dimensional directed acyclic graphs with surrogate intervention. Biostatistics 2020;21:659-675. [PMID: 30596892 DOI: 10.1093/biostatistics/kxy080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 11/18/2018] [Accepted: 11/25/2018] [Indexed: 11/15/2022] Open

Li L, Huang L, Huang S, Luo X, Zhang H, Mo Z, Wu T, Yang X. Non-linear association of serum molybdenum and linear association of serum zinc with nonalcoholic fatty liver disease: Multiple-exposure and Mendelian randomization approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020;720:137655. [PMID: 32146412 DOI: 10.1016/j.scitotenv.2020.137655] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 02/27/2020] [Accepted: 02/29/2020] [Indexed: 06/10/2023]

Affiliation(s)

Longman Li Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China
Lulu Huang Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China
Sifang Huang Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China
Xiaoyu Luo Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China
Haiying Zhang Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China
Zengnan Mo Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China; Institute of Urology and Nephrology, First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Tangchun Wu Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Department of Occupational and Environmental Health, Key Laboratory of Environment and Health, Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.
Xiaobo Yang Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, China.

Collapse

Wang L, Audenaert P, Michoel T. High-Dimensional Bayesian Network Inference From Systems Genetics Data Using Genetic Node Ordering. Front Genet 2019;10:1196. [PMID: 31921278 PMCID: PMC6933017 DOI: 10.3389/fgene.2019.01196] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 10/29/2019] [Indexed: 11/23/2022] Open

Bustos-Korts D, Malosetti M, Chenu K, Chapman S, Boer MP, Zheng B, van Eeuwijk FA. From QTLs to Adaptation Landscapes: Using Genotype-To-Phenotype Models to Characterize G×E Over Time. FRONTIERS IN PLANT SCIENCE 2019;10:1540. [PMID: 31867027 PMCID: PMC6904366 DOI: 10.3389/fpls.2019.01540] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Accepted: 11/04/2019] [Indexed: 05/18/2023]

Abstract

Genotype by environment interaction (G×E) for the target trait, e.g. yield, is an emerging property of agricultural systems and results from the interplay between a hierarchy of secondary traits involving the capture and allocation of environmental resources during the growing season. This hierarchy of secondary traits ranges from basic traits that correspond to response mechanisms/sensitivities, to intermediate traits that integrate a larger number of processes over time and therefore show a larger amount of G×E. Traits underlying yield differ in their contribution to adaptation across environmental conditions and have different levels of G×E. Here, we provide a framework to study the performance of genotype to phenotype (G2P) modeling approaches. We generate and analyze response surfaces, or adaptation landscapes, for yield and yield related traits, emphasizing the organization of the traits in a hierarchy and their development and interactions over time. We use the crop growth model APSIM-wheat with genotype-dependent parameters as a tool to simulate non-linear trait responses over time with complex trait dependencies and apply it to wheat crops in Australia. For biological realism, APSIM parameters were given a genetic basis of 300 QTLs sampled from a gamma distribution whose shape and rate parameters were estimated from real wheat data. In the simulations, the hierarchical organization of the traits and their interactions over time cause G×E for yield even when underlying traits do not show G×E. Insight into how G×E arises during growth and development helps to improve the accuracy of phenotype predictions within and across environments and to optimize trial networks. We produced a tangible simulated adaptation landscape for yield that we first investigated for its biological credibility by statistical models for G×E that incorporate genotypic and environmental covariables. Subsequently, the simulated trait data were used to evaluate statistical genotype-to-phenotype models for multiple traits and environments and to characterize relationships between traits over time and across environments, as a way to identify traits that could be useful to select for specific adaptation. Designed appropriately, these types of simulated landscapes might also serve as a basis to train other, more deep learning methodologies in order to transfer such network models to real-world situations.

Collapse

Jiang D, Armour CR, Hu C, Mei M, Tian C, Sharpton TJ, Jiang Y. Microbiome Multi-Omics Network Analysis: Statistical Considerations, Limitations, and Opportunities. Front Genet 2019;10:995. [PMID: 31781153 PMCID: PMC6857202 DOI: 10.3389/fgene.2019.00995] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 09/18/2019] [Indexed: 12/21/2022] Open

Rojo C, Zhang Q, Keleş S. iFunMed: Integrative functional mediation analysis of GWAS and eQTL studies. Genet Epidemiol 2019;43:742-760. [PMID: 31328826 DOI: 10.1002/gepi.22217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Revised: 04/17/2019] [Accepted: 05/07/2019] [Indexed: 11/08/2022]

Yu H, Blair RH. Integration of probabilistic regulatory networks into constraint-based models of metabolism with applications to Alzheimer's disease. BMC Bioinformatics 2019;20:386. [PMID: 31291905 PMCID: PMC6617954 DOI: 10.1186/s12859-019-2872-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Accepted: 05/02/2019] [Indexed: 01/08/2023] Open

Abstract

Background

Mathematical models of biological networks can provide important predictions and insights into complex disease. Constraint-based models of cellular metabolism and probabilistic models of gene regulatory networks are two distinct areas that have progressed rapidly in parallel over the past decade. In principle, gene regulatory networks and metabolic networks underly the same complex phenotypes and diseases. However, systematic integration of these two model systems remains a fundamental challenge.

Results

In this work, we address this challenge by fusing probabilistic models of gene regulatory networks into constraint-based models of metabolism. The novel approach utilizes probabilistic reasoning in BN models of regulatory networks serves as the “glue” that enables a natural interface between the two systems. Probabilistic reasoning is used to predict and quantify system-wide effects of perturbation to the regulatory network in the form of constraints for flux variability analysis. In this setting, both regulatory and metabolic networks inherently account for uncertainty. Applications leverage constraint-based metabolic models of brain metabolism and gene regulatory networks parameterized by gene expression data from the hippocampus to investigate the role of the HIF-1 pathway in Alzheimer’s disease. Integrated models support HIF-1A as effective target to reduce the effects of hypoxia in Alzheimer’s disease. However, HIF-1A activation is far less effective in shifting metabolism when compared to brain metabolism in healthy controls.

Conclusions

The direct integration of probabilistic regulatory networks into constraint-based models of metabolism provides novel insights into how perturbations in the regulatory network may influence metabolic states. Predictive modeling of enzymatic activity can be facilitated using probabilistic reasoning, thereby extending the predictive capacity of the network. This framework for model integration is generalizable to other systems.

Electronic supplementary material

The online version of this article (10.1186/s12859-019-2872-8) contains supplementary material, which is available to authorized users.

Collapse

Rezaei Tabar V, Zareifard H, Salimi S, Plewczynski D. Learning directed acyclic graphs by determination of candidate causes for discrete variables. J STAT COMPUT SIM 2019. [DOI: 10.1080/00949655.2019.1604709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Causal phenotypic networks for egg traits in an F₂ chicken population. Mol Genet Genomics 2019;294:1455-1462. [PMID: 31240383 DOI: 10.1007/s00438-019-01588-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 06/17/2019] [Indexed: 12/24/2022]

Abstract

Traditional single-trait genetic analyses, such as quantitative trait locus (QTL) and genome-wide association studies (GWAS), have been used to understand genotype-phenotype relationships for egg traits in chickens. Even though these techniques can detect potential genes of major effect, they cannot reveal cryptic causal relationships among QTLs and phenotypes. Thus, to better understand the relationships involving multiple genes and phenotypes of interest, other data analysis techniques must be used. Here, we utilized a QTL-directed dependency graph (QDG) mapping approach for a joint analysis of chicken egg traits, so that functional relationships and potential causal effects between them could be investigated. The QDG mapping identified a total of 17 QTLs affecting 24 egg traits that formed three independent networks of phenotypic trait groups (eggshell color, egg production, and size and weight of egg components), clearly distinguishing direct and indirect effects of QTLs towards correlated traits. For example, the network of size and weight of egg components contained 13 QTLs and 18 traits that are densely connected to each other. This indicates complex relationships between genotype and phenotype involving both direct and indirect effects of QTLs on the studied traits. Most of the QTLs were commonly identified by both the traditional (single-trait) mapping and the QDG approach. The network analysis, however, offers additional insight regarding the source and characterization of pleiotropy affecting egg traits. As such, the QDG analysis provides a substantial step forward, revealing cryptic relationships among QTLs and phenotypes, especially regarding direct and indirect QTL effects as well as potential causal relationships between traits, which can be used, for example, to optimize management practices and breeding strategies for the improvement of the traits.

Collapse

Glymour C, Zhang K, Spirtes P. Review of Causal Discovery Methods Based on Graphical Models. Front Genet 2019;10:524. [PMID: 31214249 PMCID: PMC6558187 DOI: 10.3389/fgene.2019.00524] [Citation(s) in RCA: 130] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 05/13/2019] [Indexed: 12/11/2022] Open

Tasaki S, Gaiteri C, Mostafavi S, Yu L, Wang Y, De Jager PL, Bennett DA. Multi-omic Directed Networks Describe Features of Gene Regulation in Aged Brains and Expand the Set of Genes Driving Cognitive Decline. Front Genet 2018;9:294. [PMID: 30140277 PMCID: PMC6095043 DOI: 10.3389/fgene.2018.00294] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Accepted: 07/13/2018] [Indexed: 01/10/2023] Open

Genetic Drivers of Pancreatic Islet Function. Genetics 2018;209:335-356. [PMID: 29567659 DOI: 10.1534/genetics.118.300864] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 03/19/2018] [Indexed: 01/03/2023] Open

Abstract

The majority of gene loci that have been associated with type 2 diabetes play a role in pancreatic islet function. To evaluate the role of islet gene expression in the etiology of diabetes, we sensitized a genetically diverse mouse population with a Western diet high in fat (45% kcal) and sucrose (34%) and carried out genome-wide association mapping of diabetes-related phenotypes. We quantified mRNA abundance in the islets and identified 18,820 expression QTL. We applied mediation analysis to identify candidate causal driver genes at loci that affect the abundance of numerous transcripts. These include two genes previously associated with monogenic diabetes (PDX1 and HNF4A), as well as three genes with nominal association with diabetes-related traits in humans (FAM83E, IL6ST, and SAT2). We grouped transcripts into gene modules and mapped regulatory loci for modules enriched with transcripts specific for α-cells, and another specific for δ-cells. However, no single module enriched for β-cell-specific transcripts, suggesting heterogeneity of gene expression patterns within the β-cell population. A module enriched in transcripts associated with branched-chain amino acid metabolism was the most strongly correlated with physiological traits that reflect insulin resistance. Although the mice in this study were not overtly diabetic, the analysis of pancreatic islet gene expression under dietary-induced stress enabled us to identify correlated variation in groups of genes that are functionally linked to diabetes-associated physiological traits. Our analysis suggests an expected degree of concordance between diabetes-associated loci in the mouse and those found in human populations, and demonstrates how the mouse can provide evidence to support nominal associations found in human genome-wide association mapping.

Collapse

Lepik K, Annilo T, Kukuškina V, Kisand K, Kutalik Z, Peterson P, Peterson H. C-reactive protein upregulates the whole blood expression of CD59 - an integrative analysis. PLoS Comput Biol 2017;13:e1005766. [PMID: 28922377 PMCID: PMC5609773 DOI: 10.1371/journal.pcbi.1005766] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Revised: 09/22/2017] [Accepted: 09/01/2017] [Indexed: 12/21/2022] Open

Abstract

Elevated C-reactive protein (CRP) concentrations in the blood are associated with acute and chronic infections and inflammation. Nevertheless, the functional role of increased CRP in multiple bacterial and viral infections as well as in chronic inflammatory diseases remains unclear. Here, we studied the relationship between CRP and gene expression levels in the blood in 491 individuals from the Estonian Biobank cohort, to elucidate the role of CRP in these inflammatory mechanisms. As a result, we identified a set of 1,614 genes associated with changes in CRP levels with a high proportion of interferon-stimulated genes. Further, we performed likelihood-based causality model selection and Mendelian randomization analysis to discover causal links between CRP and the expression of CRP-associated genes. Strikingly, our computational analysis and cell culture stimulation assays revealed increased CRP levels to drive the expression of complement regulatory protein CD59, suggesting CRP to have a critical role in protecting blood cells from the adverse effects of the immune defence system. Our results show the benefit of integrative analysis approaches in hypothesis-free uncovering of causal relationships between traits.

Chronic inflammation is associated with chronic diseases, morbidity and mortality while lower base inflammation levels are thought to be predictive of healthy aging. Thus, to pursue a long and healthy lifespan, it is essential to understand the inflammatory regulatory mechanisms. To that end, we studied the functional role of C-reactive protein (CRP)–an inflammatory biomarker that is used to measure cardiovascular risk in clinical practice. There is evidence for a strong genetic component of elevated CRP levels but it is still unclear if it has a direct impact on the processes that lead to inflammatory diseases. In order to elucidate the function of CRP in the blood, we used statistical methods for causal inference to infer causal relationships between changes in CRP and gene expression levels. Our statistical analysis and cell culture experiments suggest that CRP drives the expression of complement regulatory protein CD59. Thus, CRP can have a functional role in protecting human blood cells from the adverse effects of the immune defence system.

Collapse

Bayesian Networks Illustrate Genomic and Residual Trait Connections in Maize (Zea mays L.). G3-GENES GENOMES GENETICS 2017. [PMID: 28637811 PMCID: PMC5555481 DOI: 10.1534/g3.117.044263] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Hasin Y, Seldin M, Lusis A. Multi-omics approaches to disease. Genome Biol 2017;18:83. [PMID: 28476144 PMCID: PMC5418815 DOI: 10.1186/s13059-017-1215-1] [Citation(s) in RCA: 1132] [Impact Index Per Article: 161.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Wang P, Rahman M, Jin L, Xiong M. A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data. BMC Genomics 2016;17:881. [PMID: 27821073 PMCID: PMC5100198 DOI: 10.1186/s12864-016-3169-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 10/18/2016] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The widely used genetic pleiotropic analyses of multiple phenotypes are often designed for examining the relationship between common variants and a few phenotypes. They are not suited for both high dimensional phenotypes and high dimensional genotype (next-generation sequencing) data. To overcome limitations of the traditional genetic pleiotropic analysis of multiple phenotypes, we develop sparse structural equation models (SEMs) as a general framework for a new paradigm of genetic analysis of multiple phenotypes. To incorporate both common and rare variants into the analysis, we extend the traditional multivariate SEMs to sparse functional SEMs. To deal with high dimensional phenotype and genotype data, we employ functional data analysis and the alternative direction methods of multiplier (ADMM) techniques to reduce data dimension and improve computational efficiency.

RESULTS

Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods. Simulations also demonstrate that the gene-based pleiotropic analysis has higher power than the single variant-based pleiotropic analysis. The proposed method is applied to exome sequence data from the NHLBI's Exome Sequencing Project (ESP) with 11 phenotypes, which identifies a network with 137 genes connected to 11 phenotypes and 341 edges. Among them, 114 genes showed pleiotropic genetic effects and 45 genes were reported to be associated with phenotypes in the analysis or other cardiovascular disease (CVD) related phenotypes in the literature.

CONCLUSIONS

Our proposed sparse functional SEMs can incorporate both common and rare variants into the analysis and the ADMM algorithm can efficiently solve the penalized SEMs. Using this model we can jointly infer genetic architecture and casual phenotype network structure, and decompose the genetic effect into direct, indirect and total effect. Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods.

Collapse

Han SW, Chen G, Cheon MS, Zhong H. Estimation of Directed Acyclic Graphs Through Two-stage Adaptive Lasso for Gene Network Inference. J Am Stat Assoc 2016;111:1004-1019. [PMID: 28239216 DOI: 10.1080/01621459.2016.1142880] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Richardson S, Tseng GC, Sun W. Statistical Methods in Integrative Genomics. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION 2016;3:181-209. [PMID: 27482531 PMCID: PMC4963036 DOI: 10.1146/annurev-statistics-041715-033506] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]

Haycock PC, Burgess S, Wade KH, Bowden J, Relton C, Davey Smith G. Best (but oft-forgotten) practices: the design, analysis, and interpretation of Mendelian randomization studies. Am J Clin Nutr 2016;103:965-78. [PMID: 26961927 PMCID: PMC4807699 DOI: 10.3945/ajcn.115.118216] [Citation(s) in RCA: 341] [Impact Index Per Article: 42.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Accepted: 02/02/2016] [Indexed: 01/14/2023] Open

Moharil J, May P, Gaile DP, Blair RH. Belief propagation in genotype-phenotype networks. Stat Appl Genet Mol Biol 2016;15:39-53. [PMID: 26910752 DOI: 10.1515/sagmb-2015-0058] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Abstract

Graphical models have proven to be a valuable tool for connecting genotypes and phenotypes. Structural learning of phenotype-genotype networks has received considerable attention in the post-genome era. In recent years, a dozen different methods have emerged for network inference, which leverage natural variation that arises in certain genetic populations. The structure of the network itself can be used to form hypotheses based on the inferred direct and indirect network relationships, but represents a premature endpoint to the graphical analyses. In this work, we extend this endpoint. We examine the unexplored problem of perturbing a given network structure, and quantifying the system-wide effects on the network in a node-wise manner. The perturbation is achieved through the setting of values of phenotype node(s), which may reflect an inhibition or activation, and propagating this information through the entire network. We leverage belief propagation methods in Conditional Gaussian Bayesian Networks (CG-BNs), in order to absorb and propagate phenotypic evidence through the network. We show that the modeling assumptions adopted for genotype-phenotype networks represent an important sub-class of CG-BNs, which possess properties that ensure exact inference in the propagation scheme. The system-wide effects of the perturbation are quantified in a node-wise manner through the comparison of perturbed and unperturbed marginal distributions using a symmetric Kullback-Leibler divergence. Applications to kidney and skin cancer expression quantitative trait loci (eQTL) data from different mus musculus populations are presented. System-wide effects in the network were predicted and visualized across a spectrum of evidence. Sub-pathways and regions of the network responded in concert, suggesting co-regulation and coordination throughout the network in response to phenotypic changes. We demonstrate how these predicted system-wide effects can be examined in connection with estimated class probabilities for covariates of interest, e.g. cancer status. Despite the uncertainty in the network structure, we demonstrate the system-wide predictions are stable across an ensemble of highly likely networks. A software package, geneNetBP, which implements our approach, was developed in the R programming language.

Collapse

Yazdani A, Yazdani A, Samiei A, Boerwinkle E. Generating a robust statistical causal structure over 13 cardiovascular disease risk factors using genomics data. J Biomed Inform 2016;60:114-9. [PMID: 26827624 DOI: 10.1016/j.jbi.2016.01.012] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Revised: 01/19/2016] [Accepted: 01/22/2016] [Indexed: 10/22/2022]

Probabilistic Computational Causal Discovery for Systems Biology. UNCERTAINTY IN BIOLOGY 2016. [DOI: 10.1007/978-3-319-21296-8_3] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Peñagaricano F, Valente BD, Steibel JP, Bates RO, Ernst CW, Khatib H, Rosa GJM. Exploring causal networks underlying fat deposition and muscularity in pigs through the integration of phenotypic, genotypic and transcriptomic data. BMC SYSTEMS BIOLOGY 2015;9:58. [PMID: 26376630 PMCID: PMC4574162 DOI: 10.1186/s12918-015-0207-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Accepted: 09/04/2015] [Indexed: 12/23/2022]

Abstract

BACKGROUND

Joint modeling and analysis of phenotypic, genotypic and transcriptomic data have the potential to uncover the genetic control of gene activity and phenotypic variation, as well as shed light on the manner and extent of connectedness among these variables. Current studies mainly report associations, i.e. undirected connections among variables without causal interpretation. Knowledge regarding causal relationships among genes and phenotypes can be used to predict the behavior of complex systems, as well as to optimize management practices and selection strategies. Here, we performed a multistep procedure for inferring causal networks underlying carcass fat deposition and muscularity in pigs using multi-omics data obtained from an F2 Duroc x Pietrain resource pig population.

RESULTS

We initially explored marginal associations between genotypes and phenotypic and expression traits through whole-genome scans, and then, in genomic regions with multiple significant hits, we assessed gene-phenotype network reconstruction using causal structural learning algorithms. One genomic region on SSC6 showed significant associations with three relevant phenotypes, off-midline10th-rib backfat thickness, loin muscle weight, and average intramuscular fat percentage, and also with the expression of seven genes, including ZNF24, SSX2IP, and AKR7A2. The inferred network indicated that the genotype affects the three phenotypes mainly through the expression of several genes. Among the phenotypes, fat deposition traits negatively affected loin muscle weight.

CONCLUSIONS

Our findings shed light on the antagonist relationship between carcass fat deposition and lean meat content in pigs. In addition, the procedure described in this study has the potential to unravel gene-phenotype networks underlying complex phenotypes.

Collapse

Fear JM, Arbeitman MN, Salomon MP, Dalton JE, Tower J, Nuzhdin SV, McIntyre LM. The Wright stuff: reimagining path analysis reveals novel components of the sex determination hierarchy in Drosophila melanogaster. BMC SYSTEMS BIOLOGY 2015;9:53. [PMID: 26335107 PMCID: PMC4558766 DOI: 10.1186/s12918-015-0200-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 08/20/2015] [Indexed: 11/10/2022]

Ziegler A, Mwambi H, König IR. Mendelian Randomization versus Path Models: Making Causal Inferences in Genetic Epidemiology. Hum Hered 2015. [PMID: 26201704 DOI: 10.1159/000381338] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Using molecular genetic information to infer causality in observational data: Mendelian randomisation. Curr Opin Behav Sci 2015. [DOI: 10.1016/j.cobeha.2014.08.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Lovell JT, Mullen JL, Lowry DB, Awole K, Richards JH, Sen S, Verslues PE, Juenger TE, McKay JK. Exploiting Differential Gene Expression and Epistasis to Discover Candidate Genes for Drought-Associated QTLs in Arabidopsis thaliana. THE PLANT CELL 2015;27:969-83. [PMID: 25873386 PMCID: PMC4558705 DOI: 10.1105/tpc.15.00122] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2015] [Revised: 03/13/2015] [Accepted: 04/01/2015] [Indexed: 05/09/2023]

Oren Y, Nachshon A, Frishberg A, Wilentzik R, Gat-Viks I. Linking traits based on their shared molecular mechanisms. eLife 2015;4. [PMID: 25781485 PMCID: PMC4362207 DOI: 10.7554/elife.04346] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Accepted: 02/20/2015] [Indexed: 12/29/2022] Open

Bayesian network reconstruction using systems genetics data: comparison of MCMC methods. Genetics 2015;199:973-89. [PMID: 25631319 PMCID: PMC4391572 DOI: 10.1534/genetics.114.172619] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2014] [Accepted: 01/26/2015] [Indexed: 12/23/2022] Open

Fondi M, Liò P. Multi -omics and metabolic modelling pipelines: challenges and tools for systems microbiology. Microbiol Res 2015;171:52-64. [PMID: 25644953 DOI: 10.1016/j.micres.2015.01.003] [Citation(s) in RCA: 100] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Revised: 01/02/2015] [Accepted: 01/03/2015] [Indexed: 12/27/2022]

Wang H, Paulo J, Kruijer W, Boer M, Jansen H, Tikunov Y, Usadel B, van Heusden S, Bovy A, van Eeuwijk F. Genotype–phenotype modeling considering intermediate level of biological variation: a case study involving sensory traits, metabolites and QTLs in ripe tomatoes. MOLECULAR BIOSYSTEMS 2015;11:3101-10. [DOI: 10.1039/c5mb00477b] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Mapping eQTL networks with mixed graphical Markov models. Genetics 2014;198:1377-93. [PMID: 25271303 DOI: 10.1534/genetics.114.169573] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open

Davey Smith G, Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet 2014;23:R89-98. [PMID: 25064373 PMCID: PMC4170722 DOI: 10.1093/hmg/ddu328] [Citation(s) in RCA: 2003] [Impact Index Per Article: 200.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Revised: 06/19/2014] [Accepted: 06/20/2014] [Indexed: 12/13/2022] Open

Wang H, van Eeuwijk FA. A new method to infer causal phenotype networks using QTL and phenotypic information. PLoS One 2014;9:e103997. [PMID: 25144184 PMCID: PMC4140682 DOI: 10.1371/journal.pone.0103997] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Accepted: 07/06/2014] [Indexed: 11/25/2022] Open

Zhang L, Kim S. Learning gene networks under SNP perturbations using eQTL datasets. PLoS Comput Biol 2014;10:e1003420. [PMID: 24586125 PMCID: PMC3937098 DOI: 10.1371/journal.pcbi.1003420] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 11/18/2013] [Indexed: 11/23/2022] Open

Abstract

The standard approach for identifying gene networks is based on experimental perturbations of gene regulatory systems such as gene knock-out experiments, followed by a genome-wide profiling of differential gene expressions. However, this approach is significantly limited in that it is not possible to perturb more than one or two genes simultaneously to discover complex gene interactions or to distinguish between direct and indirect downstream regulations of the differentially-expressed genes. As an alternative, genetical genomics study has been proposed to treat naturally-occurring genetic variants as potential perturbants of gene regulatory system and to recover gene networks via analysis of population gene-expression and genotype data. Despite many advantages of genetical genomics data analysis, the computational challenge that the effects of multifactorial genetic perturbations should be decoded simultaneously from data has prevented a widespread application of genetical genomics analysis. In this article, we propose a statistical framework for learning gene networks that overcomes the limitations of experimental perturbation methods and addresses the challenges of genetical genomics analysis. We introduce a new statistical model, called a sparse conditional Gaussian graphical model, and describe an efficient learning algorithm that simultaneously decodes the perturbations of gene regulatory system by a large number of SNPs to identify a gene network along with expression quantitative trait loci (eQTLs) that perturb this network. While our statistical model captures direct genetic perturbations of gene network, by performing inference on the probabilistic graphical model, we obtain detailed characterizations of how the direct SNP perturbation effects propagate through the gene network to perturb other genes indirectly. We demonstrate our statistical method using HapMap-simulated and yeast eQTL datasets. In particular, the yeast gene network identified computationally by our method under SNP perturbations is well supported by the results from experimental perturbation studies related to DNA replication stress response.

Collapse

Dong Z, Song T, Yuan C. Inference of gene regulatory networks from genetic perturbations with linear regression model. PLoS One 2013;8:e83263. [PMID: 24376676 PMCID: PMC3871530 DOI: 10.1371/journal.pone.0083263] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 11/01/2013] [Indexed: 11/19/2022] Open

Peng CH, Jiang YZ, Tai AS, Liu CB, Peng SC, Liao CT, Yen TC, Hsieh WP. Causal inference of gene regulation with subnetwork assembly from genetical genomics data. Nucleic Acids Res 2013;42:2803-19. [PMID: 24322297 PMCID: PMC3950678 DOI: 10.1093/nar/gkt1277] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open

Cai X, Bazerque JA, Giannakis GB. Inference of gene regulatory networks with sparse structural equation models exploiting genetic perturbations. PLoS Comput Biol 2013;9:e1003068. [PMID: 23717196 PMCID: PMC3662697 DOI: 10.1371/journal.pcbi.1003068] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2012] [Accepted: 03/28/2013] [Indexed: 12/22/2022] Open

Abstract

Integrating genetic perturbations with gene expression data not only improves accuracy of regulatory network topology inference, but also enables learning of causal regulatory relations between genes. Although a number of methods have been developed to integrate both types of data, the desiderata of efficient and powerful algorithms still remains. In this paper, sparse structural equation models (SEMs) are employed to integrate both gene expression data and cis-expression quantitative trait loci (cis-eQTL), for modeling gene regulatory networks in accordance with biological evidence about genes regulating or being regulated by a small number of genes. A systematic inference method named sparsity-aware maximum likelihood (SML) is developed for SEM estimation. Using simulated directed acyclic or cyclic networks, the SML performance is compared with that of two state-of-the-art algorithms: the adaptive Lasso (AL) based scheme, and the QTL-directed dependency graph (QDG) method. Computer simulations demonstrate that the novel SML algorithm offers significantly better performance than the AL-based and QDG algorithms across all sample sizes from 100 to 1,000, in terms of detection power and false discovery rate, in all the cases tested that include acyclic or cyclic networks of 10, 30 and 300 genes. The SML method is further applied to infer a network of 39 human genes that are related to the immune function and are chosen to have a reliable eQTL per gene. The resulting network consists of 9 genes and 13 edges. Most of the edges represent interactions reasonably expected from experimental evidence, while the remaining may just indicate the emergence of new interactions. The sparse SEM and efficient SML algorithm provide an effective means of exploiting both gene expression and perturbation data to infer gene regulatory networks. An open-source computer program implementing the SML algorithm is freely available upon request.

Collapse

Modeling causality for pairs of phenotypes in system genetics. Genetics 2013;193:1003-13. [PMID: 23288936 PMCID: PMC3583988 DOI: 10.1534/genetics.112.147124] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Shah SH, Kraus WE, Newgard CB. Metabolomic profiling for the identification of novel biomarkers and mechanisms related to common cardiovascular diseases: form and function. Circulation 2012;126:1110-20. [PMID: 22927473 DOI: 10.1161/circulationaha.111.060368] [Citation(s) in RCA: 267] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Bello N, Stevenson J, Tempelman R. Invited review: Milk production and reproductive performance: Modern interdisciplinary insights into an enduring axiom. J Dairy Sci 2012;95:5461-75. [DOI: 10.3168/jds.2012-5564] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2012] [Accepted: 06/05/2012] [Indexed: 11/19/2022]

Nuzhdin SV, Friesen ML, McIntyre LM. Genotype-phenotype mapping in a post-GWAS world. Trends Genet 2012;28:421-6. [PMID: 22818580 DOI: 10.1016/j.tig.2012.06.003] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2012] [Revised: 05/22/2012] [Accepted: 06/18/2012] [Indexed: 01/18/2023]

Edwards D, Wang L, Sørensen P. Network-enabled gene expression analysis. BMC Bioinformatics 2012;13:167. [PMID: 22799258 PMCID: PMC3556136 DOI: 10.1186/1471-2105-13-167] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Accepted: 06/28/2012] [Indexed: 01/17/2023] Open

Blair RH, Trichler DL, Gaille DP. Mathematical and statistical modeling in cancer systems biology. Front Physiol 2012;3:227. [PMID: 22754537 PMCID: PMC3385354 DOI: 10.3389/fphys.2012.00227] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2012] [Accepted: 06/05/2012] [Indexed: 11/13/2022] Open

Systems genetics: challenges and developing strategies. Biologia (Bratisl) 2012. [DOI: 10.2478/s11756-012-0026-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]