1
|
Nim HT, Dang L, Thiyagarajah H, Bakopoulos D, See M, Charitakis N, Sibbritt T, Eichenlaub MP, Archer SK, Fossat N, Burke RE, Tam PPL, Warr CG, Johnson TK, Ramialison M. A cis-regulatory-directed pipeline for the identification of genes involved in cardiac development and disease. Genome Biol 2021; 22:335. [PMID: 34906219 PMCID: PMC8672579 DOI: 10.1186/s13059-021-02539-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 11/10/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Congenital heart diseases are the major cause of death in newborns, but the genetic etiology of this developmental disorder is not fully known. The conventional approach to identify the disease-causing genes focuses on screening genes that display heart-specific expression during development. However, this approach would have discounted genes that are expressed widely in other tissues but may play critical roles in heart development. RESULTS We report an efficient pipeline of genome-wide gene discovery based on the identification of a cardiac-specific cis-regulatory element signature that points to candidate genes involved in heart development and congenital heart disease. With this pipeline, we retrieve 76% of the known cardiac developmental genes and predict 35 novel genes that previously had no known connectivity to heart development. Functional validation of these novel cardiac genes by RNAi-mediated knockdown of the conserved orthologs in Drosophila cardiac tissue reveals that disrupting the activity of 71% of these genes leads to adult mortality. Among these genes, RpL14, RpS24, and Rpn8 are associated with heart phenotypes. CONCLUSIONS Our pipeline has enabled the discovery of novel genes with roles in heart development. This workflow, which relies on screening for non-coding cis-regulatory signatures, is amenable for identifying developmental and disease genes for an organ without constraining to genes that are expressed exclusively in the organ of interest.
Collapse
Affiliation(s)
- Hieu T. Nim
- Australian Regenerative Medicine Institute and Systems Biology Institute Australia, Monash University, Clayton, VIC Australia
- Murdoch Children’s Research Institute, Parkville, VIC Australia
| | - Louis Dang
- Australian Regenerative Medicine Institute and Systems Biology Institute Australia, Monash University, Clayton, VIC Australia
| | - Harshini Thiyagarajah
- School of Biological Sciences, Faculty of Science, Monash University, Clayton, VIC Australia
| | - Daniel Bakopoulos
- School of Biological Sciences, Faculty of Science, Monash University, Clayton, VIC Australia
| | - Michael See
- Murdoch Children’s Research Institute, Parkville, VIC Australia
- Monash Bioinformatics Platform, Monash University, Clayton, VIC Australia
| | - Natalie Charitakis
- Murdoch Children’s Research Institute, Parkville, VIC Australia
- Department of Paediatrics, University of Melbourne, Parkville, VIC Australia
| | - Tennille Sibbritt
- Embryology Research Unit, Children’s Medical Research Institute, and School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, Westmead, New South Wales Australia
| | - Michael P. Eichenlaub
- Australian Regenerative Medicine Institute and Systems Biology Institute Australia, Monash University, Clayton, VIC Australia
| | - Stuart K. Archer
- Monash Bioinformatics Platform, Monash University, Clayton, VIC Australia
| | - Nicolas Fossat
- Embryology Research Unit, Children’s Medical Research Institute, and School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, Westmead, New South Wales Australia
- Present address: Copenhagen Hepatitis C Program, Department of Immunology and Microbiology, University of Copenhagen, Copenhagen, Denmark
- Present address: Department of Infectious Diseases, Hvidovre Hospital, Hvidovre, Denmark
| | - Richard E. Burke
- School of Biological Sciences, Faculty of Science, Monash University, Clayton, VIC Australia
| | - Patrick P. L. Tam
- Embryology Research Unit, Children’s Medical Research Institute, and School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, Westmead, New South Wales Australia
| | - Coral G. Warr
- School of Biological Sciences, Faculty of Science, Monash University, Clayton, VIC Australia
- School of Molecular Sciences, La Trobe University, Bundoora, Victoria 3083 Australia
| | - Travis K. Johnson
- School of Biological Sciences, Faculty of Science, Monash University, Clayton, VIC Australia
| | - Mirana Ramialison
- Australian Regenerative Medicine Institute and Systems Biology Institute Australia, Monash University, Clayton, VIC Australia
- Murdoch Children’s Research Institute, Parkville, VIC Australia
| |
Collapse
|
2
|
Colby SM, McClure RS, Overall CC, Renslow RS, McDermott JE. Improving network inference algorithms using resampling methods. BMC Bioinformatics 2018; 19:376. [PMID: 30314469 PMCID: PMC6186128 DOI: 10.1186/s12859-018-2402-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 09/27/2018] [Indexed: 11/10/2022] Open
Abstract
Background Relatively small changes to gene expression data dramatically affect co-expression networks inferred from that data which, in turn, can significantly alter the subsequent biological interpretation. This error propagation is an underappreciated problem that, while hinted at in the literature, has not yet been thoroughly explored. Resampling methods (e.g. bootstrap aggregation, random subspace method) are hypothesized to alleviate variability in network inference methods by minimizing outlier effects and distilling persistent associations in the data. But the efficacy of the approach assumes the generalization from statistical theory holds true in biological network inference applications. Results We evaluated the effect of bootstrap aggregation on inferred networks using commonly applied network inference methods in terms of stability, or resilience to perturbations in the underlying expression data, a metric for accuracy, and functional enrichment of edge interactions. Conclusion Bootstrap aggregation results in improved stability and, depending on the size of the input dataset, a marginal improvement to accuracy assessed by each method’s ability to link genes in the same functional pathway. Electronic supplementary material The online version of this article (10.1186/s12859-018-2402-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sean M Colby
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Ryan S McClure
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Christopher C Overall
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA.,Present Address: Center for Brain Immunology and Glia, University of Virginia, Charlottesville, Virginia, USA
| | - Ryan S Renslow
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Jason E McDermott
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA.
| |
Collapse
|
3
|
Thiagarajan R, Alavi A, Podichetty JT, Bazil JN, Beard DA. The feasibility of genome-scale biological network inference using Graphics Processing Units. Algorithms Mol Biol 2017; 12:8. [PMID: 28344638 PMCID: PMC5360040 DOI: 10.1186/s13015-017-0100-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2016] [Accepted: 03/13/2017] [Indexed: 01/20/2023] Open
Abstract
Systems research spanning fields from biology to finance involves the identification of models to represent the underpinnings of complex systems. Formal approaches for data-driven identification of network interactions include statistical inference-based approaches and methods to identify dynamical systems models that are capable of fitting multivariate data. Availability of large data sets and so-called ‘big data’ applications in biology present great opportunities as well as major challenges for systems identification/reverse engineering applications. For example, both inverse identification and forward simulations of genome-scale gene regulatory network models pose compute-intensive problems. This issue is addressed here by combining the processing power of Graphics Processing Units (GPUs) and a parallel reverse engineering algorithm for inference of regulatory networks. It is shown that, given an appropriate data set, information on genome-scale networks (systems of 1000 or more state variables) can be inferred using a reverse-engineering algorithm in a matter of days on a small-scale modern GPU cluster.
Collapse
|
4
|
McGoff KA, Guo X, Deckard A, Kelliher CM, Leman AR, Francey LJ, Hogenesch JB, Haase SB, Harer JL. The Local Edge Machine: inference of dynamic models of gene regulation. Genome Biol 2016; 17:214. [PMID: 27760556 PMCID: PMC5072315 DOI: 10.1186/s13059-016-1076-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 10/03/2016] [Indexed: 12/31/2022] Open
Abstract
We present a novel approach, the Local Edge Machine, for the inference of regulatory interactions directly from time-series gene expression data. We demonstrate its performance, robustness, and scalability on in silico datasets with varying behaviors, sizes, and degrees of complexity. Moreover, we demonstrate its ability to incorporate biological prior information and make informative predictions on a well-characterized in vivo system using data from budding yeast that have been synchronized in the cell cycle. Finally, we use an atlas of transcription data in a mammalian circadian system to illustrate how the method can be used for discovery in the context of large complex networks.
Collapse
Affiliation(s)
- Kevin A McGoff
- Department of Mathematics and Statistics, UNC Charlotte, 9201 University City Blvd., Charlotte, 28269, NC, USA.
| | - Xin Guo
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong, China
| | | | | | - Adam R Leman
- Department of Biology, Duke University, Durham, NC, USA
| | - Lauren J Francey
- Department of Molecular and Cellular Physiology, University of Cincinnati, Cincinnati, OH, USA
| | - John B Hogenesch
- Department of Molecular and Cellular Physiology, University of Cincinnati, Cincinnati, OH, USA
| | | | - John L Harer
- Department of Mathematics, Duke University, Durham, NC, USA
| |
Collapse
|
5
|
Zeigler AC, Richardson WJ, Holmes JW, Saucerman JJ. Computational modeling of cardiac fibroblasts and fibrosis. J Mol Cell Cardiol 2016; 93:73-83. [PMID: 26608708 PMCID: PMC4846515 DOI: 10.1016/j.yjmcc.2015.11.020] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Revised: 11/18/2015] [Accepted: 11/18/2015] [Indexed: 12/31/2022]
Abstract
Altered fibroblast behavior can lead to pathologic changes in the heart such as arrhythmia, diastolic dysfunction, and systolic dysfunction. Computational models are increasingly used as a tool to identify potential mechanisms driving a phenotype or potential therapeutic targets against an unwanted phenotype. Here we review how computational models incorporating cardiac fibroblasts have clarified the role for these cells in electrical conduction and tissue remodeling in the heart. Models of fibroblast signaling networks have primarily focused on fibroblast cell lines or fibroblasts from other tissues rather than cardiac fibroblasts, specifically, but they are useful for understanding how fundamental signaling pathways control fibroblast phenotype. In the future, modeling cardiac fibroblast signaling, incorporating -omics and drug-interaction data into signaling network models, and utilizing multi-scale models will improve the ability of in silico studies to predict potential therapeutic targets against adverse cardiac fibroblast activity.
Collapse
Affiliation(s)
- Angela C Zeigler
- University of Virginia, Biomedical Engineering Department, 415 Lane Road, Charlottesville, VA 22903, USA.
| | - William J Richardson
- University of Virginia, Biomedical Engineering Department, 415 Lane Road, Charlottesville, VA 22903, USA.
| | - Jeffrey W Holmes
- University of Virginia, Biomedical Engineering Department, 415 Lane Road, Charlottesville, VA 22903, USA.
| | - Jeffrey J Saucerman
- University of Virginia, Biomedical Engineering Department, 415 Lane Road, Charlottesville, VA 22903, USA.
| |
Collapse
|
6
|
Thomé JG, Mendoza MR, Cheuiche AV, La Porta VL, Silvello D, Dos Santos KG, Andrades ME, Clausell N, Rohde LE, Biolo A. Circulating microRNAs in obese and lean heart failure patients: A case-control study with computational target prediction analysis. Gene 2015. [PMID: 26211628 DOI: 10.1016/j.gene.2015.07.068] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
AIMS MicroRNAs (miRs) regulate processes involved in both cardiac remodeling and obesity. We investigated if the expression of selected miRs in patients with heart failure (HF) is influenced by the presence of obesity. METHODS In this case-control study, we compared plasma levels of miR-21, -130b, -221, -423-5p, and the -221/-130b ratio in 57 age- and gender-matched subjects: 40 HF patients (20 obese HF and 20 lean HF) and 17 lean healthy controls. Body composition was estimated by bioelectrical impedance analysis. MiRs were measured by quantitative reverse transcription-PCR. Bioinformatics analysis was performed based on miRs findings to predict their putative targets and investigate their biological function. RESULTS HF was associated with increased miR-423-5p levels in both lean and obese patients (P<0.05 vs. controls) without differences between HF groups. MiR-130b levels were reduced in obese HF patients compared with HF lean (P=0.036) and controls (P=0.025). MiR-221 levels were non-significantly increased in obese HF patients. MiR-21 levels were not different among the groups. MiR-221/-130b ratio was increased in obese HF patients, and was positively associated with body fat percentage (r=0.43; P=0.002), body mass index (r=0.44; P=0.002), and waist circumference (r=0.40; P=0.020). Computational prediction of target genes followed by functional enrichment analysis indicated a relevant role of miR-130b and miR-221 in modulating the expression of genes associated to cardiovascular and endocrine diseases, and suggested their influence in important signaling mechanisms and in numerous processes related to the circulatory and endocrine systems. CONCLUSIONS In HF patients, the presence of obesity is associated with a differential expression of selected miRs and the miR-221/-130b ratio had significant correlations with adiposity parameters. Computational target prediction analysis identified several interrelated pathways targeted by miR-130b and miR-221 with a known relationship with endocrine and cardiovascular diseases, representing potential mechanisms to be further validated.
Collapse
Affiliation(s)
- Juliana Gil Thomé
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Mariana Recamonde Mendoza
- Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Amanda Veiga Cheuiche
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Vanessa Laubert La Porta
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil; Heart Failure and Cardiac Transplant Unit, Cardiology Division, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Daiane Silvello
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Kátia Gonçalves Dos Santos
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil; Laboratory of Human Molecular Genetics, Universidade Luterana do Brasil, Canoas, RS, Brazil
| | - Michael Everton Andrades
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Nadine Clausell
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil; Heart Failure and Cardiac Transplant Unit, Cardiology Division, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Luis Eduardo Rohde
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil; Heart Failure and Cardiac Transplant Unit, Cardiology Division, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Andréia Biolo
- Post-Graduate Program in Cardiology and Cardiovascular Sciences, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Experimental and Molecular Cardiovascular Laboratory, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil; Heart Failure and Cardiac Transplant Unit, Cardiology Division, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil.
| |
Collapse
|
7
|
Belle A, Thiagarajan R, Soroushmehr SMR, Navidi F, Beard DA, Najarian K. Big Data Analytics in Healthcare. BIOMED RESEARCH INTERNATIONAL 2015; 2015:370194. [PMID: 26229957 PMCID: PMC4503556 DOI: 10.1155/2015/370194] [Citation(s) in RCA: 119] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/05/2015] [Revised: 05/26/2015] [Accepted: 06/16/2015] [Indexed: 02/06/2023]
Abstract
The rapidly expanding field of big data analytics has started to play a pivotal role in the evolution of healthcare practices and research. It has provided tools to accumulate, manage, analyze, and assimilate large volumes of disparate, structured, and unstructured data produced by current healthcare systems. Big data analytics has been recently applied towards aiding the process of care delivery and disease exploration. However, the adoption rate and research development in this space is still hindered by some fundamental problems inherent within the big data paradigm. In this paper, we discuss some of these major challenges with a focus on three upcoming and promising areas of medical research: image, signal, and genomics based analytics. Recent research which targets utilization of large volumes of medical data while combining multimodal data from disparate sources is discussed. Potential areas of research within this field which have the ability to provide meaningful impact on healthcare delivery are also examined.
Collapse
Affiliation(s)
- Ashwin Belle
- Emergency Medicine Department, University of Michigan, Ann Arbor, MI 48109, USA
- University of Michigan Center for Integrative Research in Critical Care (MCIRCC), Ann Arbor, MI 48109, USA
| | - Raghuram Thiagarajan
- Department of Molecular and Integrative Physiology, University of Michigan, Ann Arbor, MI 48109, USA
| | - S. M. Reza Soroushmehr
- Emergency Medicine Department, University of Michigan, Ann Arbor, MI 48109, USA
- University of Michigan Center for Integrative Research in Critical Care (MCIRCC), Ann Arbor, MI 48109, USA
| | - Fatemeh Navidi
- Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, MI 48109, USA
| | - Daniel A. Beard
- University of Michigan Center for Integrative Research in Critical Care (MCIRCC), Ann Arbor, MI 48109, USA
- Department of Molecular and Integrative Physiology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Kayvan Najarian
- Emergency Medicine Department, University of Michigan, Ann Arbor, MI 48109, USA
- University of Michigan Center for Integrative Research in Critical Care (MCIRCC), Ann Arbor, MI 48109, USA
| |
Collapse
|