1
|
Grady SK, Peterson KA, Murray SA, Baker EJ, Langston MA, Chesler EJ. A graph theoretical approach to experimental prioritization in genome-scale investigations. Mamm Genome 2024; 35:724-733. [PMID: 39191873 PMCID: PMC11522061 DOI: 10.1007/s00335-024-10066-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 08/14/2024] [Indexed: 08/29/2024]
Abstract
The goal of systems biology is to gain a network level understanding of how gene interactions influence biological states, and ultimately inform upon human disease. Given the scale and scope of systems biology studies, resource constraints often limit researchers when validating genome-wide phenomena and potentially lead to an incomplete understanding of the underlying mechanisms. Further, prioritization strategies are often biased towards known entities (e.g. previously studied genes/proteins with commercially available reagents), and other technical issues that limit experimental breadth. Here, heterogeneous biological information is modeled as an association graph to which a high-performance minimum dominating set solver is applied to maximize coverage across the graph, and thus increase the breadth of experimentation. First, we tested our model on retrieval of existing gene functional annotations and demonstrated that minimum dominating set returns more diverse terms when compared to other computational methods. Next, we utilized our heterogenous network and minimum dominating set solver to assist in the process of identifying understudied genes to be interrogated by the International Mouse Phenotyping Consortium. Using an unbiased algorithmic strategy, poorly studied genes are prioritized from the remaining thousands of genes yet to be characterized. This method is tunable and extensible with the potential to incorporate additional user-defined prioritizing information. The minimum dominating set approach can be applied to any biological network in order to identify a tractable subset of features to test experimentally or to assist in prioritizing candidate genes associated with human disease.
Collapse
Affiliation(s)
- Stephen K Grady
- Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN, USA.
| | | | | | - Erich J Baker
- Department of Computer Science, Baylor University, Waco, TX, USA
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, USA
| | | |
Collapse
|
2
|
Siminea N, Czeizler E, Popescu VB, Petre I, Păun A. Connecting the dots: Computational network analysis for disease insight and drug repurposing. Curr Opin Struct Biol 2024; 88:102881. [PMID: 38991238 DOI: 10.1016/j.sbi.2024.102881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 05/22/2024] [Accepted: 06/19/2024] [Indexed: 07/13/2024]
Abstract
Network biology is a powerful framework for studying the structure, function, and dynamics of biological systems, offering insights into the balance between health and disease states. The field is seeing rapid progress in all of its aspects: data availability, network synthesis, network analytics, and impactful applications in medicine and drug development. We review the most recent and significant results in network biomedicine, with a focus on the latest data, analytics, software resources, and applications in medicine. We also discuss what in our view are the likely directions of impactful development over the next few years.
Collapse
Affiliation(s)
- Nicoleta Siminea
- Faculty of Mathematics and Computer Science, University of Bucharest, Romania; National Institute of Research and Development for Biological Sciences, Romania
| | - Eugen Czeizler
- Faculty of Medicine, University of Helsinki, Finland; National Institute of Research and Development for Biological Sciences, Romania
| | | | - Ion Petre
- Department of Mathematics and Statistics, University of Turku, Finland; National Institute of Research and Development for Biological Sciences, Romania.
| | - Andrei Păun
- Faculty of Mathematics and Computer Science, University of Bucharest, Romania; National Institute of Research and Development for Biological Sciences, Romania.
| |
Collapse
|
3
|
Pan C, Zhang Q, Zhu Y, Kong S, Liu J, Zhang C, Wang F, Zhang X. Module control of network analysis in psychopathology. iScience 2024; 27:110302. [PMID: 39045106 PMCID: PMC11263636 DOI: 10.1016/j.isci.2024.110302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 04/12/2024] [Accepted: 06/14/2024] [Indexed: 07/25/2024] Open
Abstract
The network approach to characterizing psychopathology departs from traditional latent categorical and dimensional approaches. Causal interplay among symptoms contributed to dynamic psychopathology system. Therefore, analyzing the symptom clusters is critical for understanding mental disorders. Furthermore, despite extensive research studying the topological features of symptom networks, the control relationships between symptoms remain largely unclear. Here, we present a novel systematizing concept, module control, to analyze the control principle of the symptom network at a module level. We introduce Module Control Network (MCN) to identify key modules that regulate the network's behavior. By applying our approach to a multivariate psychological dataset, we discover that non-emotional modules, such as sleep-related and stress-related modules, are the primary controlling modules in the symptom network. Our findings indicate that module control can expose central symptom cluster governing psychopathology network, offering novel insights into the underlying mechanisms of mental disorders and individualized approach to psychological interventions.
Collapse
Affiliation(s)
- Chunyu Pan
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, Jiangsu 210024, China
- Northeastern University, Shenyang, Liaoning 110169, China
| | - Quan Zhang
- Vanke School of Public Health, Tsinghua University, Beijing 100084, China
- Institute for Healthy China, Tsinghua University, Beijing 100084, China
| | - Yue Zhu
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, Jiangsu 210024, China
- Functional Brain Imaging Institute of Nanjing Medical University, Nanjing, Jiangsu 210024, China
| | - Shengzhou Kong
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, Jiangsu 210024, China
| | - Juan Liu
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, Jiangsu 210024, China
- Functional Brain Imaging Institute of Nanjing Medical University, Nanjing, Jiangsu 210024, China
| | | | - Fei Wang
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, Jiangsu 210024, China
- Functional Brain Imaging Institute of Nanjing Medical University, Nanjing, Jiangsu 210024, China
- Department of Mental Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Xizhe Zhang
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu 210033, China
| |
Collapse
|
4
|
Grady SK, Dojcsak L, Harville EW, Wallace ME, Vilda D, Donneyong MM, Hood DB, Valdez RB, Ramesh A, Im W, Matthews-Juarez P, Juarez PD, Langston MA. Seminar: Scalable Preprocessing Tools for Exposomic Data Analysis. ENVIRONMENTAL HEALTH PERSPECTIVES 2023; 131:124201. [PMID: 38109119 PMCID: PMC10727037 DOI: 10.1289/ehp12901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 11/22/2023] [Accepted: 11/28/2023] [Indexed: 12/19/2023]
Abstract
BACKGROUND The exposome serves as a popular framework in which to study exposures from chemical and nonchemical stressors across the life course and the differing roles that these exposures can play in human health. As a result, data relevant to the exposome have been used as a resource in the quest to untangle complicated health trajectories and help connect the dots from exposures to adverse outcome pathways. OBJECTIVES The primary aim of this methods seminar is to clarify and review preprocessing techniques critical for accurate and effective external exposomic data analysis. Scalability is emphasized through an application of highly innovative combinatorial techniques coupled with more traditional statistical strategies. The Public Health Exposome is used as an archetypical model. The novelty and innovation of this seminar's focus stem from its methodical, comprehensive treatment of preprocessing and its demonstration of the positive effects preprocessing can have on downstream analytics. DISCUSSION State-of-the-art technologies are described for data harmonization and to mitigate noise, which can stymie downstream interpretation, and to select key exposomic features, without which analytics may lose focus. A main task is the reduction of multicollinearity, a particularly formidable problem that frequently arises from repeated measurements of similar events taken at various times and from multiple sources. Empirical results highlight the effectiveness of a carefully planned preprocessing workflow as demonstrated in the context of more highly concentrated variable lists, improved correlational distributions, and enhanced downstream analytics for latent relationship discovery. The nascent field of exposome science can be characterized by the need to analyze and interpret a complex confluence of highly inhomogeneous spatial and temporal data, which may present formidable challenges to even the most powerful analytical tools. A systematic approach to preprocessing can therefore provide an essential first step in the application of modern computer and data science methods. https://doi.org/10.1289/EHP12901.
Collapse
Affiliation(s)
- Stephen K. Grady
- Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, Tennessee, USA
| | - Levente Dojcsak
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA
| | - Emily W. Harville
- Department Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Maeve E. Wallace
- Department of Social, Behavioral, and Population Sciences, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Dovile Vilda
- Department of Social, Behavioral, and Population Sciences, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | | | - Darryl B. Hood
- Division of Environmental Health Sciences, College of Public Health, Ohio State University, Columbus, Ohio, USA
| | - R. Burciaga Valdez
- Department of Economics, University of New Mexico, Albuquerque, New Mexico, USA
| | - Aramandla Ramesh
- Department of Biochemistry, Cancer Biology, Neuroscience & Pharmacology, Meharry Medical College, Nashville, Tennessee, USA
| | - Wansoo Im
- Department of Family and Community Medicine, Meharry Medical College, Nashville, Tennessee, USA
| | | | - Paul D. Juarez
- Department of Family and Community Medicine, Meharry Medical College, Nashville, Tennessee, USA
- Institute on Health Disparities, Equity, and the Exposome, Meharry Medical College, Nashville, Tennessee, USA
| | - Michael A. Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA
| |
Collapse
|