1
|
Jonjaroen V, Jitrakorn S, Charoonnart P, Kaewsaengon P, Thinkohkaew K, Payongsri P, Surarit R, Saksmerprome V, Niamsiri N. Optimizing chitosan nanoparticles for oral delivery of double-stranded RNA in treating white spot disease in shrimp: Key insights and practical implications. Int J Biol Macromol 2025; 290:138970. [PMID: 39706429 DOI: 10.1016/j.ijbiomac.2024.138970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2024] [Revised: 12/11/2024] [Accepted: 12/17/2024] [Indexed: 12/23/2024]
Abstract
Delivering double-stranded RNA (dsRNA) in shrimp is challenging due to the lack of an effective carrier system. This study optimized chitosan nanoparticles (CNs) from two sources-α-chitosan from shrimp and β-chitosan from squid-to encapsulate antiviral dsRNA for oral administration via shrimp feed. Using response surface methodology (RSM), formulations were refined for encapsulation efficiency, particle size, polydispersity index, and zeta potential. Both types of CNs demonstrated high encapsulation efficiency (>95 %), small sizes (<300 nm), and stable zeta potential (>20 mV). Shrimp-derived CNs provided superior RNase protection and controlled release, while squid-derived CNs showed a burst release. Incorporated into feed, both types of CNs remained stable for over a month. Shrimp-derived CNs offered greater dsRNA protection (>70 %) and improved efficacy against white spot syndrome virus (WSSV), significantly reducing mortality. These results position shrimp-derived CNs as promising dsRNA carriers for combating WSSV in shrimp aquaculture.
Collapse
Affiliation(s)
- Veasarach Jonjaroen
- Department of Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand.
| | - Sarocha Jitrakorn
- Center of Excellence for Shrimp Molecular Biology and Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand; National Center of Genetic Engineering and Biotechnology, (BIOTEC), Thailand Science Park, Pathum Thani 12120, Thailand.
| | - Patai Charoonnart
- Center of Excellence for Shrimp Molecular Biology and Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand; National Center of Genetic Engineering and Biotechnology, (BIOTEC), Thailand Science Park, Pathum Thani 12120, Thailand.
| | - Parichart Kaewsaengon
- Center of Excellence for Shrimp Molecular Biology and Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand; National Center of Genetic Engineering and Biotechnology, (BIOTEC), Thailand Science Park, Pathum Thani 12120, Thailand.
| | - Korlid Thinkohkaew
- Department of Material Science, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand.
| | - Panwajee Payongsri
- Department of Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand.
| | - Rudee Surarit
- Department of Oral Biology, Faculty of Dentistry, Mahidol University, Bangkok 10400, Thailand; Faculty of Dentistry, Siam University, Bangkok 10160, Thailand.
| | - Vanvimon Saksmerprome
- Center of Excellence for Shrimp Molecular Biology and Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand; National Center of Genetic Engineering and Biotechnology, (BIOTEC), Thailand Science Park, Pathum Thani 12120, Thailand.
| | - Nuttawee Niamsiri
- Department of Biotechnology, Faculty of Science, Mahidol University, Bangkok 10400, Thailand.
| |
Collapse
|
2
|
Liang J, Tian J, Zhang H, Li H, Chen L. Proteomics: An In-Depth Review on Recent Technical Advances and Their Applications in Biomedicine. Med Res Rev 2025. [PMID: 39789883 DOI: 10.1002/med.22098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 10/11/2024] [Accepted: 12/12/2024] [Indexed: 01/12/2025]
Abstract
Proteins hold pivotal importance since many diseases manifest changes in protein activity. Proteomics techniques provide a comprehensive exploration of protein structure, abundance, and function in biological samples, enabling the holistic characterization of overall changes in organisms. Nowadays, the breadth of emerging methodologies in proteomics is unprecedentedly vast, with constant optimization of technologies in sample processing, data collection, data analysis, and its scope of application is steadily transitioning from the bench to the clinic. Here, we offer an insightful review of the technical developments in proteomics and its applications in biomedicine over the past 5 years. We focus on its profound contributions in profiling disease spectra, discovering new biomarkers, identifying promising drug targets, deciphering alterations in protein conformation, and unearthing protein-protein interactions. Moreover, we summarize the cutting-edge technologies and potential breakthroughs in the proteomics pipeline and provide the principal challenges in proteomics. Based on these, we aspire to broaden the applicability of proteomics and inspire researchers to enhance our understanding of complex biological systems by utilizing such techniques.
Collapse
Affiliation(s)
- Jing Liang
- Wuya College of Innovation, Key Laboratory of Structure-Based Drug Design & Discovery, Ministry of Education, Shenyang Pharmaceutical University, Shenyang, China
| | - Jundan Tian
- Wuya College of Innovation, Key Laboratory of Structure-Based Drug Design & Discovery, Ministry of Education, Shenyang Pharmaceutical University, Shenyang, China
| | - Huadong Zhang
- College of Pharmacy, Institute of Structural Pharmacology & TCM Chemical Biology, Fujian Key Laboratory of Chinese Materia Medica, Fujian University of Traditional Chinese Medicine, Fuzhou, China
| | - Hua Li
- Wuya College of Innovation, Key Laboratory of Structure-Based Drug Design & Discovery, Ministry of Education, Shenyang Pharmaceutical University, Shenyang, China
- College of Pharmacy, Institute of Structural Pharmacology & TCM Chemical Biology, Fujian Key Laboratory of Chinese Materia Medica, Fujian University of Traditional Chinese Medicine, Fuzhou, China
| | - Lixia Chen
- Wuya College of Innovation, Key Laboratory of Structure-Based Drug Design & Discovery, Ministry of Education, Shenyang Pharmaceutical University, Shenyang, China
| |
Collapse
|
3
|
Schumann Y, Gocke A, Neumann JE. Computational Methods for Data Integration and Imputation of Missing Values in Omics Datasets. Proteomics 2025; 25:e202400100. [PMID: 39740174 DOI: 10.1002/pmic.202400100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 11/08/2024] [Accepted: 11/26/2024] [Indexed: 01/02/2025]
Abstract
Molecular profiling of different omic-modalities (e.g., DNA methylomics, transcriptomics, proteomics) in biological systems represents the basis for research and clinical decision-making. Measurement-specific biases, so-called batch effects, often hinder the integration of independently acquired datasets, and missing values further hamper the applicability of typical data processing algorithms. In addition to careful experimental design, well-defined standards in data acquisition and data exchange, the alleviation of these phenomena particularly requires a dedicated data integration and preprocessing pipeline. This review aims to give a comprehensive overview of computational methods for data integration and missing value imputation for omic data analyses. We provide formal definitions for missing value mechanisms and propose a novel statistical taxonomy for batch effects, especially in the presence of missing data. Based on an automated document search and systematic literature review, we describe 32 distinct data integration methods from five main methodological categories, as well as 37 algorithms for missing value imputation from five separate categories. Additionally, this review highlights multiple quantitative evaluation methods to aid researchers in selecting a suitable set of methods for their work. Finally, this work provides an integrated discussion of the relevance of batch effects and missing values in omics with corresponding method recommendations. We then propose a comprehensive three-step workflow from the study conception to final data analysis and deduce perspectives for future research. Eventually, we present a comprehensive flow chart as well as exemplary decision trees to aid practitioners in the selection of specific approaches for imputation and data integration in their studies.
Collapse
Affiliation(s)
- Yannis Schumann
- IT-Department, Deutsches Elektronen-Synchroton DESY, Hamburg, Germany
| | - Antonia Gocke
- Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
- Core Facility Mass Spectrometric Proteomics, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Julia E Neumann
- Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
- Institute of Neuropathology, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| |
Collapse
|
4
|
Heylen D, Pusparum M, Kuliesius J, Wilson J, Park YC, Jamiołkowski J, D’Onofrio V, Valkenborg D, Aerts J, Ertaylan G, Hooyberghs J. Synthetic plasma pool cohort correction for affinity-based proteomics datasets allows multiple study comparison. Brief Bioinform 2024; 26:bbae657. [PMID: 39694815 PMCID: PMC11653412 DOI: 10.1093/bib/bbae657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 11/05/2024] [Accepted: 12/03/2024] [Indexed: 12/20/2024] Open
Abstract
Proteomics stands as the crucial link between genomics and human diseases. Quantitative proteomics provides detailed insights into protein levels, enabling differentiation between distinct phenotypes. OLINK, a biotechnology company from Uppsala, Sweden, offers a targeted, affinity-based protein measurement method called Target 96, which has become prominent in the field of proteomics. The SCALLOP consortium, for instance, contains data from over 70.000 individuals across 45 independent cohort studies, all sampled by OLINK. However, when independent cohorts want to collaborate and quantitatively compare their target 96 protein values, it is currently advised to include 'identical biological bridging' samples in each sampling run to perform a reference sample normalization, correcting technical variations across measurements. Such a 'biological bridging sample' approach requires each of the involved cohorts to resend their biological bridging samples to OLINK to run them all together, which is logistically challenging, costly and time-consuming. Hence alternatives are searched and an evaluation of the current state of the art exposes the need for a more robust method that allows all OLINK Target 96 studies to compare proteomics data accurately and cost-efficiently. To meet these goals we developed the Synthetic Plasma Pool Cohort Correction, the 'SPOC correction' approach, based on the use of an OLINK-composed synthetic plasma sample. The method can easily be implemented in a federated data-sharing context which is illustrated on a sepsis use case.
Collapse
Affiliation(s)
- Dries Heylen
- Data Science Institute, Theory Lab, Hasselt University, 3590 Diepenbeek, Belgium
- Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Murih Pusparum
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Hasselt University, Data Science Institute, 3590 Diepenbeek, Belgium
| | - Jurgis Kuliesius
- Centre for Global Health Research, University of Edinburgh, Edinburgh BioQuarter, Edinburgh EH16 4UX, United Kingdom
| | - Jim Wilson
- Centre for Global Health Research, University of Edinburgh, Edinburgh BioQuarter, Edinburgh EH16 4UX, United Kingdom
- MRC Human Genetics Unit, University of Edinburgh, Western General Hospital, Edinburgh, EH4 2XU, United Kingdom
| | - Young-Chan Park
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Jacek Jamiołkowski
- Department of Population Medicine and Lifestyle Diseases Prevention, Medical University of Bialystok, 15-089 Białystok, Poland
| | - Valentino D’Onofrio
- Center for Vaccinology, Ghent University and Ghent University Hospital, 9000 Ghent, Belgium
| | - Dirk Valkenborg
- Hasselt University, Data Science Institute, 3590 Diepenbeek, Belgium
| | - Jan Aerts
- Augmented Intelligence for Data Analytics (AIDA) Lab Department of Biosystems KU Leuven, Leuven, Belgium
| | - Gökhan Ertaylan
- Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Jef Hooyberghs
- Data Science Institute, Theory Lab, Hasselt University, 3590 Diepenbeek, Belgium
| |
Collapse
|
5
|
Yu Y, Mai Y, Zheng Y, Shi L. Assessing and mitigating batch effects in large-scale omics studies. Genome Biol 2024; 25:254. [PMID: 39363244 PMCID: PMC11447944 DOI: 10.1186/s13059-024-03401-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/23/2024] [Indexed: 10/05/2024] Open
Abstract
Batch effects in omics data are notoriously common technical variations unrelated to study objectives, and may result in misleading outcomes if uncorrected, or hinder biomedical discovery if over-corrected. Assessing and mitigating batch effects is crucial for ensuring the reliability and reproducibility of omics data and minimizing the impact of technical variations on biological interpretation. In this review, we highlight the profound negative impact of batch effects and the urgent need to address this challenging problem in large-scale omics studies. We summarize potential sources of batch effects, current progress in evaluating and correcting them, and consortium efforts aiming to tackle them.
Collapse
Affiliation(s)
- Ying Yu
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China.
| | - Yuanbang Mai
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Yuanting Zheng
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China.
| | - Leming Shi
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China.
- Cancer Institute, Shanghai Cancer Center, Fudan University, Shanghai, China.
- International Human Phenome Institutes (Shanghai), Shanghai, China.
| |
Collapse
|
6
|
Karpov OA, Stotland A, Raedschelders K, Chazarin B, Ai L, Murray CI, Van Eyk JE. Proteomics of the heart. Physiol Rev 2024; 104:931-982. [PMID: 38300522 PMCID: PMC11381016 DOI: 10.1152/physrev.00026.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 12/25/2023] [Accepted: 01/14/2024] [Indexed: 02/02/2024] Open
Abstract
Mass spectrometry-based proteomics is a sophisticated identification tool specializing in portraying protein dynamics at a molecular level. Proteomics provides biologists with a snapshot of context-dependent protein and proteoform expression, structural conformations, dynamic turnover, and protein-protein interactions. Cardiac proteomics can offer a broader and deeper understanding of the molecular mechanisms that underscore cardiovascular disease, and it is foundational to the development of future therapeutic interventions. This review encapsulates the evolution, current technologies, and future perspectives of proteomic-based mass spectrometry as it applies to the study of the heart. Key technological advancements have allowed researchers to study proteomes at a single-cell level and employ robot-assisted automation systems for enhanced sample preparation techniques, and the increase in fidelity of the mass spectrometers has allowed for the unambiguous identification of numerous dynamic posttranslational modifications. Animal models of cardiovascular disease, ranging from early animal experiments to current sophisticated models of heart failure with preserved ejection fraction, have provided the tools to study a challenging organ in the laboratory. Further technological development will pave the way for the implementation of proteomics even closer within the clinical setting, allowing not only scientists but also patients to benefit from an understanding of protein interplay as it relates to cardiac disease physiology.
Collapse
Affiliation(s)
- Oleg A Karpov
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Aleksandr Stotland
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Koen Raedschelders
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Blandine Chazarin
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Lizhuo Ai
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Christopher I Murray
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Jennifer E Van Eyk
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| |
Collapse
|
7
|
Lin A, Torres CM, Hobbs EC, Bardhan J, Aley SB, Spencer CT, Taylor KL, Chiang T. Computational and Systems Biology Advances to Enable Bioagent Agnostic Signatures. Health Secur 2024; 22:130-139. [PMID: 38483337 PMCID: PMC11044874 DOI: 10.1089/hs.2023.0076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2024] Open
Affiliation(s)
- Andy Lin
- Andy Lin, PhD, is a Linus Pauling Distinguished Postdoctoral Fellow; in the National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA
| | - Cameron M. Torres
- Cameron M. Torres is a Graduate Research Assistant and Wieland Fellow, Department of Biological Sciences; at the University of Texas at El Paso, El Paso, TX
| | - Errett C. Hobbs
- Errett C. Hobbs, PhD, is a Data Scientist; in the National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA
| | - Jaydeep Bardhan
- Jaydeep Bardhan, PhD, is a Research Line Manager, Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA
| | - Stephen B. Aley
- Stephen B. Aley, PhD, is a Professor, Biological Sciences, and an Associate Vice President for Research, Sponsored Projects; at the University of Texas at El Paso, El Paso, TX
| | - Charles T. Spencer
- Charles T. Spencer, PhD, is an Associate Professor, Biological Sciences, and Edward and Barbara Brown Egbert Endowed Chair of the Department of Biological Sciences; at the University of Texas at El Paso, El Paso, TX
| | - Karen L. Taylor
- Karen L. Taylor, MS, is a Research Line Manager; in the National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA
| | - Tony Chiang
- Tony Chiang, PhD, is a Data Scientist; in the National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA
| |
Collapse
|
8
|
Lin A, Torres C, Hobbs EC, Bardhan J, Aley S, Spencer CT, Taylor KL, Chiang T. Computational and Systems Biology Advances to Enable Bioagent Agnostic Signatures. ARXIV 2024:arXiv:2310.13898v3. [PMID: 37961741 PMCID: PMC10635321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Enumerated threat agent lists have long driven biodefense priorities. The global SARS-CoV-2 pandemic demonstrated the limitations of searching for known threat agents as compared to a more agnostic approach. Recent technological advances are enabling agent-agnostic biodefense, especially through the integration of multi-modal observations of host-pathogen interactions directed by a human immunological model. Although well-developed technical assays exist for many aspects of human-pathogen interaction, the analytic methods and pipelines to combine and holistically interpret the results of such assays are immature and require further investments to exploit new technologies. In this manuscript, we discuss potential immunologically based bioagent-agnostic approaches and the computational tool gaps the community should prioritize filling.
Collapse
Affiliation(s)
- Andy Lin
- National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA 98109, USA
| | - Cameron Torres
- Department of Biological Sciences, University of Texas at El Paso, El Paso, Texas 79968 USA
| | - Errett C Hobbs
- National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA 98109, USA
| | - Jaydeep Bardhan
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Seattle, WA 98109, USA
| | - Stephen Aley
- Department of Biological Sciences, University of Texas at El Paso, El Paso, Texas 79968 USA
| | - Charles T Spencer
- Department of Biological Sciences, University of Texas at El Paso, El Paso, Texas 79968 USA
| | - Karen L Taylor
- National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA 98109, USA
| | - Tony Chiang
- National Security Directorate, Pacific Northwest National Laboratory, Seattle, WA 98109, USA
- Department of Biological Sciences, University of Texas at El Paso, El Paso, Texas 79968 USA
- Department of Mathematics, University of Washington, Seattle 98102 USA
| |
Collapse
|
9
|
Low S, Zheng H, Liu JJ, Moh A, Ang K, Tang WE, Lim Z, Subramaniam T, Sum CF, Lim SC. Longitudinal profiling and tracking stability in the Singapore study of macro-angiopathy and microvascular reactivity in type 2 diabetes cohort. Diab Vasc Dis Res 2023; 20:14791641231218453. [PMID: 38059349 DOI: 10.1177/14791641231218453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/08/2023] Open
Abstract
INTRODUCTION The Singapore Study of Macro-Angiopathy and microvascular Reactivity in Type 2 Diabetes (SMART2D) is a prospective cohort study which was started in 2011 to investigate the effect of risk factors on vascular function and diabetes-related complications in Asians. We aimed to compare the longitudinal change in risk factors by accounting for batch effect and assess the tracking stability of risk factors over time in patients recruited for SMART2D. In this study, we (1) described batch effect and its extent across a heterogenous range of longitudinal data parameters; (2) mitigated batch effect through statistical approach; and (3) assessed the tracking stability of the risk factors over time. METHODS A total of 2258 patients with type 2 diabetes mellitus (T2DM) were recruited at baseline. The study adopted a three-wave longitudinal design with intervals of 3 years between consecutive waves. The changes in a few selected risk factors were assessed after calibration, assuming patients with similar demographic and anthropometry profile had similar physiology. The tracking pattern of the risk factors was determined with stability coefficients derived from generalised estimating equations. RESULTS The medians of the longitudinal differences in risk factors between the waves were mostly modest at <10%. Larger increases in augmentation index (AI), aortic systolic blood pressure (BP) and aortic mean BP were consistently observed after calibration. The medians of the longitudinal differences in AI, aortic systolic BP and aortic mean BP between the waves were <2% before calibration, but increased slightly to <5% after calibration. Most of the risk factors had moderate to high tracking stability. Muscle mass and serum creatinine were among those with relatively high tracking stability. CONCLUSIONS The longitudinal differences in parameters between the waves were overall modest after calibration, suggesting that calibration may attenuate longitudinal differences inflated by non-biological factors such as systematic drift due to batch effect. Changes of the hemodynamic parameters are robust over time and not entirely attributable to age. Our study also demonstrated moderate to high tracking stability for most of the parameters.
Collapse
Affiliation(s)
- Serena Low
- Diabetes Centre, Admiralty Medical Centre, Singapore
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Huili Zheng
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
| | - Jian-Jun Liu
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
| | - Angela Moh
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
| | - Keven Ang
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
| | - Wern Ee Tang
- National Healthcare Group Polyclinics, Singapore
| | - Ziliang Lim
- National Healthcare Group Polyclinics, Singapore
| | | | - Chee Fang Sum
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
| | - Su Chi Lim
- Diabetes Centre, Admiralty Medical Centre, Singapore
- Clinical Research Unit, Khoo Teck Puat Hospital, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore
| |
Collapse
|
10
|
Yu Y, Zhang N, Mai Y, Ren L, Chen Q, Cao Z, Chen Q, Liu Y, Hou W, Yang J, Hong H, Xu J, Tong W, Dong L, Shi L, Fang X, Zheng Y. Correcting batch effects in large-scale multiomics studies using a reference-material-based ratio method. Genome Biol 2023; 24:201. [PMID: 37674217 PMCID: PMC10483871 DOI: 10.1186/s13059-023-03047-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 05/18/2023] [Indexed: 09/08/2023] Open
Abstract
BACKGROUND Batch effects are notoriously common technical variations in multiomics data and may result in misleading outcomes if uncorrected or over-corrected. A plethora of batch-effect correction algorithms are proposed to facilitate data integration. However, their respective advantages and limitations are not adequately assessed in terms of omics types, the performance metrics, and the application scenarios. RESULTS As part of the Quartet Project for quality control and data integration of multiomics profiling, we comprehensively assess the performance of seven batch effect correction algorithms based on different performance metrics of clinical relevance, i.e., the accuracy of identifying differentially expressed features, the robustness of predictive models, and the ability of accurately clustering cross-batch samples into their own donors. The ratio-based method, i.e., by scaling absolute feature values of study samples relative to those of concurrently profiled reference material(s), is found to be much more effective and broadly applicable than others, especially when batch effects are completely confounded with biological factors of study interests. We further provide practical guidelines for implementing the ratio based approach in increasingly large-scale multiomics studies. CONCLUSIONS Multiomics measurements are prone to batch effects, which can be effectively corrected using ratio-based scaling of the multiomics data. Our study lays the foundation for eliminating batch effects at a ratio scale.
Collapse
Affiliation(s)
- Ying Yu
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Naixin Zhang
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Yuanbang Mai
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Luyao Ren
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Qiaochu Chen
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Zehui Cao
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Qingwang Chen
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Yaqing Liu
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Wanwan Hou
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Jingcheng Yang
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China
- Greater Bay Area Institute of Precision Medicine, Guangzhou, Guangdong, China
| | - Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Joshua Xu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | | | - Leming Shi
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China.
- International Human Phenome Institutes, Shanghai, China.
| | - Xiang Fang
- National Institute of Metrology, Beijing, China.
| | - Yuanting Zheng
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Shanghai Cancer Center, Fudan University, Shanghai, China.
| |
Collapse
|
11
|
Flores JE, Claborne DM, Weller ZD, Webb-Robertson BJM, Waters KM, Bramer LM. Missing data in multi-omics integration: Recent advances through artificial intelligence. Front Artif Intell 2023; 6:1098308. [PMID: 36844425 PMCID: PMC9949722 DOI: 10.3389/frai.2023.1098308] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 01/23/2023] [Indexed: 02/11/2023] Open
Abstract
Biological systems function through complex interactions between various 'omics (biomolecules), and a more complete understanding of these systems is only possible through an integrated, multi-omic perspective. This has presented the need for the development of integration approaches that are able to capture the complex, often non-linear, interactions that define these biological systems and are adapted to the challenges of combining the heterogenous data across 'omic views. A principal challenge to multi-omic integration is missing data because all biomolecules are not measured in all samples. Due to either cost, instrument sensitivity, or other experimental factors, data for a biological sample may be missing for one or more 'omic techologies. Recent methodological developments in artificial intelligence and statistical learning have greatly facilitated the analyses of multi-omics data, however many of these techniques assume access to completely observed data. A subset of these methods incorporate mechanisms for handling partially observed samples, and these methods are the focus of this review. We describe recently developed approaches, noting their primary use cases and highlighting each method's approach to handling missing data. We additionally provide an overview of the more traditional missing data workflows and their limitations; and we discuss potential avenues for further developments as well as how the missing data issue and its current solutions may generalize beyond the multi-omics context.
Collapse
Affiliation(s)
- Javier E. Flores
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| | - Daniel M. Claborne
- Pacific Northwest National Laboratory, Artificial Intelligence and Data Analytics Division, National Security Directorate, Richland, WA, United States
| | - Zachary D. Weller
- Pacific Northwest National Laboratory, Artificial Intelligence and Data Analytics Division, National Security Directorate, Richland, WA, United States
| | - Bobbie-Jo M. Webb-Robertson
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| | - Katrina M. Waters
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| | - Lisa M. Bramer
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| |
Collapse
|