1
|
Taiwo OR, Onyeaka H, Oladipo EK, Oloke JK, Chukwugozie DC. Advancements in Predictive Microbiology: Integrating New Technologies for Efficient Food Safety Models. Int J Microbiol 2024; 2024:6612162. [PMID: 38799770 PMCID: PMC11126350 DOI: 10.1155/2024/6612162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 04/01/2024] [Accepted: 04/23/2024] [Indexed: 05/29/2024] Open
Abstract
Predictive microbiology is a rapidly evolving field that has gained significant interest over the years due to its diverse application in food safety. Predictive models are widely used in food microbiology to estimate the growth of microorganisms in food products. These models represent the dynamic interactions between intrinsic and extrinsic food factors as mathematical equations and then apply these data to predict shelf life, spoilage, and microbial risk assessment. Due to their ability to predict the microbial risk, these tools are also integrated into hazard analysis critical control point (HACCP) protocols. However, like most new technologies, several limitations have been linked to their use. Predictive models have been found incapable of modeling the intricate microbial interactions in food colonized by different bacteria populations under dynamic environmental conditions. To address this issue, researchers are integrating several new technologies into predictive models to improve efficiency and accuracy. Increasingly, newer technologies such as whole genome sequencing (WGS), metagenomics, artificial intelligence, and machine learning are being rapidly adopted into newer-generation models. This has facilitated the development of devices based on robotics, the Internet of Things, and time-temperature indicators that are being incorporated into food processing both domestically and industrially globally. This study reviewed current research on predictive models, limitations, challenges, and newer technologies being integrated into developing more efficient models. Machine learning algorithms commonly employed in predictive modeling are discussed with emphasis on their application in research and industry and their advantages over traditional models.
Collapse
Affiliation(s)
| | - Helen Onyeaka
- School of Chemical Engineering, University of Birmingham, Edgbaston B15 2TT, Birmingham, UK
| | - Elijah K. Oladipo
- Genomics Unit, Helix Biogen Institute, Ogbomosho, Oyo, Nigeria
- Department of Microbiology, Laboratory of Molecular Biology, Immunology and Bioinformatics, Adeleke University, Ede, Osun, Nigeria
| | - Julius Kola Oloke
- Department of Natural Science, Microbiology Unit, Precious Cornerstone University, Ibadan, Oyo, Nigeria
| | | |
Collapse
|
2
|
Zhao C, Su KJ, Wu C, Cao X, Sha Q, Li W, Luo Z, Qin T, Qiu C, Zhao LJ, Liu A, Jiang L, Zhang X, Shen H, Zhou W, Deng HW. Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics. ARXIV 2024:arXiv:2310.07990v2. [PMID: 37873011 PMCID: PMC10593076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
BACKGROUND Missing data is a common challenge in mass spectrometry-based metabolomics, which can lead to biased and incomplete analyses. The integration of whole-genome sequencing (WGS) data with metabolomics data has emerged as a promising approach to enhance the accuracy of data imputation in metabolomics studies. METHOD In this study, we propose a novel method that leverages the information from WGS data and reference metabolites to impute unknown metabolites. Our approach utilizes a multi-view variational autoencoder to jointly model the burden score, polygenetic risk score (PGS), and linkage disequilibrium (LD) pruned single nucleotide polymorphisms (SNPs) for feature extraction and missing metabolomics data imputation. By learning the latent representations of both omics data, our method can effectively impute missing metabolomics values based on genomic information. RESULTS We evaluate the performance of our method on empirical metabolomics datasets with missing values and demonstrate its superiority compared to conventional imputation techniques. Using 35 template metabolites derived burden scores, PGS and LD-pruned SNPs, the proposed methods achieved R^2-scores > 0.01 for 71.55% of metabolites. CONCLUSION The integration of WGS data in metabolomics imputation not only improves data completeness but also enhances downstream analyses, paving the way for more comprehensive and accurate investigations of metabolic pathways and disease associations. Our findings offer valuable insights into the potential benefits of utilizing WGS data for metabolomics data imputation and underscore the importance of leveraging multi-modal data integration in precision medicine research.
Collapse
|
3
|
Wang H, Bennett DA, De Jager PL, Zhang QY, Zhang HY. Genome-wide epistasis analysis for Alzheimer's disease and implications for genetic risk prediction. Alzheimers Res Ther 2021; 13:55. [PMID: 33663605 PMCID: PMC7934265 DOI: 10.1186/s13195-021-00794-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 02/22/2021] [Indexed: 12/24/2022]
Abstract
BACKGROUND Single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies only explain part of the heritability of Alzheimer's disease (AD). Epistasis has been considered as one of the main causes of "missing heritability" in AD. METHODS We performed genome-wide epistasis screening (N = 10,389) for the clinical diagnosis of AD using three popularly adopted methods. Subsequent analyses were performed to eliminate spurious associations caused by possible confounding factors. Then, candidate genetic interactions were examined for their co-expression in the brains of AD patients and analyzed for their association with intermediate AD phenotypes. Moreover, a new approach was developed to compile the epistasis risk factors into an epistasis risk score (ERS) based on multifactor dimensional reduction. Two independent datasets were used to evaluate the feasibility of ERSs in AD risk prediction. RESULTS We identified 2 candidate genetic interactions with PFDR < 0.05 (RAMP3-SEMA3A and NSMCE1-DGKE/C17orf67) and another 5 genetic interactions with PFDR < 0.1. Co-expression between the identified interactions supported the existence of possible biological interactions underlying the observed statistical significance. Further association of candidate interactions with intermediate phenotypes helps explain the mechanisms of neuropathological alterations involved in AD. Importantly, we found that ERSs can identify high-risk individuals showing earlier onset of AD. Combined risk scores of SNPs and SNP-SNP interactions showed slightly but steadily increased AUC in predicting the clinical status of AD. CONCLUSIONS In summary, we performed a genome-wide epistasis analysis to identify novel genetic interactions potentially implicated in AD. We found that ERS can serve as an indicator of the genetic risk of AD.
Collapse
Affiliation(s)
- Hui Wang
- Huazhong Agricultural University, College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, No.1 Shizishan Street, Hongshan District, Wuhan, 430070, Hubei, China
| | - David A Bennett
- Rush University Medical Center, Rush Alzheimer's Disease Center, Chicago, IL, USA
- Rush University Medical Center, Department of Neurological Sciences, Chicago, IL, USA
| | - Philip L De Jager
- Columbia University Medical Center, Center for Translational and Computational Neuroimmunology, New York, NY, USA
- Broad Institute, Cell Circuits Program, Cambridge, MA, USA
| | - Qing-Ye Zhang
- Huazhong Agricultural University, College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, No.1 Shizishan Street, Hongshan District, Wuhan, 430070, Hubei, China
| | - Hong-Yu Zhang
- Huazhong Agricultural University, College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, No.1 Shizishan Street, Hongshan District, Wuhan, 430070, Hubei, China.
| |
Collapse
|
4
|
Fernández-Torres J, Martínez-Nava GA, Zamudio-Cuevas Y, Martínez-Flores K, Gutiérrez-Ruíz MC, Gómez-Quiroz LE, Garrido-Rodríguez D, Muñoz-Valle JF, Oregón-Romero E, Lozada C, Cornejo DC, Pineda C, López-Reyes A. Impact of the gene-gene interactions related to the HIF-1α signaling pathway with the knee osteoarthritis development. Clin Rheumatol 2019; 38:2897-2907. [PMID: 31236747 DOI: 10.1007/s10067-019-04635-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Revised: 05/03/2019] [Accepted: 06/04/2019] [Indexed: 12/20/2022]
Abstract
INTRODUCTION/OBJECTIVES Articular cartilage is the target tissue of osteoarthritis (OA), and because it lacks capillary networks, the microenvironment is hypoxic. Hypoxia inducible factor-1 alpha (HIF-1α) regulates the homeostasis of this tissue. The aim of this study was to investigate whether genetic polymorphisms of the HIF-1α signaling pathway are involved in the development of knee OA. METHOD We performed a case-control association study and genotyped 134 knee OA patients and 267 healthy controls. All participants were genotyped in order to evaluate 42 SNPs from 22 genes involved in the HIF-1α signaling pathway using the OpenArray technology. Gene-gene interactions (epistasis) were analyzed using the multifactor dimensionality reduction (MDR) method. RESULTS The MDR analysis showed epistasis between AKT2 (rs8100018) and IGF1 (rs2288377), AKT2 (rs8100018) and IGF1 (rs35767), IGF1 (rs35767) and COL2A1 (rs1793953), and between GSK3B (rs6438552) and IGF1 (rs35767) polymorphisms, with information gain values of 21.24%, 8.37%, 9.93%, and 5.73%, respectively. Additionally, our model allowed us to identify high- and low-risk genotypes among COL2A1 rs1793953, GSK3B rs6438552, AKT2 rs8100018, and IGF1 rs35767 polymorphisms. CONCLUSIONS Knowing the interactions of these polymorphisms involved in HIF-1α signaling pathway could provide a new diagnostic support tool to identify individuals at high risk of developing knee OA.
Collapse
Affiliation(s)
- Javier Fernández-Torres
- Synovial Fluid Laboratory, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico.,Biological and Health Sciences PhD Program, Universidad Autónoma Metropolitana Iztapalapa, Mexico City, Mexico
| | | | - Yessica Zamudio-Cuevas
- Synovial Fluid Laboratory, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico
| | - Karina Martínez-Flores
- Synovial Fluid Laboratory, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico
| | | | | | - Daniela Garrido-Rodríguez
- Center for Research in Infectious Diseases, National Institute of Respiratory Diseases, Mexico City, Mexico
| | - José Francisco Muñoz-Valle
- Departamento de Biología Molecular y Genómica, Instituto de Investigación en Ciencias Biomédicas (IICB), Universidad de Guadalajara, Guadalajara, Mexico
| | - Edith Oregón-Romero
- Departamento de Biología Molecular y Genómica, Instituto de Investigación en Ciencias Biomédicas (IICB), Universidad de Guadalajara, Guadalajara, Mexico
| | - Carlos Lozada
- Rheumatology Service, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico
| | - Denise Clavijo Cornejo
- Musculoskeletal and Rheumatic Diseases Division, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico
| | - Carlos Pineda
- Musculoskeletal and Rheumatic Diseases Division, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico
| | - Alberto López-Reyes
- Synovial Fluid Laboratory, Instituto Nacional de Rehabilitación "Luis Guillermo Ibarra Ibarra", Mexico City, Mexico.
| |
Collapse
|
5
|
Rodriguez N, Rojas-Galeano S. Discovering feature relevancy and dependency by kernel-guided probabilistic model-building evolution. BioData Min 2017; 10:12. [PMID: 28331548 PMCID: PMC5353680 DOI: 10.1186/s13040-017-0131-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Accepted: 02/14/2017] [Indexed: 11/18/2022] Open
Abstract
Background Discovering relevant features (biomarkers) that discriminate etiologies of a disease is useful to provide biomedical researchers with candidate targets for further laboratory experimentation while saving costs; dependencies among biomarkers may suggest additional valuable information, for example, to characterize complex epistatic relationships from genetic data. The use of classifiers to guide the search for biomarkers (the so–called wrapper approach) has been widely studied. However, simultaneously searching for relevancy and dependencies among markers is a less explored ground. Results We propose a new wrapper method that builds upon the discrimination power of a weighted kernel classifier to guide the search for a probabilistic model of simultaneous marginal and interacting effects. The feasibility of the method was evaluated in three empirical studies. The first one assessed its ability to discover complex epistatic effects on a large–scale testbed of generated human genetic problems; the method succeeded in 4 out of 5 of these problems while providing more accurate and expressive results than a baseline technique that also considers dependencies. The second study evaluated the performance of the method in benchmark classification tasks; in average the prediction accuracy was comparable to two other baseline techniques whilst finding smaller subsets of relevant features. The last study was aimed at discovering relevancy/dependency in a hepatitis dataset; in this regard, evidence recently reported in medical literature corroborated our findings. As a byproduct, the method was implemented and made freely available as a toolbox of software components deployed within an existing visual data–mining workbench. Conclusions The mining advantages exhibited by the method come at the expense of a higher computational complexity, posing interesting algorithmic challenges regarding its applicability to large–scale datasets. Extending the probabilistic assumptions of the method to continuous distributions and higher–degree interactions is also appealing. As a final remark, we advocate broadening the use of visual graphical software tools as they enable biodata researchers to focus on experiment design, visualisation and data analysis rather than on refining their scripting programming skills. Electronic supplementary material The online version of this article (doi:10.1186/s13040-017-0131-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nestor Rodriguez
- Universidad Distrital FJC, School of Engineering, Bogota, Colombia
| | | |
Collapse
|
6
|
Gola D, Mahachie John JM, van Steen K, König IR. A roadmap to multifactor dimensionality reduction methods. Brief Bioinform 2015; 17:293-308. [PMID: 26108231 PMCID: PMC4793893 DOI: 10.1093/bib/bbv038] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Indexed: 02/02/2023] Open
Abstract
Complex diseases are defined to be determined by multiple genetic and environmental factors alone as well as in interactions. To analyze interactions in genetic data, many statistical methods have been suggested, with most of them relying on statistical regression models. Given the known limitations of classical methods, approaches from the machine-learning community have also become attractive. From this latter family, a fast-growing collection of methods emerged that are based on the Multifactor Dimensionality Reduction (MDR) approach. Since its first introduction, MDR has enjoyed great popularity in applications and has been extended and modified multiple times. Based on a literature search, we here provide a systematic and comprehensive overview of these suggested methods. The methods are described in detail, and the availability of implementations is listed. Most recent approaches offer to deal with large-scale data sets and rare variants, which is why we expect these methods to even gain in popularity.
Collapse
|
7
|
Abstract
The treatment of juvenile idiopathic arthritis (JIA) has substantially evolved over the past two decades. Research has been conducted and is ongoing on how therapies can best be utilized either as monotherapy or in combination for enhanced efficacy. The introduction of biologic therapies that selectively target specific cytokines has changed the acceptable clinical course of childhood arthritis. In addition to the development and utilization of new therapeutic agents, the pediatric rheumatology community has made vital progress toward defining disease activity, developing validated outcome measures, and establishing collaborative networks to assess both clinical outcomes and the long-term side effects related to therapeutics for juvenile arthritis. In this chapter, we will discuss the therapeutic evolution in JIA over the past two decades. Although the largest strides have been made with biologic agents, and these newer drugs have more rigorous data to support their use, select commonly used non-biologic therapies are included, with the discussion focused on more recent updated literature.
Collapse
Affiliation(s)
- Elizabeth A Kessler
- Division of Rheumatology, Department of Pediatrics, Children's Mercy Hospitals and Clinics, 2401 Gillham Road, Kansas City, MO 64108, USA.
| | - Mara L Becker
- Division of Rheumatology, Department of Pediatrics, Children's Mercy Hospitals and Clinics, 2401 Gillham Road, Kansas City, MO 64108, USA.
| |
Collapse
|
8
|
Li CF, Luo FT, Zeng YX, Jia WH. Weighted risk score-based multifactor dimensionality reduction to detect gene-gene interactions in nasopharyngeal carcinoma. Int J Mol Sci 2014; 15:10724-37. [PMID: 24933637 PMCID: PMC4100176 DOI: 10.3390/ijms150610724] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Revised: 04/21/2014] [Accepted: 06/03/2014] [Indexed: 12/02/2022] Open
Abstract
Determining the complex relationships between diseases, polymorphisms in human genes and environmental factors is challenging. Multifactor dimensionality reduction (MDR) has been proven to be capable of effectively detecting the statistical patterns of epistasis, although classification accuracy is required for this approach. The imbalanced dataset can cause seriously negative effects on classification accuracy. Moreover, MDR methods cannot quantitatively assess the disease risk of genotype combinations. Hence, we introduce a novel weighted risk score-based multifactor dimensionality reduction (WRSMDR) method that uses the Bayesian posterior probability of polymorphism combinations as a new quantitative measure of disease risk. First, we compared the WRSMDR to the MDR method in simulated datasets. Our results showed that the WRSMDR method had reasonable power to identify high-order gene-gene interactions, and it was more effective than MDR at detecting four-locus models. Moreover, WRSMDR reveals more information regarding the effect of genotype combination on the disease risk, and the result was easier to determine and apply than with MDR. Finally, we applied WRSMDR to a nasopharyngeal carcinoma (NPC) case-control study and identified a statistically significant high-order interaction among three polymorphisms: rs2860580, rs11865086 and rs2305806.
Collapse
Affiliation(s)
- Chao-Feng Li
- Department of Medical Statistics and Epidemiology, School of Public Health, Sun Yat-sen University, Guangzhou 510080, China.
| | - Fu-Tian Luo
- Department of Medical Statistics and Epidemiology, School of Public Health, Sun Yat-sen University, Guangzhou 510080, China.
| | - Yi-Xin Zeng
- State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou 510060, China.
| | - Wei-Hua Jia
- State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou 510060, China.
| |
Collapse
|