1
|
Zhang F, Shi C, He Q, Zhu L, Zhao J, Yao W, Loor JJ, Luo J. Integrated analysis of genomics and transcriptomics revealed the genetic basis for goaty flavor formation in goat milk. Genomics 2024; 116:110873. [PMID: 38823464 DOI: 10.1016/j.ygeno.2024.110873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 05/12/2024] [Accepted: 05/29/2024] [Indexed: 06/03/2024]
Abstract
Goat milk exhibits a robust and distinctive "goaty" flavor. However, the underlying genetic basis of goaty flavor remains elusive and requires further elucidation at the genomic level. Through comparative genomics analysis, we identified divergent signatures of certain proteins in goat, sheep, and cow. MMUT has undergone a goat-specific mutation in the B12 binding domain. We observed the goat FASN exhibits nonsynonymous mutations in the acyltransferase domain. Structural variations in these key proteins may enhance the capacity for synthesizing goaty flavor compounds in goat. Integrated omics analysis revealed the catabolism of branched-chain amino acids contributed to the goat milk flavor. Furthermore, we uncovered a regulatory mechanism in which the transcription factor ZNF281 suppresses the expression of the ECHDC1 gene may play a pivotal role in the accumulation of flavor substances in goat milk. These findings provide insights into the genetic basis underlying the formation of goaty flavor in goat milk. STATEMENT OF SIGNIFICANCE: Branched-chain fatty acids (BCFAs) play a crucial role in generating the distinctive "goaty" flavor of goat milk. Whether there is an underlying genetic basis associated with goaty flavor is unknown. To begin deciphering mechanisms of goat milk flavor development, we collected transcriptomic data from mammary tissue of goat, sheep, cow, and buffalo at peak lactation for cross-species transcriptome analysis and downloaded nine publicly available genomes for comparative genomic analysis. Our data indicate that the catabolic pathway of branched-chain amino acids (BCAAs) is under positive selection in the goat genome, and most genes involved in this pathway exhibit significantly higher expression levels in goat mammary tissue compared to other species, which contributes to the development of flavor in goat milk. Furthermore, we have elucidated the regulatory mechanism by which the transcription factor ZNF281 suppresses ECHDC1 gene expression, thereby exerting an important influence on the accumulation of flavor compounds in goat milk. These findings provide insights into the genetic mechanisms underlying flavor formation in goat milk and suggest further research to manipulate the flavor of animal products.
Collapse
Affiliation(s)
- Fuhong Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China
| | - Chenbo Shi
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China
| | - Qiuya He
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China
| | - Lu Zhu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China
| | - Jianqing Zhao
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China
| | - Weiwei Yao
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China
| | - Juan J Loor
- Department of Animal Sciences and Division of Nutritional Sciences, University of Illinois, Urbana, IL 61801, United States of America
| | - Jun Luo
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, PR China.
| |
Collapse
|
2
|
el Bouhaddani S, Höllerhage M, Uh HW, Moebius C, Bickle M, Höglinger G, Houwing-Duistermaat J. Statistical integration of multi-omics and drug screening data from cell lines. PLoS Comput Biol 2024; 20:e1011809. [PMID: 38295113 PMCID: PMC10878536 DOI: 10.1371/journal.pcbi.1011809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 02/20/2024] [Accepted: 01/08/2024] [Indexed: 02/02/2024] Open
Abstract
Data integration methods are used to obtain a unified summary of multiple datasets. For multi-modal data, we propose a computational workflow to jointly analyze datasets from cell lines. The workflow comprises a novel probabilistic data integration method, named POPLS-DA, for multi-omics data. The workflow is motivated by a study on synucleinopathies where transcriptomics, proteomics, and drug screening data are measured in affected LUHMES cell lines and controls. The aim is to highlight potentially druggable pathways and genes involved in synucleinopathies. First, POPLS-DA is used to prioritize genes and proteins that best distinguish cases and controls. For these genes, an integrated interaction network is constructed where the drug screen data is incorporated to highlight druggable genes and pathways in the network. Finally, functional enrichment analyses are performed to identify clusters of synaptic and lysosome-related genes and proteins targeted by the protective drugs. POPLS-DA is compared to other single- and multi-omics approaches. We found that HSPA5, a member of the heat shock protein 70 family, was one of the most targeted genes by the validated drugs, in particular by AT1-blockers. HSPA5 and AT1-blockers have been previously linked to α-synuclein pathology and Parkinson's disease, showing the relevance of our findings. Our computational workflow identified new directions for therapeutic targets for synucleinopathies. POPLS-DA provided a larger interpretable gene set than other single- and multi-omic approaches. An implementation based on R and markdown is freely available online.
Collapse
Affiliation(s)
| | | | - Hae-Won Uh
- Dept. Data science & Biostatistics, UMC Utrecht, Utrecht, Netherlands
| | - Claudia Moebius
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Marc Bickle
- Roche Institute for Translational Bioengineering, Basel, Switzerland
| | - Günter Höglinger
- Department of Neurology, Hannover Medical School, Hannover, Germany
- Department of Neurology, Ludwig-Maximilians-Universität, Munich, Germany
- German Center for Neurodegenerative Diseases, Munich, Germany
- Munich Cluster for Systems Neurology (SyNergy), Munich, Germany
| | - Jeanine Houwing-Duistermaat
- Dept. Data science & Biostatistics, UMC Utrecht, Utrecht, Netherlands
- Dept. of Mathematics, Radboud University, Nijmegen, Netherlands
| |
Collapse
|
3
|
Abbas-Aghababazadeh F, Xu W, Haibe-Kains B. The impact of violating the independence assumption in meta-analysis on biomarker discovery. Front Genet 2023; 13:1027345. [PMID: 36726714 PMCID: PMC9885264 DOI: 10.3389/fgene.2022.1027345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 11/25/2022] [Indexed: 01/06/2023] Open
Abstract
With rapid advancements in high-throughput sequencing technologies, massive amounts of "-omics" data are now available in almost every biomedical field. Due to variance in biological models and analytic methods, findings from clinical and biological studies are often not generalizable when tested in independent cohorts. Meta-analysis, a set of statistical tools to integrate independent studies addressing similar research questions, has been proposed to improve the accuracy and robustness of new biological insights. However, it is common practice among biomarker discovery studies using preclinical pharmacogenomic data to borrow molecular profiles of cancer cell lines from one study to another, creating dependence across studies. The impact of violating the independence assumption in meta-analyses is largely unknown. In this study, we review and compare different meta-analyses to estimate variations across studies along with biomarker discoveries using preclinical pharmacogenomics data. We further evaluate the performance of conventional meta-analysis where the dependence of the effects was ignored via simulation studies. Results show that, as the number of non-independent effects increased, relative mean squared error and lower coverage probability increased. Additionally, we also assess potential bias in the estimation of effects for established meta-analysis approaches when data are duplicated and the assumption of independence is violated. Using pharmacogenomics biomarker discovery, we find that treating dependent studies as independent can substantially increase the bias of meta-analyses. Importantly, we show that violating the independence assumption decreases the generalizability of the biomarker discovery process and increases false positive results, a key challenge in precision oncology.
Collapse
Affiliation(s)
| | - Wei Xu
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada,Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Benjamin Haibe-Kains
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada,Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada,Ontario Institute for Cancer Research, Toronto, ON, Canada,Department of Computer Science, University of Toronto, Toronto, ON, Canada,*Correspondence: Benjamin Haibe-Kains,
| |
Collapse
|
4
|
Jafari M, Mirzaie M, Bao J, Barneh F, Zheng S, Eriksson J, Heckman CA, Tang J. Bipartite network models to design combination therapies in acute myeloid leukaemia. Nat Commun 2022; 13:2128. [PMID: 35440130 PMCID: PMC9018865 DOI: 10.1038/s41467-022-29793-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 03/30/2022] [Indexed: 12/20/2022] Open
Abstract
Combination therapy is preferred over single-targeted monotherapies for cancer treatment due to its efficiency and safety. However, identifying effective drug combinations costs time and resources. We propose a method for identifying potential drug combinations by bipartite network modelling of patient-related drug response data, specifically the Beat AML dataset. The median of cell viability is used as a drug potency measurement to reconstruct a weighted bipartite network, model drug-biological sample interactions, and find the clusters of nodes inside two projected networks. Then, the clustering results are leveraged to discover effective multi-targeted drug combinations, which are also supported by more evidence using GDSC and ALMANAC databases. The potency and synergy levels of selective drug combinations are corroborated against monotherapy in three cell lines for acute myeloid leukaemia in vitro. In this study, we introduce a nominal data mining approach to improving acute myeloid leukaemia treatment through combinatorial therapy. Identifying effective drug combinations to treat cancer is a challenging task, either experimentally or computationally. Here, the authors develop a bipartite network modelling approach to propose drug combination strategies in acute myeloid leukaemia using patient and cell line drug screening data.
Collapse
Affiliation(s)
- Mohieddin Jafari
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
| | - Mehdi Mirzaie
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Jie Bao
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Farnaz Barneh
- Prinses Maxima Center for Pediatric Oncology, 3584 CS Utrecht, Utrech, the Netherlands
| | - Shuyu Zheng
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Johanna Eriksson
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Caroline A Heckman
- Institute for Molecular Medicine Finland - FIMM, HiLIFE - Helsinki Institute of Life Science, iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Jing Tang
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
5
|
Abstract
Multi-omics data analysis is an important aspect of cancer molecular biology studies and has led to ground-breaking discoveries. Many efforts have been made to develop machine learning methods that automatically integrate omics data. Here, we review machine learning tools categorized as either general-purpose or task-specific, covering both supervised and unsupervised learning for integrative analysis of multi-omics data. We benchmark the performance of five machine learning approaches using data from the Cancer Cell Line Encyclopedia, reporting accuracy on cancer type classification and mean absolute error on drug response prediction, and evaluating runtime efficiency. This review provides recommendations to researchers regarding suitable machine learning method selection for their specific applications. It should also promote the development of novel machine learning methodologies for data integration, which will be essential for drug discovery, clinical trial design, and personalized treatments. Featuring a balance of both biological and technical content Categorizing the reviewed tools into general purpose and task-specific Performing an independent benchmarking analysis using a publicly available dataset
Collapse
Affiliation(s)
- Zhaoxiang Cai
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, 214 Hawkesbury Rd, Westmead, NSW 2145, Australia
| | - Rebecca C Poulos
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, 214 Hawkesbury Rd, Westmead, NSW 2145, Australia
| | - Jia Liu
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, 214 Hawkesbury Rd, Westmead, NSW 2145, Australia.,Faculty of Medicine, Western Sydney University, Campbelltown, NSW, Australia
| | - Qing Zhong
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, 214 Hawkesbury Rd, Westmead, NSW 2145, Australia
| |
Collapse
|
6
|
Dewulf JP, Paquay S, Marbaix E, Achouri Y, Van Schaftingen E, Bommer GT. ECHDC1 knockout mice accumulate ethyl-branched lipids and excrete abnormal intermediates of branched-chain fatty acid metabolism. J Biol Chem 2021; 297:101083. [PMID: 34419447 PMCID: PMC8473548 DOI: 10.1016/j.jbc.2021.101083] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 08/11/2021] [Accepted: 08/13/2021] [Indexed: 12/15/2022] Open
Abstract
The cytosolic enzyme ethylmalonyl-CoA decarboxylase (ECHDC1) decarboxylates ethyl- or methyl-malonyl-CoA, two side products of acetyl-CoA carboxylase. These CoA derivatives can be used to synthesize a subset of branched-chain fatty acids (FAs). We previously found that ECHDC1 limits the synthesis of these abnormal FAs in cell lines, but its effects in vivo are unknown. To further evaluate the effects of ECHDC1 deficiency, we generated knockout mice. These mice were viable, fertile, showed normal postnatal growth, and lacked obvious macroscopic and histologic changes. Surprisingly, tissues from wild-type mice already contained methyl-branched FAs due to methylmalonyl-CoA incorporation, but these FAs were only increased in the intraorbital glands of ECHDC1 knockout mice. In contrast, ECHDC1 knockout mice accumulated 16–20-carbon FAs carrying ethyl-branches in all tissues, which were undetectable in wild-type mice. Ethyl-branched FAs were incorporated into different lipids, including acylcarnitines, phosphatidylcholines, plasmanylcholines, and triglycerides. Interestingly, we found a variety of unusual glycine-conjugates in the urine of knockout mice, which included adducts of ethyl-branched compounds in different stages of oxidation. This suggests that the excretion of potentially toxic intermediates of branched-chain FA metabolism might prevent a more dramatic phenotype in these mice. Curiously, ECHDC1 knockout mice also accumulated 2,2-dimethylmalonyl-CoA. This indicates that the broad specificity of ECHDC1 might help eliminate a variety of potentially dangerous branched-chain dicarboxylyl-CoAs. We conclude that ECHDC1 prevents the formation of ethyl-branched FAs and that urinary excretion of glycine-conjugates allows mice to eliminate potentially deleterious intermediates of branched-chain FA metabolism.
Collapse
Affiliation(s)
- Joseph P Dewulf
- Department of Biochemistry, de Duve Institute, UCLouvain, Brussels, Belgium; Walloon Excellence in Lifesciences and Biotechnology (WELBIO), Brussels, Belgium; Department of Laboratory Medicine, University Hospital St Luc, UCLouvain, Bruxelles, Belgium.
| | - Stéphanie Paquay
- Department of Biochemistry, de Duve Institute, UCLouvain, Brussels, Belgium; Walloon Excellence in Lifesciences and Biotechnology (WELBIO), Brussels, Belgium; Department of Neuropediatrics, University Hospital St Luc, UCLouvain, Bruxelles, Belgium
| | - Etienne Marbaix
- Department of Anatomical Pathology, University Hospital St Luc, UCLouvain, Bruxelles, Belgium; Department of Cell Biology, de Duve Institute, UCLouvain, Bruxelles, Belgium
| | - Younès Achouri
- Transgenesis Platform, de Duve Institute, UCLouvain, Bruxelles, Belgium
| | - Emile Van Schaftingen
- Department of Biochemistry, de Duve Institute, UCLouvain, Brussels, Belgium; Walloon Excellence in Lifesciences and Biotechnology (WELBIO), Brussels, Belgium.
| | - Guido T Bommer
- Department of Biochemistry, de Duve Institute, UCLouvain, Brussels, Belgium; Walloon Excellence in Lifesciences and Biotechnology (WELBIO), Brussels, Belgium.
| |
Collapse
|