Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Acharjee A, Kloosterman B, Visser RGF, Maliepaard C. Integration of multi-omics data for prediction of phenotypic traits using random forest. BMC Bioinformatics 2016;17 Suppl 5:180. [PMID: 27295212 PMCID: PMC4905610 DOI: 10.1186/s12859-016-1043-4] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

For:	Acharjee A, Kloosterman B, Visser RGF, Maliepaard C. Integration of multi-omics data for prediction of phenotypic traits using random forest. BMC Bioinformatics 2016;17 Suppl 5:180. [PMID: 27295212 PMCID: PMC4905610 DOI: 10.1186/s12859-016-1043-4] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

Number

Cited by Other Article(s)

Shi Y, Fang J, Li J, Yu K, Zhu J, Lu Y. Fracture risk prediction in diabetes patients based on Lasso feature selection and Machine Learning. Comput Methods Biomech Biomed Engin 2024:1-17. [PMID: 39257307 DOI: 10.1080/10255842.2024.2400325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 08/12/2024] [Accepted: 08/21/2024] [Indexed: 09/12/2024]

Rönn T, Perfilyev A, Oskolkov N, Ling C. Predicting type 2 diabetes via machine learning integration of multiple omics from human pancreatic islets. Sci Rep 2024;14:14637. [PMID: 38918439 PMCID: PMC11199577 DOI: 10.1038/s41598-024-64846-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 06/13/2024] [Indexed: 06/27/2024] Open

Meng Y, Davison J, Clarke JT, Zobel M, Gerz M, Moora M, Öpik M, Bueno CG. Environmental modulation of plant mycorrhizal traits in the global flora. Ecol Lett 2023;26:1862-1876. [PMID: 37766496 DOI: 10.1111/ele.14309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 08/15/2023] [Accepted: 08/21/2023] [Indexed: 09/29/2023]

Wu Y, Liu H, Liu S, Lou C. Estimate of near-surface NO₂ concentrations in Fenwei Plain, China, based on TROPOMI data and random forest model. ENVIRONMENTAL MONITORING AND ASSESSMENT 2023;195:1379. [PMID: 37882903 DOI: 10.1007/s10661-023-11993-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Accepted: 10/12/2023] [Indexed: 10/27/2023]

Young T, Laroche O, Walker SP, Miller MR, Casanovas P, Steiner K, Esmaeili N, Zhao R, Bowman JP, Wilson R, Bridle A, Carter CG, Nowak BF, Alfaro AC, Symonds JE. Prediction of Feed Efficiency and Performance-Based Traits in Fish via Integration of Multiple Omics and Clinical Covariates. BIOLOGY 2023;12:1135. [PMID: 37627019 PMCID: PMC10452023 DOI: 10.3390/biology12081135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 08/07/2023] [Accepted: 08/08/2023] [Indexed: 08/27/2023]

Abstract

Fish aquaculture is a rapidly expanding global industry, set to support growing demands for sources of marine protein. Enhancing feed efficiency (FE) in farmed fish is required to reduce production costs and improve sector sustainability. Recognising that organisms are complex systems whose emerging phenotypes are the product of multiple interacting molecular processes, systems-based approaches are expected to deliver new biological insights into FE and growth performance. Here, we establish 14 diverse layers of multi-omics and clinical covariates to assess their capacities to predict FE and associated performance traits in a fish model (Oncorhynchus tshawytscha) and uncover the influential variables. Inter-omic relatedness between the different layers revealed several significant concordances, particularly between datasets originating from similar material/tissue and between blood indicators and some of the proteomic (liver), metabolomic (liver), and microbiomic layers. Single- and multi-layer random forest (RF) regression models showed that integration of all data layers provide greater FE prediction power than any single-layer model alone. Although FE was among the most challenging of the traits we attempted to predict, the mean accuracy of 40 different FE models in terms of root-mean square errors normalized to percentage was 30.4%, supporting RF as a feature selection tool and approach for complex trait prediction. Major contributions to the integrated FE models were derived from layers of proteomic and metabolomic data, with substantial influence also provided by the lipid composition layer. A correlation matrix of the top 27 variables in the models highlighted FE trait-associations with faecal bacteria (Serratia spp.), palmitic and nervonic acid moieties in whole body lipids, levels of free glycerol in muscle, and N-acetylglutamic acid content in liver. In summary, we identified subsets of molecular characteristics for the assessment of commercially relevant performance-based metrics in farmed Chinook salmon.

Collapse

Affiliation(s)

Tim Young Aquaculture Biotechnology Research Group, Department of Environmental Science, School of Science, Private Bag 92006, Auckland 1142, New Zealand The Centre for Biomedical and Chemical Sciences, School of Science, Auckland University of Technology, Private Bag 92006, Auckland 1142, New Zealand
Olivier Laroche Cawthron Institute, Nelson 7010, New Zealand
Seumas P. Walker Cawthron Institute, Nelson 7010, New Zealand
Matthew R. Miller Cawthron Institute, Nelson 7010, New Zealand Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia
Paula Casanovas Cawthron Institute, Nelson 7010, New Zealand
Konstanze Steiner Cawthron Institute, Nelson 7010, New Zealand
Noah Esmaeili Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia
Ruixiang Zhao Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia
John P. Bowman Tasmanian Institute of Agricultural Research, University of Tasmania, Hobart 7005, Australia
Richard Wilson Central Science Laboratory, Research Division, University of Tasmania, Hobart 7001, Australia
Andrew Bridle Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia
Chris G. Carter Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia Blue Economy Cooperative Research Centre, Launceston 7250, Australia
Barbara F. Nowak Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia
Andrea C. Alfaro Aquaculture Biotechnology Research Group, Department of Environmental Science, School of Science, Private Bag 92006, Auckland 1142, New Zealand
Jane E. Symonds Cawthron Institute, Nelson 7010, New Zealand Institute for Marine and Antarctic Studies, University of Tasmania, Hobart Private Bag 49, Hobart 7005, Australia

Collapse

Chen W, Lv X, Cao X, Yuan Z, Wang S, Getachew T, Mwacharo JM, Haile A, Quan K, Li Y, Sun W. Integration of the Microbiome, Metabolome and Transcriptome Reveals Escherichia coli F17 Susceptibility of Sheep. Animals (Basel) 2023;13:ani13061050. [PMID: 36978593 PMCID: PMC10044122 DOI: 10.3390/ani13061050] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 03/09/2023] [Accepted: 03/11/2023] [Indexed: 03/17/2023] Open

Affiliation(s)

Weihao Chen College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
Xiaoyang Lv Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China International Joint Research Laboratory in Universities of Jiangsu Province of China for Domestic Animal Germplasm Resources and Genetic Improvement, Yangzhou University, Yangzhou 225009, China
Xiukai Cao Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China
Zehu Yuan Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China
Shanhe Wang College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
Tesfaye Getachew International Centre for Agricultural Research in the Dry Areas, Addis Ababa 999047, Ethiopia
Joram M. Mwacharo International Centre for Agricultural Research in the Dry Areas, Addis Ababa 999047, Ethiopia
Aynalem Haile International Centre for Agricultural Research in the Dry Areas, Addis Ababa 999047, Ethiopia
Kai Quan College of Animal Science and Technology, Henan University of Animal Husbandry and Economics, Zhengzhou 450046, China
Yutao Li CSIRO Agriculture and Food, 306 Carmody Rd, St Lucia, QLD 4067, Australia
Wei Sun College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China International Joint Research Laboratory in Universities of Jiangsu Province of China for Domestic Animal Germplasm Resources and Genetic Improvement, Yangzhou University, Yangzhou 225009, China “Innovative China” “Belt and Road” International Agricultural Technology Innovation Institute for Evaluation, Protection, and Improvement on Sheep Genetic Resource, Yangzhou 225009, China Correspondence: ; Tel.: +86-13952750912

Collapse

Kircher M, Säurich J, Selle M, Jung K. Assessing Outlier Probabilities in Transcriptomics Data When Evaluating a Classifier. Genes (Basel) 2023;14:genes14020387. [PMID: 36833313 PMCID: PMC9956321 DOI: 10.3390/genes14020387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 01/27/2023] [Accepted: 01/30/2023] [Indexed: 02/04/2023] Open

Mahmood U, Li X, Fan Y, Chang W, Niu Y, Li J, Qu C, Lu K. Multi-omics revolution to promote plant breeding efficiency. FRONTIERS IN PLANT SCIENCE 2022;13:1062952. [PMID: 36570904 PMCID: PMC9773847 DOI: 10.3389/fpls.2022.1062952] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 11/24/2022] [Indexed: 06/17/2023]

Affiliation(s)

Umer Mahmood Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
Xiaodong Li Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
Yonghai Fan Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
Wei Chang Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
Yue Niu Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
Jiana Li Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China Academy of Agricultural Sciences, Southwest University, Chongqing, China Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
Cunmin Qu Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China Academy of Agricultural Sciences, Southwest University, Chongqing, China Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
Kun Lu Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China Academy of Agricultural Sciences, Southwest University, Chongqing, China Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China

Collapse

Kao PH, Baiya S, Lai ZY, Huang CM, Jhan LH, Lin CJ, Lai YS, Kao CF. An advanced systems biology framework of feature engineering for cold tolerance genes discovery from integrated omics and non-omics data in soybean. FRONTIERS IN PLANT SCIENCE 2022;13:1019709. [PMID: 36247545 PMCID: PMC9562094 DOI: 10.3389/fpls.2022.1019709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 09/06/2022] [Indexed: 06/16/2023]

Abstract

Soybean is sensitive to low temperatures during the crop growing season. An urgent demand for breeding cold-tolerant cultivars to alleviate the production loss is apparent to cope with this scenario. Cold-tolerant trait is a complex and quantitative trait controlled by multiple genes, environmental factors, and their interaction. In this study, we proposed an advanced systems biology framework of feature engineering for the discovery of cold tolerance genes (CTgenes) from integrated omics and non-omics (OnO) data in soybean. An integrative pipeline was introduced for feature selection and feature extraction from different layers in the integrated OnO data using data ensemble methods and the non-parameter random forest prioritization to minimize uncertainties and false positives for accuracy improvement of results. In total, 44, 143, and 45 CTgenes were identified in short-, mid-, and long-term cold treatment, respectively, from the corresponding gene-pool. These CTgenes outperformed the remaining genes, the random genes, and the other candidate genes identified by other approaches in an independent RNA-seq database. Furthermore, we applied pathway enrichment and crosstalk network analyses to uncover relevant physiological pathways with the discovery of underlying cold tolerance in hormone- and defense-related modules. Our CTgenes were validated by using 55 SNP genotype data of 56 soybean samples in cold tolerance experiments. This suggests that the CTgenes identified from our proposed systematic framework can effectively distinguish cold-resistant and cold-sensitive lines. It is an important advancement in the soybean cold-stress response. The proposed pipelines provide an alternative solution to biomarker discovery, module discovery, and sample classification underlying a particular trait in plants in a robust and efficient way.

Collapse

Liang M, An B, Chang T, Deng T, Du L, Li K, Cao S, Du Y, Xu L, Zhang L, Gao X, Li J, Gao H. Incorporating kernelized multi-omics data improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2022;13:103. [PMID: 36127743 PMCID: PMC9490992 DOI: 10.1186/s40104-022-00756-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 07/08/2022] [Indexed: 11/18/2022] Open

Abstract

Background

Genomic selection (GS) has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes. Besides genome, transcriptome and metabolome information are increasingly considered new sources for GS. Difficulties in building the model with multi-omics data for GS and the limit of specimen availability have both delayed the progress of investigating multi-omics.

Results

We utilized the Cosine kernel to map genomic and transcriptomic data as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${n}\times {n}$$\end{document}n×n symmetric matrix (G matrix and T matrix), combined with the best linear unbiased prediction (BLUP) for GS. Here, we defined five kernel-based prediction models: genomic BLUP (GBLUP), transcriptome-BLUP (TBLUP), multi-omics BLUP (MBLUP, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\boldsymbol M=\mathrm{ratio}\times\boldsymbol G+(1-\mathrm{ratio})\times\boldsymbol T$$\end{document}M=ratio×G+(1-ratio)×T), multi-omics single-step BLUP (mssBLUP), and weighted multi-omics single-step BLUP (wmssBLUP) to integrate transcribed individuals and genotyped resource population. The predictive accuracy evaluations in four traits of the Chinese Simmental beef cattle population showed that (1) MBLUP was far preferred to GBLUP (ratio = 1.0), (2) the prediction accuracy of wmssBLUP and mssBLUP had 4.18% and 3.37% average improvement over GBLUP, (3) We also found the accuracy of wmssBLUP increased with the growing proportion of transcribed cattle in the whole resource population.

Conclusions

We concluded that the inclusion of transcriptome data in GS had the potential to improve accuracy. Moreover, wmssBLUP is accepted to be a promising alternative for the present situation in which plenty of individuals are genotyped when fewer are transcribed.

Supplementary Information

The online version contains supplementary material available at 10.1186/s40104-022-00756-6.

Collapse

Affiliation(s)

Mang Liang Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Bingxing An Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Tianpeng Chang Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Tianyu Deng Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Lili Du Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Keanning Li Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Sheng Cao Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Yueying Du Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Lingyang Xu Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Lupei Zhang Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Xue Gao Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Junya Li Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
Huijiang Gao Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China.

Collapse

Yu Z, Wang Z, Jiang Q, Wang J, Zheng J, Zhang T. Analysis of Factors of Productivity of Tight Conglomerate Reservoirs Based on Random Forest Algorithm. ACS OMEGA 2022;7:20390-20404. [PMID: 35721933 PMCID: PMC9202053 DOI: 10.1021/acsomega.2c02546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Accepted: 05/20/2022] [Indexed: 05/25/2023]

Abstract

The tight conglomerate reservoir of Baikouquan formation in the MA 131 well block in the Junggar basin abounds with petroleum reserves, yet the vertical wells in this reservoir have achieved a limited development effect. The tight conglomerate reservoirs have become an important target for exploration and exploitation. The high-efficiency development scheme of a small well spacing three-dimensional (3D) staggered well pattern has been determined by a series of field tests on well pattern and well spacing development. Multistage fracturing with a horizontal well has been demonstrated as the primary development technology. The horizontal wells in the MA 131 small well spacing demonstration area have achieved significantly different development effects, and the major controlling factors for high and stable production of a single well remain unclear. In this study, we proposed an evaluation model of major productivity controlling factors of the tight conglomerate reservoir to provide a reference for oil recovery based on a random forest (RF) machine-learning algorithm. The productivity factors were investigated from two aspects: petrophysical facies that are capable of indicating the genetic mechanism of geological dessert and engineering dessert parameters forming complex fracture networks. Resultantly, the reservoir in the MA 131 well block can be classified into 12 petrophysical facies according to the sedimentary characteristics and diagenesis analysis. The mercury injection curves of a variety of petrophysical facies can be classified into four reservoir quality types. The RF model was trained on 80% of the data to predict the oil well class using the selected features as primary inputs while the remaining 20% of the data were set to test the model performance. The results indicated that the RF model produced excellent results with only 12 misclassifications across the entire data set of 627 samples that represent <2% error. The important evaluation score of the random forest algorithm model showed that the reservoir type, oil saturation, horizontal stress difference, and gravel content are the most important four indicators, with each value exceeding 15%. Brittleness and maximum horizontal stress are considered the least important indexes, with values of less than 5%. Reservoir quality and oil saturation were confirmed as the major controlling factors and material foundation for oil wells' high and stable production. As indicated in this study, stress difference and gravel content are the major controlling factors in the formation of a complex fracture network.

Collapse

Hesami M, Alizadeh M, Jones AMP, Torkamaneh D. Machine learning: its challenges and opportunities in plant system biology. Appl Microbiol Biotechnol 2022;106:3507-3530. [PMID: 35575915 DOI: 10.1007/s00253-022-11963-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 03/14/2022] [Accepted: 05/07/2022] [Indexed: 12/25/2022]

Rudar J, Porter TM, Wright M, Golding GB, Hajibabaei M. LANDMark: an ensemble approach to the supervised selection of biomarkers in high-throughput sequencing data. BMC Bioinformatics 2022;23:110. [PMID: 35361114 PMCID: PMC8969335 DOI: 10.1186/s12859-022-04631-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 03/07/2022] [Indexed: 11/10/2022] Open

Abstract

Background

Identification of biomarkers, which are measurable characteristics of biological datasets, can be challenging. Although amplicon sequence variants (ASVs) can be considered potential biomarkers, identifying important ASVs in high-throughput sequencing datasets is challenging. Noise, algorithmic failures to account for specific distributional properties, and feature interactions can complicate the discovery of ASV biomarkers. In addition, these issues can impact the replicability of various models and elevate false-discovery rates. Contemporary machine learning approaches can be leveraged to address these issues. Ensembles of decision trees are particularly effective at classifying the types of data commonly generated in high-throughput sequencing (HTS) studies due to their robustness when the number of features in the training data is orders of magnitude larger than the number of samples. In addition, when combined with appropriate model introspection algorithms, machine learning algorithms can also be used to discover and select potential biomarkers. However, the construction of these models could introduce various biases which potentially obfuscate feature discovery.

Results

We developed a decision tree ensemble, LANDMark, which uses oblique and non-linear cuts at each node. In synthetic and toy tests LANDMark consistently ranked as the best classifier and often outperformed the Random Forest classifier. When trained on the full metabarcoding dataset obtained from Canada’s Wood Buffalo National Park, LANDMark was able to create highly predictive models and achieved an overall balanced accuracy score of 0.96 ± 0.06. The use of recursive feature elimination did not impact LANDMark’s generalization performance and, when trained on data from the BE amplicon, it was able to outperform the Linear Support Vector Machine, Logistic Regression models, and Stochastic Gradient Descent models (p ≤ 0.05). Finally, LANDMark distinguishes itself due to its ability to learn smoother non-linear decision boundaries.

Conclusions

Our work introduces LANDMark, a meta-classifier which blends the characteristics of several machine learning models into a decision tree and ensemble learning framework. To our knowledge, this is the first study to apply this type of ensemble approach to amplicon sequencing data and we have shown that analyzing these datasets using LANDMark can produce highly predictive and consistent models.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-022-04631-z.

Collapse

Li J, Chen F, Liang H, Yan J. MoNET: an R package for multi-omic network analysis. Bioinformatics 2022;38:1165-1167. [PMID: 34694378 DOI: 10.1093/bioinformatics/btab722] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 08/31/2021] [Accepted: 10/19/2021] [Indexed: 02/03/2023] Open

Meng D, Xu J, Zhao J. Analysis and prediction of hand, foot and mouth disease incidence in China using Random Forest and XGBoost. PLoS One 2021;16:e0261629. [PMID: 34936688 PMCID: PMC8694472 DOI: 10.1371/journal.pone.0261629] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 12/06/2021] [Indexed: 12/13/2022] Open

De San-Martin BS, Ferreira VG, Bitencourt MR, Pereira PCG, Carrilho E, de Assunção NA, de Carvalho LRS. Metabolomics as a potential tool for the diagnosis of growth hormone deficiency (GHD): a review. ARCHIVES OF ENDOCRINOLOGY AND METABOLISM 2021;64:654-663. [PMID: 33085993 PMCID: PMC10528619 DOI: 10.20945/2359-3997000000300] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 08/25/2020] [Indexed: 11/23/2022]

Jia B, Chen Y, Wu J. Bibliometric Analysis and Research Trend Forecast of Healthy Urban Planning for 40 Years (1981-2020). INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:ijerph18189444. [PMID: 34574368 PMCID: PMC8464861 DOI: 10.3390/ijerph18189444] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 08/26/2021] [Accepted: 09/01/2021] [Indexed: 11/25/2022]

Reel PS, Reel S, Pearson E, Trucco E, Jefferson E. Using machine learning approaches for multi-omics data analysis: A review. Biotechnol Adv 2021;49:107739. [PMID: 33794304 DOI: 10.1016/j.biotechadv.2021.107739] [Citation(s) in RCA: 265] [Impact Index Per Article: 88.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 03/01/2021] [Accepted: 03/25/2021] [Indexed: 02/06/2023]

Pazhamala LT, Kudapa H, Weckwerth W, Millar AH, Varshney RK. Systems biology for crop improvement. THE PLANT GENOME 2021;14:e20098. [PMID: 33949787 DOI: 10.1002/tpg2.20098] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Accepted: 03/09/2021] [Indexed: 05/19/2023]

Kim DY, Kim JM. Multi-omics integration strategies for animal epigenetic studies - A review. Anim Biosci 2021;34:1271-1282. [PMID: 33902167 PMCID: PMC8255897 DOI: 10.5713/ab.21.0042] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Accepted: 04/21/2021] [Indexed: 12/15/2022] Open

Singh G, Papoutsoglou EA, Keijts-Lalleman F, Vencheva B, Rice M, Visser RG, Bachem CW, Finkers R. Extracting knowledge networks from plant scientific literature: potato tuber flesh color as an exemplary trait. BMC PLANT BIOLOGY 2021;21:198. [PMID: 33894758 PMCID: PMC8070292 DOI: 10.1186/s12870-021-02943-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 03/29/2021] [Indexed: 06/12/2023]

Abstract

BACKGROUND

Scientific literature carries a wealth of information crucial for research, but only a fraction of it is present as structured information in databases and therefore can be analyzed using traditional data analysis tools. Natural language processing (NLP) is often and successfully employed to support humans by distilling relevant information from large corpora of free text and structuring it in a way that lends itself to further computational analyses. For this pilot, we developed a pipeline that uses NLP on biological literature to produce knowledge networks. We focused on the flesh color of potato, a well-studied trait with known associations, and we investigated whether these knowledge networks can assist us in formulating new hypotheses on the underlying biological processes.

RESULTS

We trained an NLP model based on a manually annotated corpus of 34 full-text potato articles, to recognize relevant biological entities and relationships between them in text (genes, proteins, metabolites and traits). This model detected the number of biological entities with a precision of 97.65% and a recall of 88.91% on the training set. We conducted a time series analysis on 4023 PubMed abstract of plant genetics-based articles which focus on 4 major Solanaceous crops (tomato, potato, eggplant and capsicum), to determine that the networks contained both previously known and contemporaneously unknown leads to subsequently discovered biological phenomena relating to flesh color. A novel time-based analysis of these networks indicates a connection between our trait and a candidate gene (zeaxanthin epoxidase) already two years prior to explicit statements of that connection in the literature.

CONCLUSIONS

Our time-based analysis indicates that network-assisted hypothesis generation shows promise for knowledge discovery, data integration and hypothesis generation in scientific research.

Collapse

Xu Y, Zhao Y, Wang X, Ma Y, Li P, Yang Z, Zhang X, Xu C, Xu S. Incorporation of parental phenotypic data into multi-omic models improves prediction of yield-related traits in hybrid rice. PLANT BIOTECHNOLOGY JOURNAL 2021;19:261-272. [PMID: 32738177 PMCID: PMC7868986 DOI: 10.1111/pbi.13458] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 06/14/2020] [Accepted: 07/22/2020] [Indexed: 05/15/2023]

Abstract

Hybrid breeding has been shown to effectively increase rice productivity. However, identifying desirable hybrids out of numerous potential combinations is a daunting challenge. Genomic selection holds great promise for accelerating hybrid breeding by enabling early selection before phenotypes are measured. With the recent advances in multi-omic technologies, hybrid prediction based on transcriptomic and metabolomic data has received increasing attention. However, the current omic-based hybrid prediction has ignored parental phenotypic information, which is of fundamental importance in plant breeding. In this study, we integrated parental phenotypic information into various multi-omic prediction models applied in hybrid breeding of rice and compared the predictabilities of 15 combinations from four sets of predictors from the parents, that is genome, transcriptome, metabolome and phenome. The predictability for each combination was evaluated using the best linear unbiased prediction and a modified fast HAT method. We found significant interactions between predictors and traits in predictability, but joint prediction with various combinations of the predictors significantly improved predictability relative to prediction of any single source omic data for each trait investigated. Incorporation of parental phenotypic data into various omic predictors increased the predictability, averagely by 13.6%, 54.5%, 19.9% and 8.3%, for grain yield, number of tillers per plant, number of grains per panicle and 1000 grain weight, respectively. Among nine models of incorporating parental traits, the AD-All model was the most effective one. This novel strategy of incorporating parental phenotypic data into multi-omic prediction is expected to improve hybrid breeding progress, especially with the development of high-throughput phenotyping technologies.

Collapse

Affiliation(s)

Yang Xu Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Yue Zhao Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Xin Wang Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Ying Ma Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Pengcheng Li Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Zefeng Yang Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Xuecai Zhang International Maize and Wheat Improvement Center (CIMMYT)MexicoDFMexico
Chenwu Xu Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
Shizhong Xu Department of Botany and Plant SciencesUniversity of CaliforniaRiversideCAUSA

Collapse

Proteome-wide Systems Genetics to Identify Functional Regulators of Complex Traits. Cell Syst 2021;12:5-22. [PMID: 33476553 DOI: 10.1016/j.cels.2020.10.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/15/2020] [Accepted: 10/07/2020] [Indexed: 02/08/2023]

Acharjee A, Larkman J, Xu Y, Cardoso VR, Gkoutos GV. A random forest based biomarker discovery and power analysis framework for diagnostics research. BMC Med Genomics 2020;13:178. [PMID: 33228632 PMCID: PMC7685541 DOI: 10.1186/s12920-020-00826-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 11/15/2020] [Indexed: 11/25/2022] Open

Abstract

Background

Biomarker identification is one of the major and important goal of functional genomics and translational medicine studies. Large scale –omics data are increasingly being accumulated and can provide vital means for the identification of biomarkers for the early diagnosis of complex disease and/or for advanced patient/diseases stratification. These tasks are clearly interlinked, and it is essential that an unbiased and stable methodology is applied in order to address them. Although, recently, many, primarily machine learning based, biomarker identification approaches have been developed, the exploration of potential associations between biomarker identification and the design of future experiments remains a challenge.

Methods

In this study, using both simulated and published experimentally derived datasets, we assessed the performance of several state-of-the-art Random Forest (RF) based decision approaches, namely the Boruta method, the permutation based feature selection without correction method, the permutation based feature selection with correction method, and the backward elimination based feature selection method. Moreover, we conducted a power analysis to estimate the number of samples required for potential future studies.

Results

We present a number of different RF based stable feature selection methods and compare their performances using simulated, as well as published, experimentally derived, datasets. Across all of the scenarios considered, we found the Boruta method to be the most stable methodology, whilst the Permutation (Raw) approach offered the largest number of relevant features, when allowed to stabilise over a number of iterations. Finally, we developed and made available a web interface (https://joelarkman.shinyapps.io/PowerTools/) to streamline power calculations thereby aiding the design of potential future studies within a translational medicine context.

Conclusions

We developed a RF-based biomarker discovery framework and provide a web interface for our framework, termed PowerTools, that caters the design of appropriate and cost-effective subsequent future omics study.

Collapse

Affiliation(s)

Animesh Acharjee College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT, UK. .,Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, Birmingham, B15 2TT, UK. .,NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, B15 2WB, UK.
Joseph Larkman College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT, UK.,Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, Birmingham, B15 2TT, UK
Yuanwei Xu College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT, UK.,Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, Birmingham, B15 2TT, UK
Victor Roth Cardoso College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT, UK.,Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, Birmingham, B15 2TT, UK.,MRC Health Data Research UK (HDR UK), London, UK
Georgios V Gkoutos College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT, UK.,Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, Birmingham, B15 2TT, UK.,NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, B15 2WB, UK.,MRC Health Data Research UK (HDR UK), London, UK.,NIHR Experimental Cancer Medicine Centre, Birmingham, B15 2TT, UK.,NIHR Biomedical Research Centre, University Hospital Birmingham, Birmingham, B15 2TT, UK

Collapse

Statistical and Machine-Learning Analyses in Nutritional Genomics Studies. Nutrients 2020;12:nu12103140. [PMID: 33066636 PMCID: PMC7602401 DOI: 10.3390/nu12103140] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 10/08/2020] [Accepted: 10/10/2020] [Indexed: 12/18/2022] Open

Balmant KM, Noble JD, C Alves F, Dervinis C, Conde D, Schmidt HW, Vazquez AI, Barbazuk WB, Campos GDL, Resende MFR, Kirst M. Xylem systems genetics analysis reveals a key regulator of lignin biosynthesis in Populus deltoides. Genome Res 2020;30:1131-1143. [PMID: 32817237 PMCID: PMC7462072 DOI: 10.1101/gr.261438.120] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 07/13/2020] [Indexed: 02/01/2023]

Shi WJ, Zhuang Y, Russell PH, Hobbs BD, Parker MM, Castaldi PJ, Rudra P, Vestal B, Hersh CP, Saba LM, Kechris K. Unsupervised discovery of phenotype-specific multi-omics networks. Bioinformatics 2020;35:4336-4343. [PMID: 30957844 DOI: 10.1093/bioinformatics/btz226] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 02/01/2019] [Accepted: 04/05/2019] [Indexed: 12/15/2022] Open

Abstract

MOTIVATION

Complex diseases often involve a wide spectrum of phenotypic traits. Better understanding of the biological mechanisms relevant to each trait promotes understanding of the etiology of the disease and the potential for targeted and effective treatment plans. There have been many efforts towards omics data integration and network reconstruction, but limited work has examined the incorporation of relevant (quantitative) phenotypic traits.

RESULTS

We propose a novel technique, sparse multiple canonical correlation network analysis (SmCCNet), for integrating multiple omics data types along with a quantitative phenotype of interest, and for constructing multi-omics networks that are specific to the phenotype. As a case study, we focus on miRNA-mRNA networks. Through simulations, we demonstrate that SmCCNet has better overall prediction performance compared to popular gene expression network construction and integration approaches under realistic settings. Applying SmCCNet to studies on chronic obstructive pulmonary disease (COPD) and breast cancer, we found enrichment of known relevant pathways (e.g. the Cadherin pathway for COPD and the interferon-gamma signaling pathway for breast cancer) as well as less known omics features that may be important to the diseases. Although those applications focus on miRNA-mRNA co-expression networks, SmCCNet is applicable to a variety of omics and other data types. It can also be easily generalized to incorporate multiple quantitative phenotype simultaneously. The versatility of SmCCNet suggests great potential of the approach in many areas.

AVAILABILITY AND IMPLEMENTATION

The SmCCNet algorithm is written in R, and is freely available on the web at https://cran.r-project.org/web/packages/SmCCNet/index.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Jamil IN, Remali J, Azizan KA, Nor Muhammad NA, Arita M, Goh HH, Aizat WM. Systematic Multi-Omics Integration (MOI) Approach in Plant Systems Biology. FRONTIERS IN PLANT SCIENCE 2020;11:944. [PMID: 32754171 PMCID: PMC7371031 DOI: 10.3389/fpls.2020.00944] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Accepted: 06/10/2020] [Indexed: 05/03/2023]

Moreira FF, Oliveira HR, Volenec JJ, Rainey KM, Brito LF. Integrating High-Throughput Phenotyping and Statistical Genomic Methods to Genetically Improve Longitudinal Traits in Crops. FRONTIERS IN PLANT SCIENCE 2020;11:681. [PMID: 32528513 PMCID: PMC7264266 DOI: 10.3389/fpls.2020.00681] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 04/30/2020] [Indexed: 05/28/2023]

Eicher T, Kinnebrew G, Patt A, Spencer K, Ying K, Ma Q, Machiraju R, Mathé EA. Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources. Metabolites 2020;10:E202. [PMID: 32429287 PMCID: PMC7281435 DOI: 10.3390/metabo10050202] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 05/07/2020] [Accepted: 05/13/2020] [Indexed: 02/06/2023] Open

Affiliation(s)

Tara Eicher Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.) Computer Science and Engineering Department, The Ohio State University College of Engineering, Columbus, OH 43210, USA
Garrett Kinnebrew Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.) Comprehensive Cancer Center, The Ohio State University and James Cancer Hospital, Columbus, OH 43210, USA; Bioinformatics Shared Resource Group, The Ohio State University, Columbus, OH 43210, USA
Andrew Patt Division of Preclinical Innovation, National Center for Advancing Translational Sciences, NIH, 9800 Medical Center Dr., Rockville, MD, 20892, USA; Biomedical Sciences Graduate Program, The Ohio State University, Columbus, OH 43210, USA
Kyle Spencer Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.) Biomedical Sciences Graduate Program, The Ohio State University, Columbus, OH 43210, USA Nationwide Children’s Research Hospital, Columbus, OH 43210, USA
Kevin Ying Comprehensive Cancer Center, The Ohio State University and James Cancer Hospital, Columbus, OH 43210, USA; Molecular, Cellular and Developmental Biology Program, The Ohio State University, Columbus, OH 43210, USA
Qin Ma Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
Raghu Machiraju Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.) Computer Science and Engineering Department, The Ohio State University College of Engineering, Columbus, OH 43210, USA Department of Pathology, Wexner Medical Center, The Ohio State University, Columbus, OH 43210, USA Translational Data Analytics Institute, The Ohio State University, Columbus, OH 43210, USA
Ewy A. Mathé Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.) Division of Preclinical Innovation, National Center for Advancing Translational Sciences, NIH, 9800 Medical Center Dr., Rockville, MD, 20892, USA;

Collapse

Zhang X, Yang S, Srivastava G, Chen MY, Cheng X. Hybridization of cognitive computing for food services. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2019.106051] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Clinical-learning versus machine-learning for transdiagnostic prediction of psychosis onset in individuals at-risk. Transl Psychiatry 2019;9:259. [PMID: 31624229 PMCID: PMC6797779 DOI: 10.1038/s41398-019-0600-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Revised: 05/03/2019] [Accepted: 05/31/2019] [Indexed: 02/08/2023] Open

Zhuang YY, Liu HJ, Song X, Ju Y, Peng H. A Linear Regression Predictor for Identifying N⁶-Methyladenosine Sites Using Frequent Gapped K-mer Pattern. MOLECULAR THERAPY. NUCLEIC ACIDS 2019;18:673-680. [PMID: 31707204 PMCID: PMC6849367 DOI: 10.1016/j.omtn.2019.10.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 08/19/2019] [Accepted: 10/03/2019] [Indexed: 01/07/2023]

Krautenbacher N, Flach N, Böck A, Laubhahn K, Laimighofer M, Theis FJ, Ankerst DP, Fuchs C, Schaub B. A strategy for high-dimensional multivariable analysis classifies childhood asthma phenotypes from genetic, immunological, and environmental factors. Allergy 2019;74:1364-1373. [PMID: 30737985 PMCID: PMC6767756 DOI: 10.1111/all.13745] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2018] [Revised: 12/22/2018] [Accepted: 01/06/2019] [Indexed: 12/14/2022]

Abstract

Background

Associations between childhood asthma phenotypes and genetic, immunological, and environmental factors have been previously established. Yet, strategies to integrate high‐dimensional risk factors from multiple distinct data sets, and thereby increase the statistical power of analyses, have been hampered by a preponderance of missing data and lack of methods to accommodate them.

Methods

We assembled questionnaire, diagnostic, genotype, microarray, RT‐qPCR, flow cytometry, and cytokine data (referred to as data modalities) to use as input factors for a classifier that could distinguish healthy children, mild‐to‐moderate allergic asthmatics, and nonallergic asthmatics. Based on data from 260 German children aged 4‐14 from our university outpatient clinic, we built a novel multilevel prediction approach for asthma outcome which could deal with a present complex missing data structure.

Results

The optimal learning method was boosting based on all data sets, achieving an area underneath the receiver operating characteristic curve (AUC) for three classes of phenotypes of 0.81 (95%‐confidence interval (CI): 0.65‐0.94) using leave‐one‐out cross‐validation. Besides improving the AUC, our integrative multilevel learning approach led to tighter CIs than using smaller complete predictor data sets (AUC = 0.82 [0.66‐0.94] for boosting). The most important variables for classifying childhood asthma phenotypes comprised novel identified genes, namely PKN2 (protein kinase N2), PTK2 (protein tyrosine kinase 2), and ALPP (alkaline phosphatase, placental).

Conclusion

Our combination of several data modalities using a novel strategy improved classification of childhood asthma phenotypes but requires validation in external populations. The generic approach is applicable to other multilevel data‐based risk prediction settings, which typically suffer from incomplete data.

Collapse

Affiliation(s)

Norbert Krautenbacher Institute of Computational Biology Helmholtz Zentrum München German Research Center for Environmental Health GmbH Neuherberg Germany Technische Universität München Center for Mathematics Chair of Mathematical Modeling of Biological Systems Garching Germany
Nicolai Flach Institute of Computational Biology Helmholtz Zentrum München German Research Center for Environmental Health GmbH Neuherberg Germany Technische Universität München Center for Mathematics Chair of Mathematical Modeling of Biological Systems Garching Germany
Andreas Böck Department of Pulmonary and Allergy Dr. von Hauner Children's Hospital LMU Munich Germany
Kristina Laubhahn Department of Pulmonary and Allergy Dr. von Hauner Children's Hospital LMU Munich Germany Member of German Lung Centre (DZL) CPC Munich Germany
Michael Laimighofer Institute of Computational Biology Helmholtz Zentrum München German Research Center for Environmental Health GmbH Neuherberg Germany Technische Universität München Center for Mathematics Chair of Mathematical Modeling of Biological Systems Garching Germany
Fabian J. Theis Institute of Computational Biology Helmholtz Zentrum München German Research Center for Environmental Health GmbH Neuherberg Germany Technische Universität München Center for Mathematics Chair of Mathematical Modeling of Biological Systems Garching Germany
Donna P. Ankerst Technische Universität München Center for Mathematics Chair of Mathematical Modeling of Biological Systems Garching Germany University of Texas Health Science Center at San Antonio San Antonio Texas
Christiane Fuchs Institute of Computational Biology Helmholtz Zentrum München German Research Center for Environmental Health GmbH Neuherberg Germany Technische Universität München Center for Mathematics Chair of Mathematical Modeling of Biological Systems Garching Germany Faculty of Business Administration and Economics Bielefeld University Bielefeld Germany
Bianca Schaub Department of Pulmonary and Allergy Dr. von Hauner Children's Hospital LMU Munich Germany Member of German Lung Centre (DZL) CPC Munich Germany

Collapse

Ajjolli Nagaraja A, Fontaine N, Delsaut M, Charton P, Damour C, Offmann B, Grondin-Perez B, Cadet F. Flux prediction using artificial neural network (ANN) for the upper part of glycolysis. PLoS One 2019;14:e0216178. [PMID: 31067238 PMCID: PMC6505829 DOI: 10.1371/journal.pone.0216178] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Accepted: 04/15/2019] [Indexed: 01/08/2023] Open

Li Z, Gao N, Martini JWR, Simianer H. Integrating Gene Expression Data Into Genomic Prediction. Front Genet 2019;10:126. [PMID: 30858865 PMCID: PMC6397893 DOI: 10.3389/fgene.2019.00126] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2018] [Accepted: 02/04/2019] [Indexed: 01/14/2023] Open

Segal JP, Mullish BH, Quraishi MN, Acharjee A, Williams HRT, Iqbal T, Hart AL, Marchesi JR. The application of omics techniques to understand the role of the gut microbiota in inflammatory bowel disease. Therap Adv Gastroenterol 2019;12:1756284818822250. [PMID: 30719076 PMCID: PMC6348496 DOI: 10.1177/1756284818822250] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 11/23/2018] [Indexed: 02/04/2023] Open

Men H, Jiao Y, Shi Y, Gong F, Chen Y, Fang H, Liu J. Odor Fingerprint Analysis Using Feature Mining Method Based on Olfactory Sensory Evaluation. SENSORS (BASEL, SWITZERLAND) 2018;18:E3387. [PMID: 30309029 PMCID: PMC6210366 DOI: 10.3390/s18103387] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Revised: 10/06/2018] [Accepted: 10/08/2018] [Indexed: 12/01/2022]

Darst B, Engelman CD, Tian Y, Lorenzo Bermejo J. Data mining and machine learning approaches for the integration of genome-wide association and methylation data: methodology and main conclusions from GAW20. BMC Genet 2018;19:76. [PMID: 30255774 PMCID: PMC6157271 DOI: 10.1186/s12863-018-0646-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Multiple layers of genetic and epigenetic variability are being simultaneously explored in an increasing number of health studies. We summarize here different approaches applied in the Data Mining and Machine Learning group at the GAW20 to integrate genome-wide genotype and methylation array data.

RESULTS

We provide a non-intimidating introduction to some frequently used methods to investigate high-dimensional molecular data and compare the different approaches tried by group members: random forest, deep learning, cluster analysis, mixed models, and gene-set enrichment analysis. Group contributions were quite heterogeneous regarding investigated data sets (real vs simulated), conducted data quality control and assessed phenotypes (eg, metabolic syndrome vs relative differences of log-transformed triglyceride concentrations before and after fenofibrate treatment). However, some common technical issues were detected, leading to practical recommendations.

CONCLUSIONS

Different sources of correlation were identified by group members, including population stratification, family structure, batch effects, linkage disequilibrium and correlation of methylation values at neighboring cytosine-phosphate-guanine (CpG) sites, and the majority of applied approaches were able to take into account identified correlation structures. The ability to efficiently deal with high-dimensional omics data, and the model free nature of the approaches that did not require detailed model specifications were clearly recognized as the main strengths of applied methods. A limitation of random forest is its sensitivity to highly correlated variables. The parameter setup and the interpretation of results from deep learning methods, in particular deep neural networks, can be extremely challenging. Cluster analysis and mixed models may need some predimension reduction based on existing literature, data filtering, and supplementary statistical methods, and gene-set enrichment analysis requires biological insight.

Collapse

Berlin R, Gruen R, Best J. Systems Medicine Disease: Disease Classification and Scalability Beyond Networks and Boundary Conditions. Front Bioeng Biotechnol 2018;6:112. [PMID: 30131956 PMCID: PMC6090066 DOI: 10.3389/fbioe.2018.00112] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 07/18/2018] [Indexed: 12/26/2022] Open

Wang Y, Xia ST, Tang Q, Wu J, Zhu X. A Novel Consistent Random Forest Framework: Bernoulli Random Forests. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018;29:3510-3523. [PMID: 28816676 DOI: 10.1109/tnnls.2017.2729778] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Liu C, Liu B, Liu L, Zhang EL, Sun BD, Xu G, Chen J, Gao YQ. Arachidonic Acid Metabolism Pathway Is Not Only Dominant in Metabolic Modulation but Associated With Phenotypic Variation After Acute Hypoxia Exposure. Front Physiol 2018;9:236. [PMID: 29615930 PMCID: PMC5864929 DOI: 10.3389/fphys.2018.00236] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 03/02/2018] [Indexed: 12/22/2022] Open

Abstract

Background: The modulation of arachidonic acid (AA) metabolism pathway is identified in metabolic alterations after hypoxia exposure, but its biological function is controversial. We aimed at integrating plasma metabolomic and transcriptomic approaches to systematically explore the roles of the AA metabolism pathway in response to acute hypoxia using an acute mountain sickness (AMS) model. Methods: Blood samples were obtained from 53 enrolled subjects before and after exposure to high altitude. Ultra-performance liquid chromatography-quadrupole time-of-flight mass spectrometry and RNA sequencing were separately performed for metabolomic and transcriptomic profiling, respectively. Influential modules comprising essential metabolites and genes were identified by weighted gene co-expression network analysis (WGCNA) after integrating metabolic information with phenotypic and transcriptomic datasets, respectively. Results: Enrolled subjects exhibited diverse response manners to hypoxia. Combined with obviously altered heart rate, oxygen saturation, hemoglobin, and Lake Louise Score (LLS), metabolomic profiling detected that 36 metabolites were highly related to clinical features in hypoxia responses, out of which 27 were upregulated and nine were downregulated, and could be mapped to AA metabolism pathway significantly. Integrated analysis of metabolomic and transcriptomic data revealed that these dominant molecules showed remarkable association with genes in gas transport incapacitation and disorders of hemoglobin metabolism pathways, such as ALAS2, HEMGN. After detailed description of AA metabolism pathway, we found that the molecules of 15-d-PGJ2, PGA2, PGE2, 12-O-3-OH-LTB4, LTD4, LTE4 were significantly up-regulated after hypoxia stimuli, and increased in those with poor response manner to hypoxia particularly. Further analysis in another cohort showed that genes in AA metabolism pathway such as PTGES, PTGS1, GGT1, TBAS1 et al. were excessively elevated in subjects in maladaptation to hypoxia. Conclusion: This is the first study to construct the map of AA metabolism pathway in response to hypoxia and reveal the crosstalk between phenotypic variation under hypoxia and the AA metabolism pathway. These findings may improve our understanding of the advanced pathophysiological mechanisms in acute hypoxic diseases and provide new insights into critical roles of the AA metabolism pathway in the development and prevention of these diseases.

Collapse

Affiliation(s)

Chang Liu Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China
Bao Liu Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China.,The 12th Hospital of Chinese People's Liberation Army, Kashi, China
Lu Liu Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China
Er-Long Zhang Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China
Bind-da Sun Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China
Gang Xu Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China
Jian Chen Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China
Yu-Qi Gao Institute of Medicine and Hygienic Equipment for High Altitude Region, College of High Altitude Military Medicine, Army Medical University, Third Military Medical University, Chongqing, China.,Key Laboratory of High Altitude Environmental Medicine, Army Medical University, Third Military Medical University, Ministry of Education, Chongqing, China.,Key Laboratory of High Altitude Medicine, People's Liberation Army, Chongqing, China

Collapse

Acharjee A, Chibon PY, Kloosterman B, America T, Renaut J, Maliepaard C, Visser RGF. Genetical genomics of quality related traits in potato tubers using proteomics. BMC PLANT BIOLOGY 2018;18:20. [PMID: 29361908 PMCID: PMC5781343 DOI: 10.1186/s12870-018-1229-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 01/09/2018] [Indexed: 05/21/2023]

Abstract

BACKGROUND

Recent advances in ~omics technologies such as transcriptomics, metabolomics and proteomics along with genotypic profiling have permitted the genetic dissection of complex traits such as quality traits in non-model species. To get more insight into the genetic factors underlying variation in quality traits related to carbohydrate and starch metabolism and cold sweetening, we determined the protein content and composition in potato tubers using 2D-gel electrophoresis in a diploid potato mapping population. Upon analyzing we made sure that the proteins from the patatin family were excluded to ensure a better representation of the other proteins.

RESULTS

We subsequently performed pQTL analyses for all other proteins with a sufficient representation in the population and established a relationship between proteins and 26 potato tuber quality traits (e.g. flesh colour, enzymatic discoloration) by co-localization on the genetic map and a direct correlation study of protein abundances and phenotypic traits. Over 1643 unique protein spots were detected in total over the two harvests. We were able to map pQTLs for over 300 different protein spots some of which co-localized with traits such as starch content and cold sweetening. pQTLs were observed on every chromosome although not evenly distributed over the chromosomes. The largest number of pQTLs was found for chromosome 8 and the lowest for chromosome number 10. For some 20 protein spots multiple QTLs were observed.

CONCLUSIONS

From this analysis, hotspot areas for protein QTLs were identified on chromosomes three, five, eight and nine. The hotspot on chromosome 3 coincided with a QTL previously identified for total protein content and had more than 23 pQTLs in the region from 70 to 80 cM. Some of the co-localizing protein spots associated with some of the most interesting tuber quality traits were identified, albeit far less than we had anticipated at the onset of the experiments.

Collapse

Kim M, Tagkopoulos I. Data integration and predictive modeling methods for multi-omics datasets. Mol Omics 2018;14:8-25. [DOI: 10.1039/c7mo00051k] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Voelckel C, Gruenheit N, Lockhart P. Evolutionary Transcriptomics and Proteomics: Insight into Plant Adaptation. TRENDS IN PLANT SCIENCE 2017;22:462-471. [PMID: 28365131 DOI: 10.1016/j.tplants.2017.03.001] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 02/21/2017] [Accepted: 03/01/2017] [Indexed: 06/07/2023]

IPF-LASSO: Integrative L₁-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2017;2017:7691937. [PMID: 28546826 PMCID: PMC5435977 DOI: 10.1155/2017/7691937] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2017] [Accepted: 03/14/2017] [Indexed: 11/29/2022]

Kopczynski D, Coman C, Zahedi RP, Lorenz K, Sickmann A, Ahrends R. Multi-OMICS: a critical technical perspective on integrative lipidomics approaches. Biochim Biophys Acta Mol Cell Biol Lipids 2017;1862:808-811. [PMID: 28193460 DOI: 10.1016/j.bbalip.2017.02.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Revised: 02/03/2017] [Accepted: 02/06/2017] [Indexed: 02/06/2023]

Acharjee A, Prentice P, Acerini C, Smith J, Hughes IA, Ong K, Griffin JL, Dunger D, Koulman A. The translation of lipid profiles to nutritional biomarkers in the study of infant metabolism. Metabolomics 2017;13:25. [PMID: 28190990 PMCID: PMC5272886 DOI: 10.1007/s11306-017-1166-2] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 01/12/2017] [Indexed: 02/02/2023]

Abstract

INTRODUCTION

Links between early life exposures and later health outcomes may, in part, be due to nutritional programming in infancy. This hypothesis is supported by observed long-term benefits associated with breastfeeding, such as better cognitive development in childhood, and lower risks of obesity and high blood pressure in later life. However, the possible underlying mechanisms are expected to be complex and may be difficult to disentangle due to the lack of understanding of the metabolic processes that differentiate breastfed infants compared to those receiving just formula feed.

OBJECTIVE

Our aim was to investigate the relationships between infant feeding and the lipid profiles and to validate specific lipids in separate datasets so that a small set of lipids can be used as nutritional biomarkers.

METHOD

We utilized a direct infusion high-resolution mass spectrometry method to analyse the lipid profiles of 3.2 mm dried blood spot samples collected at age 3 months from the Cambridge Baby Growth Study (CBGS-1), which formed the discovery cohort. For validation two sample sets were profiled: Cambridge Baby Growth Study (CBGS-2) and Pregnancy Outcome Prediction Study (POPS). Lipidomic profiles were compared between infant groups who were either exclusively breastfed, exclusively formula-fed or mixed-fed at various levels. Data analysis included supervised Random Forest method with combined classification and regression mode. Selection of lipids was based on an iterative backward elimination procedure without compromising the class error in the classification mode.

CONCLUSION

From this study, we were able to identify and validate three lipids: PC(35:2), SM(36:2) and SM(39:1) that can be used collectively as biomarkers for infant nutrition during early development. These biomarkers can be used to determine whether young infants (3-6 months) are breast-fed or receive formula milk.

Collapse

Fabres PJ, Collins C, Cavagnaro TR, Rodríguez López CM. A Concise Review on Multi-Omics Data Integration for Terroir Analysis in Vitis vinifera. FRONTIERS IN PLANT SCIENCE 2017;8:1065. [PMID: 28676813 PMCID: PMC5477006 DOI: 10.3389/fpls.2017.01065] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 06/02/2017] [Indexed: 05/19/2023]

Bhattacharjee B, Shafi M, Acharjee A. Investigating the Influence Relationship Models for Stocks in Indian Equity Market: A Weighted Network Modelling Study. PLoS One 2016;11:e0166087. [PMID: 27846251 PMCID: PMC5113066 DOI: 10.1371/journal.pone.0166087] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Accepted: 10/22/2016] [Indexed: 11/18/2022] Open