1
|
Bisutti V, Mota LFM, Giannuzzi D, Toscano A, Amalfitano N, Schiavon S, Pegolo S, Cecchinato A. Infrared spectroscopy coupled with machine learning algorithms for predicting the detailed milk mineral profile in dairy cattle. Food Chem 2024; 461:140800. [PMID: 39163724 DOI: 10.1016/j.foodchem.2024.140800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 08/02/2024] [Accepted: 08/06/2024] [Indexed: 08/22/2024]
Abstract
Milk minerals are not only essential components for human health, but they can be informative for milk quality and cow's health. Herein, we investigated the feasibility of Fourier Transformed mid Infrared (FTIR) spectroscopy for the prediction of a detailed panel of 17 macro, trace, and environmental elements in bovine milk, using partial least squares regression (PLS) and machine learning approaches. The automatic machine learning significantly outperformed the PLS regression in terms of prediction performances of the mineral elements. For macrominerals, the R2 ranged from 0.59 to 0.78. Promising predictability was achieved for Cu and B (R2 = 0.66 and 0.74, respectively) and more moderate ones for Fe, Mn, Zn, and Al (R2 from 0.48 to 0.58). These results provide a reliable basis for a rapid and cost-effective quantification of these traits, serving as a resource for dairy farmers seeking to enhance the quality of milk production and optimize cheese properties.
Collapse
Affiliation(s)
- Vittoria Bisutti
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Lucio Flavio Macedo Mota
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Diana Giannuzzi
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Alessandro Toscano
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Nicolò Amalfitano
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Stefano Schiavon
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Sara Pegolo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| | - Alessio Cecchinato
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020, Legnaro (PD), Italy.
| |
Collapse
|
2
|
Khan UM, Sameen A, Decker EA, Shabbir MA, Hussain S, Latif A, Abdi G, Aadil RM. Implementation of plant extracts for cheddar-type cheese production in conjunction with FTIR and Raman spectroscopy comparison. Food Chem X 2024; 22:101256. [PMID: 38495457 PMCID: PMC10943033 DOI: 10.1016/j.fochx.2024.101256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 02/13/2024] [Accepted: 02/25/2024] [Indexed: 03/19/2024] Open
Abstract
Plant extracts have demonstrated the ability to act as coagulants for milk coagulation at an adequate concentration, wide temperatures and pH ranges. This research is focused on the use of different vegetative extracts such as Citrus aurnatium flower extract (CAFE), bromelain, fig latex, and melon extract as economical and beneficial coagulants in the development of plant-based cheddar-type cheese. The cheddar-type cheese samples were subjected to physicochemical analysis in comparison to controlled cheese samples made from acetic acid and rennet. The fat, moisture, protein, and salt contents remained the same over the storage period, but a slight decline was observed in pH. The Ferric reducing antioxidant power (FRAP) increased with the passage of the ripening period. The FTIR and Raman spectra showed exponential changes and qualitative estimates in the binding and vibrational structure of lipids and protein in plant-based cheeses. The higher FTIR and Raman spectra bands were observed in acid, rennet, bromelain, and CAFE due to their firm and strong texture of cheese while lower spectra were observed in cheese made from melon extract due to weak curdling and textural properties. These plant extracts are economical and easily available alternative sources for cheese production with higher protein and nutritional contents.
Collapse
Affiliation(s)
- Usman Mir Khan
- National Institute of Food Science and Technology, University of Agriculture, Faisalabad 38000, Pakistan
| | - Aysha Sameen
- Department of Food Science and Technology, Government College Women University, Faisalabad 38000, Pakistan
| | - Eric Andrew Decker
- Department of Food Science, University of Massachusetts, Amherst, MA 01003, USA
| | - Muhammad Asim Shabbir
- National Institute of Food Science and Technology, University of Agriculture, Faisalabad 38000, Pakistan
| | - Shahzad Hussain
- Department of Food Science and Nutrition, College of Food and Agriculture, King Saud University, Riyadh 11451, Saudi Arabia
| | - Anam Latif
- Institute of Food Science and Nutrition, University of Sargodha, Sargodha 40100, Pakistan
| | - Gholamreza Abdi
- Department of Biotechnology, Persian Gulf Research Institute, Persian Gulf University, Bushehr 75169, Iran
| | - Rana Muhammad Aadil
- National Institute of Food Science and Technology, University of Agriculture, Faisalabad 38000, Pakistan
| |
Collapse
|
3
|
Mota LFM, Giannuzzi D, Pegolo S, Toledo-Alvarado H, Schiavon S, Gallo L, Trevisi E, Arazi A, Katz G, Rosa GJM, Cecchinato A. Combining genetic markers, on-farm information and infrared data for the in-line prediction of blood biomarkers of metabolic disorders in Holstein cattle. J Anim Sci Biotechnol 2024; 15:83. [PMID: 38851729 PMCID: PMC11162571 DOI: 10.1186/s40104-024-01042-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 04/28/2024] [Indexed: 06/10/2024] Open
Abstract
BACKGROUND Various blood metabolites are known to be useful indicators of health status in dairy cattle, but their routine assessment is time-consuming, expensive, and stressful for the cows at the herd level. Thus, we evaluated the effectiveness of combining in-line near infrared (NIR) milk spectra with on-farm (days in milk [DIM] and parity) and genetic markers for predicting blood metabolites in Holstein cattle. Data were obtained from 388 Holstein cows from a farm with an AfiLab system. NIR spectra, on-farm information, and single nucleotide polymorphisms (SNP) markers were blended to develop calibration equations for blood metabolites using the elastic net (ENet) approach, considering 3 models: (1) Model 1 (M1) including only NIR information, (2) Model 2 (M2) with both NIR and on-farm information, and (3) Model 3 (M3) combining NIR, on-farm and genomic information. Dimension reduction was considered for M3 by preselecting SNP markers from genome-wide association study (GWAS) results. RESULTS Results indicate that M2 improved the predictive ability by an average of 19% for energy-related metabolites (glucose, cholesterol, NEFA, BHB, urea, and creatinine), 20% for liver function/hepatic damage, 7% for inflammation/innate immunity, 24% for oxidative stress metabolites, and 23% for minerals compared to M1. Meanwhile, M3 further enhanced the predictive ability by 34% for energy-related metabolites, 32% for liver function/hepatic damage, 22% for inflammation/innate immunity, 42.1% for oxidative stress metabolites, and 41% for minerals, compared to M1. We found improved predictive ability of M3 using selected SNP markers from GWAS results using a threshold of > 2.0 by 5% for energy-related metabolites, 9% for liver function/hepatic damage, 8% for inflammation/innate immunity, 22% for oxidative stress metabolites, and 9% for minerals. Slight reductions were observed for phosphorus (2%), ferric-reducing antioxidant power (1%), and glucose (3%). Furthermore, it was found that prediction accuracies are influenced by using more restrictive thresholds (-log10(P-value) > 2.5 and 3.0), with a lower increase in the predictive ability. CONCLUSION Our results highlighted the potential of combining several sources of information, such as genetic markers, on-farm information, and in-line NIR infrared data improves the predictive ability of blood metabolites in dairy cattle, representing an effective strategy for large-scale in-line health monitoring in commercial herds.
Collapse
Affiliation(s)
- Lucio F M Mota
- Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padova, Legnaro, Padova, 35020, Italy
| | - Diana Giannuzzi
- Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padova, Legnaro, Padova, 35020, Italy.
| | - Sara Pegolo
- Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padova, Legnaro, Padova, 35020, Italy
| | - Hugo Toledo-Alvarado
- Department of Genetics and Biostatistics, School of Veterinary Medicine and Zootechnics, National Autonomous University of Mexico, Ciudad Universitaria, Mexico City, 04510, Mexico
| | - Stefano Schiavon
- Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padova, Legnaro, Padova, 35020, Italy
| | - Luigi Gallo
- Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padova, Legnaro, Padova, 35020, Italy
| | - Erminio Trevisi
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food, and Environmental Sciences, Università Cattolica del Sacro Cuore, Piacenza, 29122, Italy
| | | | - Gil Katz
- Afimilk LTD, Afikim, 15148, Israel
| | - Guilherme J M Rosa
- Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, 53706, USA
| | - Alessio Cecchinato
- Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padova, Legnaro, Padova, 35020, Italy
| |
Collapse
|
4
|
Zhang X, Chen S, Zhang P, Wang C, Wang Q, Zhou X. Staging of Liver Fibrosis Based on Energy Valley Optimization Multiple Stacking (EVO-MS) Model. Bioengineering (Basel) 2024; 11:485. [PMID: 38790352 PMCID: PMC11117710 DOI: 10.3390/bioengineering11050485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 05/09/2024] [Accepted: 05/10/2024] [Indexed: 05/26/2024] Open
Abstract
Currently, staging the degree of liver fibrosis predominantly relies on liver biopsy, a method fraught with potential risks, such as bleeding and infection. With the rapid development of medical imaging devices, quantification of liver fibrosis through image processing technology has become feasible. Stacking technology is one of the effective ensemble techniques for potential usage, but precise tuning to find the optimal configuration manually is challenging. Therefore, this paper proposes a novel EVO-MS model-a multiple stacking ensemble learning model optimized by the energy valley optimization (EVO) algorithm to select most informatic features for fibrosis quantification. Liver contours are profiled from 415 biopsied proven CT cases, from which 10 shape features are calculated and inputted into a Support Vector Machine (SVM) classifier to generate the accurate predictions, then the EVO algorithm is applied to find the optimal parameter combination to fuse six base models: K-Nearest Neighbors (KNNs), Decision Tree (DT), Naive Bayes (NB), Extreme Gradient Boosting (XGB), Gradient Boosting Decision Tree (GBDT), and Random Forest (RF), to create a well-performing ensemble model. Experimental results indicate that selecting 3-5 feature parameters yields satisfactory results in classification, with features such as the contour roundness non-uniformity (Rmax), maximum peak height of contour (Rp), and maximum valley depth of contour (Rm) significantly influencing classification accuracy. The improved EVO algorithm, combined with a multiple stacking model, achieves an accuracy of 0.864, a precision of 0.813, a sensitivity of 0.912, a specificity of 0.824, and an F1-score of 0.860, which demonstrates the effectiveness of our EVO-MS model in staging the degree of liver fibrosis.
Collapse
Affiliation(s)
- Xuejun Zhang
- School of Computer, Electronics and Information, Guangxi University, Nanning 530004, China; (X.Z.); (P.Z.); (C.W.)
- Guangxi Key Laboratory of Multimedia Communications and Network Technology, Guangxi University, Nanning 530004, China
| | - Shengxiang Chen
- School of Computer, Electronics and Information, Guangxi University, Nanning 530004, China; (X.Z.); (P.Z.); (C.W.)
| | - Pengfei Zhang
- School of Computer, Electronics and Information, Guangxi University, Nanning 530004, China; (X.Z.); (P.Z.); (C.W.)
| | - Chun Wang
- School of Computer, Electronics and Information, Guangxi University, Nanning 530004, China; (X.Z.); (P.Z.); (C.W.)
| | - Qibo Wang
- School of Computer, Electronics and Information, Guangxi University, Nanning 530004, China; (X.Z.); (P.Z.); (C.W.)
| | - Xiangrong Zhou
- Department of Electrical, Electronic and Computer Engineering, Gifu University, Gifu 501-1193, Japan;
| |
Collapse
|
5
|
Mota LFM, Giannuzzi D, Pegolo S, Sturaro E, Gianola D, Negrini R, Trevisi E, Ajmone Marsan P, Cecchinato A. Genomic prediction of blood biomarkers of metabolic disorders in Holstein cattle using parametric and nonparametric models. Genet Sel Evol 2024; 56:31. [PMID: 38684971 PMCID: PMC11057143 DOI: 10.1186/s12711-024-00903-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 04/12/2024] [Indexed: 05/02/2024] Open
Abstract
BACKGROUND Metabolic disturbances adversely impact productive and reproductive performance of dairy cattle due to changes in endocrine status and immune function, which increase the risk of disease. This may occur in the post-partum phase, but also throughout lactation, with sub-clinical symptoms. Recently, increased attention has been directed towards improved health and resilience in dairy cattle, and genomic selection (GS) could be a helpful tool for selecting animals that are more resilient to metabolic disturbances throughout lactation. Hence, we evaluated the genomic prediction of serum biomarkers levels for metabolic distress in 1353 Holsteins genotyped with the 100K single nucleotide polymorphism (SNP) chip assay. The GS was evaluated using parametric models best linear unbiased prediction (GBLUP), Bayesian B (BayesB), elastic net (ENET), and nonparametric models, gradient boosting machine (GBM) and stacking ensemble (Stack), which combines ENET and GBM approaches. RESULTS The results show that the Stack approach outperformed other methods with a relative difference (RD), calculated as an increment in prediction accuracy, of approximately 18.0% compared to GBLUP, 12.6% compared to BayesB, 8.7% compared to ENET, and 4.4% compared to GBM. The highest RD in prediction accuracy between other models with respect to GBLUP was observed for haptoglobin (hapto) from 17.7% for BayesB to 41.2% for Stack; for Zn from 9.8% (BayesB) to 29.3% (Stack); for ceruloplasmin (CuCp) from 9.3% (BayesB) to 27.9% (Stack); for ferric reducing antioxidant power (FRAP) from 8.0% (BayesB) to 40.0% (Stack); and for total protein (PROTt) from 5.7% (BayesB) to 22.9% (Stack). Using a subset of top SNPs (1.5k) selected from the GBM approach improved the accuracy for GBLUP from 1.8 to 76.5%. However, for the other models reductions in prediction accuracy of 4.8% for ENET (average of 10 traits), 5.9% for GBM (average of 21 traits), and 6.6% for Stack (average of 16 traits) were observed. CONCLUSIONS Our results indicate that the Stack approach was more accurate in predicting metabolic disturbances than GBLUP, BayesB, ENET, and GBM and seemed to be competitive for predicting complex phenotypes with various degrees of mode of inheritance, i.e. additive and non-additive effects. Selecting markers based on GBM improved accuracy of GBLUP.
Collapse
Affiliation(s)
- Lucio F M Mota
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy.
| | - Diana Giannuzzi
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy
| | - Sara Pegolo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy.
| | - Enrico Sturaro
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy
| | - Daniel Gianola
- Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, 53706, USA
| | - Riccardo Negrini
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food, and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Erminio Trevisi
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food, and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
- Nutrigenomics and Proteomics Research Center, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Paolo Ajmone Marsan
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food, and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
- Nutrigenomics and Proteomics Research Center, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Alessio Cecchinato
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy
| |
Collapse
|
6
|
Giannuzzi D, Mota LFM, Pegolo S, Tagliapietra F, Schiavon S, Gallo L, Marsan PA, Trevisi E, Cecchinato A. Prediction of detailed blood metabolic profile using milk infrared spectra and machine learning methods in dairy cattle. J Dairy Sci 2023; 106:3321-3344. [PMID: 37028959 DOI: 10.3168/jds.2022-22454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 12/14/2022] [Indexed: 04/09/2023]
Abstract
The adoption of preventive management decisions is crucial to dealing with metabolic impairments in dairy cattle. Various serum metabolites are known to be useful indicators of the health status of cows. In this study, we used milk Fourier-transform mid-infrared (FTIR) spectra and various machine learning (ML) algorithms to develop prediction equations for a panel of 29 blood metabolites, including those related to energy metabolism, liver function/hepatic damage, oxidative stress, inflammation/innate immunity, and minerals. For most traits, the data set comprised observations from 1,204 Holstein-Friesian dairy cows belonging to 5 herds. An exception was represented by β-hydroxybutyrate prediction, which contained observations from 2,701 multibreed cows pertaining to 33 herds. The best predictive model was developed using an automatic ML algorithm that tested various methods, including elastic net, distributed random forest, gradient boosting machine, artificial neural network, and stacking ensemble. These ML predictions were compared with partial least squares regression, the most commonly used method for FTIR prediction of blood traits. Performance of each model was evaluated using 2 cross-validation (CV) scenarios: 5-fold random (CVr) and herd-out (CVh). We also tested the best model's ability to classify values precisely in the 2 extreme tails, namely, the 25th (Q25) and 75th (Q75) percentiles (true-positive prediction scenario). Compared with partial least squares regression, ML algorithms achieved more accurate performance. Specifically, elastic net increased the R2 value from 5% to 75% for CVr and 2% to 139% for CVh, whereas the stacking ensemble increased the R2 value from 4% to 70% for CVr and 4% to 150% for CVh. Considering the best model, with the CVr scenario, good prediction accuracies were obtained for glucose (R2 = 0.81), urea (R2 = 0.73), albumin (R2 = 0.75), total reactive oxygen metabolites (R2 = 0.79), total thiol groups (R2 = 0.76), ceruloplasmin (R2 = 0.74), total proteins (R2 = 0.81), globulins (R2 = 0.87), and Na (R2 = 0.72). Good prediction accuracy in classifying extreme values was achieved for glucose (Q25 = 70.8%, Q75 = 69.9%), albumin (Q25 = 72.3%), total reactive oxygen metabolites (Q25 = 75.1%, Q75 = 74%), thiol groups (Q75 = 70.4%), total proteins (Q25 = 72.4%, Q75 = 77.2.%), globulins (Q25 = 74.8%, Q75 = 81.5%), and haptoglobin (Q75 = 74.4%). In conclusion, our study shows that FTIR spectra can be used to predict blood metabolites with relatively good accuracy, depending on trait, and are a promising tool for large-scale monitoring.
Collapse
Affiliation(s)
- Diana Giannuzzi
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy.
| | - Lucio Flavio Macedo Mota
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy
| | - Sara Pegolo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy
| | - Franco Tagliapietra
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy
| | - Stefano Schiavon
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy
| | - Luigi Gallo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy
| | - Paolo Ajmone Marsan
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food and Environmental Sciences, Catholic University of the Sacred Heart, 29122, Piacenza, Italy; Nutrigenomics and Proteomics Research Center, Catholic University of the Sacred Heart, 29122, Piacenza, Italy
| | - Erminio Trevisi
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food and Environmental Sciences, Catholic University of the Sacred Heart, 29122, Piacenza, Italy
| | - Alessio Cecchinato
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro (PD), Italy
| |
Collapse
|
7
|
Mota LFM, Giannuzzi D, Pegolo S, Trevisi E, Ajmone-Marsan P, Cecchinato A. Integrating on-farm and genomic information improves the predictive ability of milk infrared prediction of blood indicators of metabolic disorders in dairy cows. Genet Sel Evol 2023; 55:23. [PMID: 37013482 PMCID: PMC10069109 DOI: 10.1186/s12711-023-00795-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 03/21/2023] [Indexed: 04/05/2023] Open
Abstract
BACKGROUND Blood metabolic profiles can be used to assess metabolic disorders and to evaluate the health status of dairy cows. Given that these analyses are time-consuming, expensive, and stressful for the cows, there has been increased interest in Fourier transform infrared (FTIR) spectroscopy of milk samples as a rapid, cost-effective alternative for predicting metabolic disturbances. The integration of FTIR data with other layers of information such as genomic and on-farm data (days in milk (DIM) and parity) has been proposed to further enhance the predictive ability of statistical methods. Here, we developed a phenotype prediction approach for a panel of blood metabolites based on a combination of milk FTIR data, on-farm data, and genomic information recorded on 1150 Holstein cows, using BayesB and gradient boosting machine (GBM) models, with tenfold, batch-out and herd-out cross-validation (CV) scenarios. RESULTS The predictive ability of these approaches was measured by the coefficient of determination (R2). The results show that, compared to the model that includes only FTIR data, integration of both on-farm (DIM and parity) and genomic information with FTIR data improves the R2 for blood metabolites across the three CV scenarios, especially with the herd-out CV: R2 values ranged from 5.9 to 17.8% for BayesB, from 8.2 to 16.9% for GBM with the tenfold random CV, from 3.8 to 13.5% for BayesB and from 8.6 to 17.5% for GBM with the batch-out CV, and from 8.4 to 23.0% for BayesB and from 8.1 to 23.8% for GBM with the herd-out CV. Overall, with the model that includes the three sources of data, GBM was more accurate than BayesB with accuracies across the CV scenarios increasing by 7.1% for energy-related metabolites, 10.7% for liver function/hepatic damage, 9.6% for oxidative stress, 6.1% for inflammation/innate immunity, and 11.4% for mineral indicators. CONCLUSIONS Our results show that, compared to using only milk FTIR data, a model integrating milk FTIR spectra with on-farm and genomic information improves the prediction of blood metabolic traits in Holstein cattle and that GBM is more accurate in predicting blood metabolites than BayesB, especially for the batch-out CV and herd-out CV scenarios.
Collapse
Affiliation(s)
- Lucio F M Mota
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy.
| | - Diana Giannuzzi
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy
| | - Sara Pegolo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy
| | - Erminio Trevisi
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food, and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
- Nutrigenomics and Proteomics Research Center, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Paolo Ajmone-Marsan
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food, and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
- Nutrigenomics and Proteomics Research Center, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Alessio Cecchinato
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020, Legnaro, PD, Italy
| |
Collapse
|
8
|
Cheng J, Sun J, Yao K, Xu M, Wang S, Fu L. Hyperspectral technique combined with stacking and blending ensemble learning method for detection of cadmium content in oilseed rape leaves. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE 2023; 103:2690-2699. [PMID: 36479694 DOI: 10.1002/jsfa.12376] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 10/21/2022] [Accepted: 12/01/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND Oilseed rape, as one of the most important oil crops, is an important source of vegetable oil and protein for mankind. As a non-essential element for plant growth, heavy metal cadmium (Cd) is easily absorbed by plants. Cd will inhibit the photosynthesis of plants, destroy the cell structure, slow the growth of plants, and affect their development and yield. It is necessary to develop a method based on visible near-infrared (NIR) hyperspectral imaging (HSI) technology to quickly and nondestructively determine the Cd content in rape leaves. RESULTS Two-layer estimation models were established by combining visible-NIR HSI with ensemble learning methods (stacking and blending). One layer used support vector regression, extreme learning machine, decision tree, and random forest (RF) as basic learners, and the other layer used support vector regression or RF as a meta learner. Different models were used to analyze the spectra of rape treated with five Cd concentrations to obtain the best prediction method. The results showed that the best model to predict Cd content was the stacking ensemble model with RF as the meta learner, with coefficient of determination for prediction of 0.9815 and root-mean-square error for prediction of 5.8969 mg kg-1 . A pseudo-color image was developed using this stacking model to visualize the content and distribution of Cd. CONCLUSION The combination of visible-NIR HSI technology and the stacking ensemble learning method is a feasible method to detect the Cd content in rape leaves, which has the potential of being rapid and nondestructive. © 2022 Society of Chemical Industry.
Collapse
Affiliation(s)
- Jiehong Cheng
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang, China
| | - Jun Sun
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang, China
| | - Kunshan Yao
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang, China
| | - Min Xu
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang, China
| | - Simin Wang
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang, China
| | - Lvhui Fu
- School of Electrical and Information Engineering of Jiangsu University, Zhenjiang, China
| |
Collapse
|
9
|
Establishment and Validation of Fourier Transform Infrared Spectroscopy (FT–MIR) Methodology for the Detection of Linoleic Acid in Buffalo Milk. Foods 2023; 12:foods12061199. [PMID: 36981127 PMCID: PMC10048274 DOI: 10.3390/foods12061199] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 02/28/2023] [Accepted: 03/10/2023] [Indexed: 03/14/2023] Open
Abstract
Buffalo milk is a dairy product that is considered to have a higher nutritional value compared to cow’s milk. Linoleic acid (LA) is an essential fatty acid that is important for human health. This study aimed to investigate and validate the use of Fourier transform mid-infrared spectroscopy (FT-MIR) for the quantification of the linoleic acid in buffalo milk. Three machine learning models were used to predict linoleic acid content, and random forest was employed to select the most important subset of spectra for improved model performance. The validity of the FT-MIR methods was evaluated in accordance with ICH Q2 (R1) guidelines using the accuracy profile method, and the precision, the accuracy, and the limit of quantification were determined. The results showed that Fourier transform infrared spectroscopy is a suitable technique for the analysis of linoleic acid, with a lower limit of quantification of 0.15 mg/mL milk. Our results showed that FT-MIR spectroscopy is a viable method for LA concentration analysis.
Collapse
|
10
|
Macedo Mota LF, Bisutti V, Vanzin A, Pegolo S, Toscano A, Schiavon S, Tagliapietra F, Gallo L, Ajmone Marsan P, Cecchinato A. Predicting milk protein fractions using infrared spectroscopy and a gradient boosting machine for breeding purposes in Holstein cattle. J Dairy Sci 2023; 106:1853-1873. [PMID: 36710177 DOI: 10.3168/jds.2022-22119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 10/10/2022] [Indexed: 01/29/2023]
Abstract
In recent years, increasing attention has been focused on the genetic evaluation of protein fractions in cow milk with the aim of improving milk quality and technological characteristics. In this context, advances in high-throughput phenotyping by Fourier transform infrared (FTIR) spectroscopy offer the opportunity for large-scale, efficient measurement of novel traits that can be exploited in breeding programs as indicator traits. We took milk samples from 2,558 Holstein cows belonging to 38 herds in northern Italy, operating under different production systems. Fourier transform infrared spectra were collected on the same day as milk sampling and stored for subsequent analysis. Two sets of data (i.e., phenotypes and FTIR spectra) collected in 2 different years (2013 and 2019-2020) were compiled. The following traits were assessed using HPLC: true protein, major casein fractions [αS1-casein (CN), αS2-CN, β-CN, κ-CN, and glycosylated-κ-CN], and major whey proteins (β-lactoglobulin and α-lactalbumin), all of which were measured both in grams per liter (g/L) and proportion of total nitrogen (% N). The FTIR predictions were calculated using the gradient boosting machine technique and tested by 3 different cross-validation (CRV) methods. We used the following CRV scenarios: (1) random 10-fold, which randomly split the whole into 10-folds of equal size (9-folds for training and 1-fold for validation); (2) herd/date-out CRV, which assigned 80% of herd/date as the training set with independence of 20% of herd/date assigned as the validation set; (3) forward/backward CRV, which split the data set in training and validation set according with the year of milk sampling (FTIR and gold standard data assessed in 2013 or 2019-2020) using the "old" and "new" databases for training and validation, and vice-versa with independence among them; (4) the CRV for genetic parameters (CRV-gen), where animals without pedigree as assigned as a fixed training population and animals with pedigree information was split in 5-folds, in which 1-fold was assigned to the fixed training population, and 4-folds were assigned to the validation set (independent from the training set). The results (i.e., measures and predictions) of CRV-gen were used to infer the genetic parameters for gold standard laboratory measurements (i.e., proteins assessed with HPLC) and FTIR-based predictions considering the CRV-gen scenario from a bi-trait animal model using single-step genomic BLUP. We found that the prediction accuracies of the gradient boosting machine equations differed according to the way in which the proteins were expressed, achieving higher accuracy when expressed in g/L than when expressed as % N in all CRV scenarios. Concerning the reproducibility of the equations over the different years, the results showed no relevant differences in predictive ability between using "old" data as the training set and "new" data as the validation set and vice-versa. Comparing the additive genetic variance estimates for milk protein fractions between the FTIR predicted and HPLC measures, we found reductions of -19.7% for milk protein fractions expressed in g/L, and -21.19% expressed as % N. Although we found reductions in the heritability estimates, they were small, with values ranging from -1.9 to -7.25% for g/L, and -1.6 to -7.9% for % N. The posterior distributions of the additive genetic correlations (ra) between the FTIR predictions and the laboratory measurements were generally high (>0.8), even when the milk protein fractions were expressed as % N. Our results show the potential of using FTIR predictions in breeding programs as indicator traits for the selection of animals to enhance milk protein fraction contents. We expect acceptable responses to selection due to the high genetic correlations between HPLC measurements and FTIR predictions.
Collapse
Affiliation(s)
- L F Macedo Mota
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - V Bisutti
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - A Vanzin
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - S Pegolo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy.
| | - A Toscano
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - S Schiavon
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - F Tagliapietra
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - L Gallo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| | - P Ajmone Marsan
- Department of Animal Science, Food and Nutrition (DIANA) and Research Center Romeo and Enrica Invernizzi for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122 Piacenza, Italy
| | - A Cecchinato
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy
| |
Collapse
|
11
|
Comparison of machine learning models for bluetongue risk prediction: a seroprevalence study on small ruminants. BMC Vet Res 2022; 18:394. [DOI: 10.1186/s12917-022-03486-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Accepted: 10/25/2022] [Indexed: 11/11/2022] Open
Abstract
Abstract
Background
Bluetongue (BT) is a disease of concern to animal breeders, so the question on their minds is whether they can predict the risk of the disease before it occurs. The main objective of this study is to enhance the accuracy of BT risk prediction by relying on machine learning (ML) approaches to help in fulfilling this inquiry. Several risk factors of BT that affect the occurrence and magnitude of animal infection with the virus have been reported globally. Additionally, risk factors, such as sex, age, species, and season, unevenly affect animal health and welfare. Therefore, the seroprevalence study data of 233 apparently healthy animals (125 sheep and 108 goats) from five different provinces in Egypt were used to analyze and compare the performance of the algorithms in predicting BT risk.
Results
Logistic regression (LR), decision tree (DT), random forest (RF), and a feedforward artificial neural network (ANN) were used to develop predictive BT risk models and compare their performance to the base model (LR). Model performance was assessed by the area under the receiver operating characteristics curve (AUC), accuracy, true positive rate (TPR), false positive rate (FPR), false negative rate (FNR), precision, and F1 score. The results indicated that RF performed better than other models, with an AUC score of 81%, ANN of 79.6%, and DT of 72.85%. In terms of performance and prediction, LR showed a much lower value (AUC = 69%). Upon further observation of the results, it was discovered that age and season were the most important predictor variables reported in classification and prediction.
Conclusion
The findings of this study can be utilized to predict and control BT risk factors in sheep and goats, with better diagnostic discrimination in terms of accuracy, TPR, FNR, FPR, and precision of ML models over traditional and commonly used LR models. Our findings advocate that the implementation of ML algorithms, mainly RF, in farm decision making and prediction is a promising technique for analyzing cross-section studies, providing adequate predictive power and significant competence in identifying and ranking predictors representing potential risk factors for BT.
Collapse
|
12
|
Giannuzzi D, Mota LFM, Pegolo S, Gallo L, Schiavon S, Tagliapietra F, Katz G, Fainboym D, Minuti A, Trevisi E, Cecchinato A. In-line near-infrared analysis of milk coupled with machine learning methods for the daily prediction of blood metabolic profile in dairy cattle. Sci Rep 2022; 12:8058. [PMID: 35577915 PMCID: PMC9110744 DOI: 10.1038/s41598-022-11799-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 04/12/2022] [Indexed: 12/29/2022] Open
Abstract
Precision livestock farming technologies are used to monitor animal health and welfare parameters continuously and in real time in order to optimize nutrition and productivity and to detect health issues at an early stage. The possibility of predicting blood metabolites from milk samples obtained during routine milking by means of infrared spectroscopy has become increasingly attractive. We developed, for the first time, prediction equations for a set of blood metabolites using diverse machine learning methods and milk near-infrared spectra collected by the AfiLab instrument. Our dataset was obtained from 385 Holstein Friesian dairy cows. Stacking ensemble and multi-layer feedforward artificial neural network outperformed the other machine learning methods tested, with a reduction in the root mean square error of between 3 and 6% in most blood parameters. We obtained moderate correlations (r) between the observed and predicted phenotypes for γ-glutamyl transferase (r = 0.58), alkaline phosphatase (0.54), haptoglobin (0.66), globulins (0.61), total reactive oxygen metabolites (0.60) and thiol groups (0.57). The AfiLab instrument has strong potential but may not yet be ready to predict the metabolic stress of dairy cows in practice. Further research is needed to find out methods that allow an improvement in accuracy of prediction equations.
Collapse
Affiliation(s)
- Diana Giannuzzi
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy.
| | - Lucio Flavio Macedo Mota
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
| | - Sara Pegolo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
| | - Luigi Gallo
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
| | - Stefano Schiavon
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
| | - Franco Tagliapietra
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
| | - Gil Katz
- Afimilk Ltd., 1514800, Kibbutz Afikim, Israel
| | | | - Andrea Minuti
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Erminio Trevisi
- Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Alessio Cecchinato
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
| |
Collapse
|