1
|
Oh S, Byeon H, Wijaya J. Machine learning surveillance of foodborne infectious diseases using wastewater microbiome, crowdsourced, and environmental data. WATER RESEARCH 2024; 265:122282. [PMID: 39178596 DOI: 10.1016/j.watres.2024.122282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 08/14/2024] [Accepted: 08/15/2024] [Indexed: 08/26/2024]
Abstract
Clostridium perfringens (CP) is a common cause of foodborne infection, leading to significant human health risks and a high economic burden. Thus, effective CP disease surveillance is essential for preventive and therapeutic interventions; however, conventional practices often entail complex, resource-intensive, and costly procedures. This study introduced a data-driven machine learning (ML) modeling framework for CP-related disease surveillance. It leveraged an integrated dataset of municipal wastewater microbiome (e.g., CP abundance), crowdsourced (CP-related web search keywords), and environmental data. Various optimization strategies, including data integration, data normalization, model selection, and hyperparameter tuning, were implemented to improve the ML modeling performance, leading to enhanced predictions of CP cases over time. Explainable artificial intelligence methods identified CP abundance as the most reliable predictor of CP disease cases. Multi-omics subsequently revealed the presence of CP and its genotypes/toxinotypes in wastewater, validating the utility of microbiome-data-enabled ML surveillance for foodborne diseases. This ML-based framework thus exhibits significant potential for complementing and reinforcing existing disease surveillance systems.
Collapse
Affiliation(s)
- Seungdae Oh
- Department of Civil Engineering, College of Engineering, Kyung Hee University, Yongin, Republic of Korea.
| | - Haeil Byeon
- Department of Civil Engineering, College of Engineering, Kyung Hee University, Yongin, Republic of Korea
| | - Jonathan Wijaya
- Department of Civil Engineering, College of Engineering, Kyung Hee University, Yongin, Republic of Korea
| |
Collapse
|
2
|
Rowe SM, Zhang E, Godden SM, Vasquez AK, Nydam DV. Comparison of a machine learning model with a conventional rule-based selective dry cow therapy algorithm for detection of intramammary infections. J Dairy Sci 2024:S0022-0302(24)01180-9. [PMID: 39343221 DOI: 10.3168/jds.2024-25418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 08/26/2024] [Indexed: 10/01/2024]
Abstract
We trained machine learning models to identify intramammary infections (IMI) in late lactation cows at dry-off to guide antibiotic treatment, and compared their performance to a rule-based algorithm that is currently used on dairy farms in the US. We conducted an observational test-characteristics study using a data set of 3,645 cows approaching dry-off from 68 US dairy herds. The outcome variables of interest were cow-level IMI caused by all pathogens, major pathogens, and Streptococcus and Strep-like organisms (SSLO), which were determined using aerobic culture of aseptic quarter-milk samples and identification of isolates using MALDI-TOF. Individual cow records were extracted from the farm software to create 53 feature variables at the cow and 39 at the herd-level which were derived from cow-level descriptive data, records of clinical mastitis events, results from routine testing of milk for volume and concentrations of somatic cell count (SCC), fat, and protein. ML algorithms evaluated were logistic regression, decision tree, random forest, light gradient-boosting machine, naïve bayes, and neural networks. For comparison, cows were also classified according to a conventional rule-based algorithm that considered a cow as high risk for IMI if she had at one or more high SCC (>200,000 cells/ml) tests or ≥2 cases of clinical mastitis during the lactation of enrollment. Area under the curve (AUC) and Youden's index were used to compare models, in addition to binary classification metrics, including sensitivity, specificity, and predictive values. ML models had slightly higher AUC and Youden's index values than the rule-based algorithm for all IMI outcomes of interest. However, these improvements in prediction accuracy were substantially less than what we had considered necessary for the technology to be a worthwhile alternative to the rule-based algorithm. Therefore, evidence is lacking to support the wholesale use of ML-guided selective dry cow therapy at the moment. We recommend that producers wanting to implement algorithm-guided SDCT use a rule-based method.
Collapse
Affiliation(s)
- S M Rowe
- Sydney School of Veterinary Science, The University of Sydney, Camden, New South Wales 2570, Australia.
| | - E Zhang
- Sydney Informatics Hub, The University of Sydney, Camperdown, New South Wales, Australia
| | - S M Godden
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul 55108
| | | | - D V Nydam
- Department of Public and Ecosystem Health, College of Veterinary Medicine, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
3
|
Lu L, Luan Y, Wang H, Gao Y, Wu S, Zhao X. Flavonoid as a Potent Antioxidant: Quantitative Structure-Activity Relationship Analysis, Mechanism Study, and Molecular Design by Synergizing Molecular Simulation and Machine Learning. J Phys Chem A 2024; 128:6216-6228. [PMID: 39023240 DOI: 10.1021/acs.jpca.4c03241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
In this work, a quantitative structure-antioxidant activity relationship of flavonoids was performed using a machine learning (ML) method. To achieve lipid-soluble, highly antioxidant flavonoids, 398 molecular structures with various substitute groups were designed based on the flavonoid skeleton. The hydrogen dissociation energies (ΔG1, ΔG2, and ΔG3) related to multiple hydrogen atom transfer processes and the solubility parameter (δ) of flavonoids were calculated using molecular simulation. The group decomposition results and the calculated antioxidant parameters constituted the ML data set. The artificial neural network and random forest models were constructed to predict and analyze the contribution of the substitute groups and positions to the antioxidant activity. The results showed the hydroxyl group at positions B4', B5', and B6' and the branched alkyl group at position C3 in the flavonoid skeleton were the optimal choice for improving antioxidant activity and compatibility with apolar organic materials. Compared to the pyrogallol group-grafted flavonoid, the designed potent flavonoid decreased ΔG1 and δ by 2.2 and 15.1%, respectively, while ΔG2 and ΔG3 kept the favorable lower values. These findings suggest that an efficient flavonoid prefers multiple ortho-phenolic hydroxyl groups and suitable sites with hydrophobic groups. The combination of molecular simulation and the ML method may offer a new research approach for the molecular design of novel antioxidants.
Collapse
Affiliation(s)
- Ling Lu
- State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, P. R. China
- Research Institute of Petroleum Processing, SINOPEC, Beijing 100083, P. R. China
| | - Yajie Luan
- State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, P. R. China
| | - Huaqi Wang
- College of Materials Science and Engineering, Beijing University of Chemical Technology, Beijing 100029, P. R. China
| | - Yangyang Gao
- State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, P. R. China
| | - Sizhu Wu
- State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, P. R. China
| | - Xiuying Zhao
- College of Materials Science and Engineering, Beijing University of Chemical Technology, Beijing 100029, P. R. China
| |
Collapse
|
4
|
Olofsson C, Toftaker I, Rachah A, Reksen O, Kielland C. Pathogen-specific patterns of milking traits in automatic milking systems. J Dairy Sci 2024; 107:6035-6051. [PMID: 38554822 DOI: 10.3168/jds.2023-23933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 02/23/2024] [Indexed: 04/02/2024]
Abstract
Early detection of IMI can improve animal health and welfare in dairy herds. The implementation of sensors and automatic milking systems (AMS) in dairy production inherently increases the amount of available data and hence also the potential for new approaches to mastitis management. To use the full potential of data from AMS and auxiliary sensors, a better understanding of physiological and pathological changes in milking traits associated with different udder pathogens may be imperative. This observational study aimed to investigate pathogen-specific patterns in milking traits recorded in AMS. The milking traits included; online SCC (OCC), electrical conductivity (EC), milk yield (MY), and average milk flow rate (AMF). Data were collected for a study period of 2 years and included 101,492 milkings from 237 lactations in 169 cows from one farm. Measurements of OCC were recorded at cow level and data on EC, MY, and AMF were obtained at quarter level. In addition to the data obtained from the AMS, altogether 5,756 quarter milk samples were collected. Milk samples were obtained monthly for bacteriological culturing. We included findings of 13 known mastitis pathogens to study pathogen-specific patterns in milking traits. These patterns were compared with those in a baseline group consisting of cows that did not have any positive milk culture results throughout the lactation period. Patterns of the milking traits are described for all positive samples both across 305 DIM, and in the 15-d period before a positive bacteriological sample. The association between a positive sample and the milking traits [ln(OCC); EC-IQR, the ratio between the quarter with the highest and the quarter with the lowest level of EC; and MY] for the 15 d before the detection of a pathogen was assessed using mixed effects linear regression models. All pathogens were associated with alterations in the level and variability of ln(OCC) relative to lactations with no positive bacteriological samples. A positive sample for Staphylococcus aureus was associated with increased values for MY during the 15 d before a positive diagnosis. It is biologically plausible to interpret changes in OCC and EC-IQR as consequences of an IMI, while higher MY in bacteriologically-positive cows is most likely linked to the increased risk of infection in high-yielding cows. In this study, the most notable changes in the traits (OCC and EC-IQR) were observed for Staph. aureus and Streptococcus dysgalactiae, followed by Streptococcus simulans, Streptococcus uberis, and Lactococcus lactis. Even if we did not detect significant associations between positive bacteriology and EC-IQR, visual assessment and descriptive statistics indicated that there might be differences suggesting that it could be an informative trait for detecting infection when combined with OCC and possibly other relevant traits using machine learning algorithms.
Collapse
Affiliation(s)
- Charlott Olofsson
- Department of Production Animal Clinical Sciences, Faculty of Veterinary Medicine, Norwegian University of Life Sciences, 1433 Ås, Norway.
| | - Ingrid Toftaker
- Department of Production Animal Clinical Sciences, Faculty of Veterinary Medicine, Norwegian University of Life Sciences, 1433 Ås, Norway
| | - Amira Rachah
- Department of Sustainable Energy Technology, SINTEF Industry, Trondheim, 7031, Norway
| | - Olav Reksen
- Department of Production Animal Clinical Sciences, Faculty of Veterinary Medicine, Norwegian University of Life Sciences, 1433 Ås, Norway
| | - Camilla Kielland
- Department of Production Animal Clinical Sciences, Faculty of Veterinary Medicine, Norwegian University of Life Sciences, 1433 Ås, Norway
| |
Collapse
|
5
|
Mitsunaga TM, Nery Garcia BL, Pereira LBR, Costa YCB, da Silva RF, Delbem ACB, dos Santos MV. Current Trends in Artificial Intelligence and Bovine Mastitis Research: A Bibliometric Review Approach. Animals (Basel) 2024; 14:2023. [PMID: 39061485 PMCID: PMC11273831 DOI: 10.3390/ani14142023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 07/08/2024] [Accepted: 07/08/2024] [Indexed: 07/28/2024] Open
Abstract
Mastitis, an important disease in dairy cows, causes significant losses in herd profitability. Accurate diagnosis is crucial for adequate control. Studies using artificial intelligence (AI) models to classify, identify, predict, and diagnose mastitis show promise in improving mastitis control. This bibliometric review aimed to evaluate AI and bovine mastitis terms in the most relevant Scopus-indexed papers from 2011 to 2021. Sixty-two documents were analyzed, revealing key terms, prominent researchers, relevant publications, main themes, and keyword clusters. "Mastitis" and "machine learning" were the most cited terms, with an increasing trend from 2018 to 2021. Other terms, such as "sensors" and "mastitis detection", also emerged. The United States was the most cited country and presented the largest collaboration network. Publications on mastitis and AI models notably increased from 2016 to 2021, indicating growing interest. However, few studies utilized AI for bovine mastitis detection, primarily employing artificial neural network models. This suggests a clear potential for further research in this area.
Collapse
Affiliation(s)
- Thatiane Mendes Mitsunaga
- Luiz de Queiroz College of Agriculture—ESALQ, University of São Paulo, Av. Pádua Dias, 11, Piracicaba 13418-900, SP, Brazil;
| | - Breno Luis Nery Garcia
- School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga 13635-900, SP, Brazil; (B.L.N.G.); (L.B.R.P.)
| | - Ligia Beatriz Rizzanti Pereira
- School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga 13635-900, SP, Brazil; (B.L.N.G.); (L.B.R.P.)
| | | | - Roberto Fray da Silva
- Biosystems Engineering Department, Luiz de Queiroz College of Agriculture—ESALQ, University of São Paulo, Av. Pádua Dias, 11, Piracicaba 13418-900, SP, Brazil;
- Center for Artificial Intelligence—C4AI, University of Sao Paulo, Av. Prof. Lúcio Martins Rodrigues, 370-Butantã, São Paulo 05508-020, SP, Brazil;
| | - Alexandre Cláudio Botazzo Delbem
- Center for Artificial Intelligence—C4AI, University of Sao Paulo, Av. Prof. Lúcio Martins Rodrigues, 370-Butantã, São Paulo 05508-020, SP, Brazil;
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13560-970, SP, Brazil
| | - Marcos Veiga dos Santos
- School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga 13635-900, SP, Brazil; (B.L.N.G.); (L.B.R.P.)
| |
Collapse
|
6
|
Satoła A, Satoła K. Performance comparison of machine learning models used for predicting subclinical mastitis in dairy cows: Bagging, boosting, stacking, and super-learner ensembles versus single machine learning models. J Dairy Sci 2024; 107:3959-3972. [PMID: 38310958 DOI: 10.3168/jds.2023-24243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 12/23/2023] [Indexed: 02/06/2024]
Abstract
Mastitis has a substantial impact on the dairy industry across the world, causing dairy producers to suffer losses due to the reduced quality and quantity of produced milk. A further problem, related to this issue, is the excessive use of antibiotics that leads to the development of resistance in different bacterial strains. The growing consumer awareness oriented toward food safety and rational use of antibiotics has promoted the search for new methods of early identification of cows that may be at risk of developing the disease. Subclinical mastitis does not cause any visible changes to the udder or milk, and therefore it is more difficult to detect than clinical mastitis. The collection of large amounts of data related to milk performance of cows allows using machine learning (ML) methods to build models that could be used for classifying cows into healthy and at risk of subclinical mastitis. The data used for the purpose of this study included information from routine milk recording procedures. The dataset consisted of 19,856 records of 2,227 Polish Holstein-Friesian cows from 3 herds. The authors decided to use the approach of building ensemble ML models, in particular bagging, boosting, stacking, and super-learner models, and comparing them for accuracy of identification of disease-affected cows against single ML models based on the support vector machines, logistic regression, Gaussian Naive Bayes, k-nearest neighbors, and decision tree algorithms. The models were trained and evaluated based on the information recorded for herd 1 and using an 80:20 train-test split ratio according to animal ID (to avoid data leakage). The information recorded for herds 2 and 3 was only used to evaluate on unseen data models developed using the herd 1 dataset. Among the single ML models, the support vector machines model was found to be the most accurate in predicting subclinical mastitis at subsequent test day when used both for the training set (mean F1-score of 0.760) and the testing sets containing data for herds 1, 2, and 3 (F1-score of 0.778, 0.790, and 0.741 respectively). The gradient boosting model was found to be the best performing model among the ensemble ML models (F1-score of 0.762, 0.779, 0.791, and 0.723 for the training set and the testing sets, respectively). The super-learner model, featuring the most advanced design and logistic regression in the meta layer, achieved the highest mean F1-score of 0.775 during the cross validation; however, it was characterized by a slightly worse prediction accuracy of the testing sets (mean F1-score of 0.768, 0.790, and 0.693 for herds 1, 2 and 3 respectively). The study findings confirm the promising role of ensemble ML methods, which were found to be slightly superior with respect to most of the single ML models.
Collapse
Affiliation(s)
- A Satoła
- Department of Genetics, Animal Breeding and Ethology, Faculty of Animal Science, University of Agriculture in Krakow, 30-059 Krakow, Poland.
| | - K Satoła
- Independent researcher, 31-416 Krakow, Poland
| |
Collapse
|
7
|
Zhang X, Li Y, Zhang Y, Yao Z, Zou W, Nie P, Yang L. A New Method to Detect Buffalo Mastitis Using Udder Ultrasonography Based on Deep Learning Network. Animals (Basel) 2024; 14:707. [PMID: 38473092 DOI: 10.3390/ani14050707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Revised: 01/28/2024] [Accepted: 02/05/2024] [Indexed: 03/14/2024] Open
Abstract
Mastitis is one of the most predominant diseases with a negative impact on ranch products worldwide. It reduces milk production, damages milk quality, increases treatment costs, and even leads to the premature elimination of animals. In addition, failure to take effective measures in time will lead to widespread disease. The key to reducing the losses caused by mastitis lies in the early detection of the disease. The application of deep learning with powerful feature extraction capability in the medical field is receiving increasing attention. The main purpose of this study was to establish a deep learning network for buffalo quarter-level mastitis detection based on 3054 ultrasound images of udders from 271 buffaloes. Two data sets were generated with thresholds of somatic cell count (SCC) set as 2 × 105 cells/mL and 4 × 105 cells/mL, respectively. The udders with SCCs less than the threshold value were defined as healthy udders, and otherwise as mastitis-stricken udders. A total of 3054 udder ultrasound images were randomly divided into a training set (70%), a validation set (15%), and a test set (15%). We used the EfficientNet_b3 model with powerful learning capabilities in combination with the convolutional block attention module (CBAM) to train the mastitis detection model. To solve the problem of sample category imbalance, the PolyLoss module was used as the loss function. The training set and validation set were used to develop the mastitis detection model, and the test set was used to evaluate the network's performance. The results showed that, when the SCC threshold was 2 × 105 cells/mL, our established network exhibited an accuracy of 70.02%, a specificity of 77.93%, a sensitivity of 63.11%, and an area under the receiver operating characteristics curve (AUC) of 0.77 on the test set. The classification effect of the model was better when the SCC threshold was 4 × 105 cells/mL than when the SCC threshold was 2 × 105 cells/mL. Therefore, when SCC ≥ 4 × 105 cells/mL was defined as mastitis, our established deep neural network was determined as the most suitable model for farm on-site mastitis detection, and this network model exhibited an accuracy of 75.93%, a specificity of 80.23%, a sensitivity of 70.35%, and AUC 0.83 on the test set. This study established a 1/4 level mastitis detection model which provides a theoretical basis for mastitis detection in buffaloes mostly raised by small farmers lacking mastitis diagnostic conditions in developing countries.
Collapse
Affiliation(s)
- Xinxin Zhang
- National Center for International Research on Animal Genetics, Breeding and Reproduction (NCIRAGBR), Ministry of Science and Technology of the People's Republic of China, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Yuan Li
- National Center for International Research on Animal Genetics, Breeding and Reproduction (NCIRAGBR), Ministry of Science and Technology of the People's Republic of China, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Yiping Zhang
- National Center for International Research on Animal Genetics, Breeding and Reproduction (NCIRAGBR), Ministry of Science and Technology of the People's Republic of China, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Zhiqiu Yao
- National Center for International Research on Animal Genetics, Breeding and Reproduction (NCIRAGBR), Ministry of Science and Technology of the People's Republic of China, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Wenna Zou
- National Center for International Research on Animal Genetics, Breeding and Reproduction (NCIRAGBR), Ministry of Science and Technology of the People's Republic of China, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Pei Nie
- College of Veterinary Medicine, Hunan Agricultural University, Changsha 410128, China
| | - Liguo Yang
- National Center for International Research on Animal Genetics, Breeding and Reproduction (NCIRAGBR), Ministry of Science and Technology of the People's Republic of China, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
8
|
Tian H, Zhou X, Wang H, Xu C, Zhao Z, Xu W, Deng Z. The Prediction of Clinical Mastitis in Dairy Cows Based on Milk Yield, Rumination Time, and Milk Electrical Conductivity Using Machine Learning Algorithms. Animals (Basel) 2024; 14:427. [PMID: 38338070 PMCID: PMC10854744 DOI: 10.3390/ani14030427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Revised: 01/23/2024] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open
Abstract
In commercial dairy farms, mastitis is associated with increased antimicrobial use and associated resistance, which may affect milk production. This study aimed to develop sensor-based prediction models for naturally occurring clinical bovine mastitis using nine machine learning algorithms with data from 447 mastitic and 2146 healthy cows obtained from five commercial farms in Northeast China. The variables were related to daily activity, rumination time, and daily milk yield of cows, as well as milk electrical conductivity. Both Z-standardized and non-standardized datasets pertaining to four specific stages of lactation were used to train and test prediction models. For all four subgroups, the Z-standardized dataset yielded better results than those of the non-standardized one, with the multilayer artificial neural net algorithm showing the best performance. Variables of importance had a similar rank in this algorithm, indicating the consistency of these variables as predictors for bovine mastitis in commercial farms with similar automatic systems. Moreover, the peak milk yield (PMY) of mastitic cows was significantly higher than that of healthy cows (p < 0.005), indicating that high-yielding cattle are more prone to mastitis. Our results show that machine learning algorithms are effective tools for predicting mastitis in dairy cows for immediate intervention and management in commercial farms.
Collapse
Affiliation(s)
- Hong Tian
- College of Science, Heilongjiang Bayi Agricultural University, No. 5 Xinyang Road, Daqing 163319, China;
| | - Xiaojing Zhou
- College of Science, Heilongjiang Bayi Agricultural University, No. 5 Xinyang Road, Daqing 163319, China;
- Heilongjiang Provincial Key Laboratory of Prevention and Control of Bovine Diseases, College of Animal Science and Veterinary Medicine, Heilongjiang Bayi Agricultural University, No. 5 Xinyang Road, Daqing 163319, China;
| | - Hao Wang
- Animal Husbandry and Veterinary Branch, Heilongjiang Academy of Agricultural Science, Qiqihar 161005, China;
| | - Chuang Xu
- College of Veterinary Medicine, China Agricultural University, No. 17 Tsinghua East Road, Haidian District, Beijing 100107, China;
| | - Zixuan Zhao
- Heilongjiang Provincial Key Laboratory of Prevention and Control of Bovine Diseases, College of Animal Science and Veterinary Medicine, Heilongjiang Bayi Agricultural University, No. 5 Xinyang Road, Daqing 163319, China;
| | - Wei Xu
- Department of Biosystems, Division of Animal and Human Health Engineering, KU Leuven, Oude Markt 13, 3000 Leuven, Belgium;
| | - Zhaoju Deng
- College of Veterinary Medicine, China Agricultural University, No. 17 Tsinghua East Road, Haidian District, Beijing 100107, China;
| |
Collapse
|
9
|
Rezaei Z, Tahmasebi A, Pourabbas B. Using meta-analysis and machine learning to investigate the transcriptional response of immune cells to Leishmania infection. PLoS Negl Trop Dis 2024; 18:e0011892. [PMID: 38190401 PMCID: PMC10798641 DOI: 10.1371/journal.pntd.0011892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 01/19/2024] [Accepted: 12/29/2023] [Indexed: 01/10/2024] Open
Abstract
BACKGROUND Leishmaniasis is a parasitic disease caused by the Leishmania protozoan affecting millions of people worldwide, especially in tropical and subtropical regions. The immune response involves the activation of various cells to eliminate the infection. Understanding the complex interplay between Leishmania and the host immune system is crucial for developing effective treatments against this disease. METHODS This study collected extensive transcriptomic data from macrophages, dendritic, and NK cells exposed to Leishmania spp. Our objective was to determine the Leishmania-responsive genes in immune system cells by applying meta-analysis and feature selection algorithms, followed by co-expression analysis. RESULTS As a result of meta-analysis, we discovered 703 differentially expressed genes (DEGs), primarily associated with the immune system and cellular metabolic processes. In addition, we have substantiated the significance of transcription factor families, such as bZIP and C2H2 ZF, in response to Leishmania infection. Furthermore, the feature selection techniques revealed the potential of two genes, namely G0S2 and CXCL8, as biomarkers and therapeutic targets for Leishmania infection. Lastly, our co-expression analysis has unveiled seven hub genes, including PFKFB3, DIAPH1, BSG, BIRC3, GOT2, EIF3H, and ATF3, chiefly related to signaling pathways. CONCLUSIONS These findings provide valuable insights into the molecular mechanisms underlying the response of immune system cells to Leishmania infection and offer novel potential targets for the therapeutic goals.
Collapse
Affiliation(s)
- Zahra Rezaei
- Professor Alborzi Clinical Microbiology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Ahmad Tahmasebi
- Professor Alborzi Clinical Microbiology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Shiraz Institute for Cancer Research, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Bahman Pourabbas
- Professor Alborzi Clinical Microbiology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| |
Collapse
|
10
|
Thompson JS, Green MJ, Hyde R, Bradley AJ, O’Grady L. The use of machine learning to predict somatic cell count status in dairy cows post-calving. Front Vet Sci 2023; 10:1297750. [PMID: 38144465 PMCID: PMC10748400 DOI: 10.3389/fvets.2023.1297750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 11/23/2023] [Indexed: 12/26/2023] Open
Abstract
Udder health remains a priority for the global dairy industry to reduce pain, economic losses, and antibiotic usage. The dry period is a critical time for the prevention of new intra-mammary infections and it provides a point for curing existing intra-mammary infections. Given the wealth of udder health data commonly generated through routine milk recording and the importance of udder health to the productivity and longevity of individual cows, an opportunity exists to extract greater value from cow-level data to undertake risk-based decision-making. The aim of this research was to construct a machine learning model, using routinely collected farm data, to make probabilistic predictions at drying off for an individual cow's risk of a raised somatic cell count (hence intra-mammary infection) post-calving. Anonymized data were obtained as a large convenience sample from 108 UK dairy herds that undertook regular milk recording. The outcome measure evaluated was the presence of a raised somatic cell count in the 30 days post-calving in this observational study. Using a 56-farm training dataset, machine learning analysis was performed using the extreme gradient boosting decision tree algorithm, XGBoost. External validation was undertaken on a separate 28-farm test dataset. Statistical assessment to evaluate model performance using the external dataset returned calibration plots, a Scaled Brier Score of 0.095, and a Mean Absolute Calibration Error of 0.009. Test dataset model calibration performance indicated that the probability of a raised somatic cell count post-calving was well differentiated across probabilities to allow an end user to apply group-level risk decisions. Herd-level new intra-mammary infection rate during the dry period was a key driver of the probability that a cow had a raised SCC post-calving, highlighting the importance of optimizing environmental hygiene conditions. In conclusion, this research has determined that probabilistic classification of the risk of a raised SCC in the 30 days post-calving is achievable with a high degree of certainty, using routinely collected data. These predicted probabilities provide the opportunity for farmers to undertake risk decision-making by grouping cows based on their probabilities and optimizing management strategies for individual cows immediately after calving, according to their likelihood of intra-mammary infection.
Collapse
Affiliation(s)
- Jake S. Thompson
- School of Veterinary Medicine and Science, University of Nottingham, Nottingham, United Kingdom
| | - Martin J. Green
- School of Veterinary Medicine and Science, University of Nottingham, Nottingham, United Kingdom
| | - Robert Hyde
- School of Veterinary Medicine and Science, University of Nottingham, Nottingham, United Kingdom
| | - Andrew J. Bradley
- School of Veterinary Medicine and Science, University of Nottingham, Nottingham, United Kingdom
- Quality Milk Management Services Ltd., Easton Hill, United Kingdom
| | - Luke O’Grady
- School of Veterinary Medicine and Science, University of Nottingham, Nottingham, United Kingdom
| |
Collapse
|
11
|
Luo W, Dong Q, Feng Y. Risk prediction model of clinical mastitis in lactating dairy cows based on machine learning algorithms. Prev Vet Med 2023; 221:106059. [PMID: 37951013 DOI: 10.1016/j.prevetmed.2023.106059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 09/28/2023] [Accepted: 10/23/2023] [Indexed: 11/13/2023]
Abstract
Mastitis is the most common disease among dairy cows and is known to have negative effects on both animal welfare and the profitability of dairy farms. Early detection of clinical mastitis cases is considered the best option for preventing cows from developing mastitis. In this study, we developed clinical mastitis prediction models that only required inputting common indicators from the automatic milking system. We utilized multidimensional data from the cow mastitis database of Afimilk (China) Agricultural Technology Co., Ltd. to predict mastitis in dairy cows. All data were screened for the period of 0-150 days of lactation. The data included parity, lactation day, period, mean and standard deviation of milk yield, of electrical conductivity, and of lying time, which were taken as input features. The classification of whether cows suffer from clinical mastitis was determined as output. We analyzed 426 cows with clinical mastitis and 2087 healthy cows by using four machine learning algorithms: Decision Tree, Random Forest, Back Propagation neural networks, and Support Vector Machines. In these four algorithms, the accuracy ranged from 94% to 98%, while the running times varied widely from seconds to minutes. The decision tree prediction model achieved an accuracy of 98% and the precision rate for healthy cows was 99%, while for mastitis cows it was 97%. Machine learning algorithms have played an important role in predicting cow mastitis, with the Decision Tree algorithm showing great performance and higher accuracy in our research.
Collapse
Affiliation(s)
- Wenkuo Luo
- College of Information Engineering, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Qiang Dong
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi, 712100, China.
| | - Yan Feng
- College of Information Engineering, Northwest A&F University, Yangling, Shaanxi, 712100, China.
| |
Collapse
|
12
|
Pakrashi A, Wallace D, Mac Namee B, Greene D, Guéret C. CowMesh: a data-mesh architecture to unify dairy industry data for prediction and monitoring. Front Artif Intell 2023; 6:1209507. [PMID: 37868080 PMCID: PMC10586498 DOI: 10.3389/frai.2023.1209507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 09/08/2023] [Indexed: 10/24/2023] Open
Abstract
Dairy is an economically significant industry that caters to the huge demand for food products in people's lives. To remain profitable, farmers need to manage their farms and the health of the dairy cows in their herds. There are, however, many risks to cow health that can lead to significant challenges to dairy farm management and have the potential to lead to significant losses. Such risks include cow udder infections (i.e., mastitis) and cow lameness. As automation and data recording become more common in the agricultural sector, dairy farms are generating increasing amounts of data. Recently, these data are being used to generate insights into farm and cow health, where the objective is to help farmers manage the health and welfare of dairy cows and reduce losses from cow health issues. Despite the level of data generation on dairy farms, this information is often difficult to access due to a lack of a single, central organization to collect data from individual farms. The prospect of such an organization, however, raises questions about data ownership, with some farmers reluctant to share their farm data for privacy reasons. In this study, we describe a new data mesh architecture designed for the dairy industry that focuses on facilitating access to data from farms in a decentralized fashion. This has the benefit of keeping the ownership of data with dairy farmers while bringing data together by providing a common and uniform set of protocols. Furthermore, this architecture will allow secure access to the data by research groups and product development groups, who can plug in new projects and applications built across the data. No similar framework currently exists in the dairy industry, and such a data mesh can help industry stakeholders by bringing the dairy farms of a country together in a decentralized fashion. This not only helps farmers, dairy researchers, and product builders but also facilitates an overview of all dairy farms which can help governments to decide on regulations to improve the dairy industry at a national level.
Collapse
Affiliation(s)
- Arjun Pakrashi
- School of Computer Science, University College Dublin, Dublin, Ireland
- Insight Centre for Data Analytics, Dublin, Ireland
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Ireland
| | - Duncan Wallace
- School of Computer Science, University College Dublin, Dublin, Ireland
- Insight Centre for Data Analytics, Dublin, Ireland
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Ireland
| | - Brian Mac Namee
- School of Computer Science, University College Dublin, Dublin, Ireland
- Insight Centre for Data Analytics, Dublin, Ireland
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Ireland
| | - Derek Greene
- School of Computer Science, University College Dublin, Dublin, Ireland
- Insight Centre for Data Analytics, Dublin, Ireland
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Ireland
| | | |
Collapse
|
13
|
Ozella L, Brotto Rebuli K, Forte C, Giacobini M. A Literature Review of Modeling Approaches Applied to Data Collected in Automatic Milking Systems. Animals (Basel) 2023; 13:1916. [PMID: 37370426 DOI: 10.3390/ani13121916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 06/02/2023] [Accepted: 06/06/2023] [Indexed: 06/29/2023] Open
Abstract
Automatic milking systems (AMS) have played a pioneering role in the advancement of Precision Livestock Farming, revolutionizing the dairy farming industry on a global scale. This review specifically targets papers that focus on the use of modeling approaches within the context of AMS. We conducted a thorough review of 60 articles that specifically address the topics of cows' health, production, and behavior/management Machine Learning (ML) emerged as the most widely used method, being present in 63% of the studies, followed by statistical analysis (14%), fuzzy algorithms (9%), deterministic models (7%), and detection algorithms (7%). A significant majority of the reviewed studies (82%) primarily focused on the detection of cows' health, with a specific emphasis on mastitis, while only 11% evaluated milk production. Accurate forecasting of dairy cow milk yield and understanding the deviation between expected and observed milk yields of individual cows can offer significant benefits in dairy cow management. Likewise, the study of cows' behavior and herd management in AMSs is under-explored (7%). Despite the growing utilization of machine learning (ML) techniques in the field of dairy cow management, there remains a lack of a robust methodology for their application. Specifically, we found a substantial disparity in adequately balancing the positive and negative classes within health prediction models.
Collapse
Affiliation(s)
- Laura Ozella
- Department of Veterinary Sciences, University of Turin, 10095 Grugliasco, TO, Italy
| | - Karina Brotto Rebuli
- Department of Veterinary Sciences, University of Turin, 10095 Grugliasco, TO, Italy
| | - Claudio Forte
- Department of Veterinary Sciences, University of Turin, 10095 Grugliasco, TO, Italy
| | - Mario Giacobini
- Department of Veterinary Sciences, University of Turin, 10095 Grugliasco, TO, Italy
| |
Collapse
|
14
|
Balasso P, Taccioli C, Serva L, Magrin L, Andrighetto I, Marchesini G. Uncovering Patterns in Dairy Cow Behaviour: A Deep Learning Approach with Tri-Axial Accelerometer Data. Animals (Basel) 2023; 13:1886. [PMID: 37889789 PMCID: PMC10251916 DOI: 10.3390/ani13111886] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 05/30/2023] [Accepted: 06/02/2023] [Indexed: 10/29/2023] Open
Abstract
The accurate detection of behavioural changes represents a promising method of detecting the early onset of disease in dairy cows. This study assessed the performance of deep learning (DL) in classifying dairy cows' behaviour from accelerometry data acquired by single sensors on the cows' left flanks and compared the results with those obtained through classical machine learning (ML) from the same raw data. Twelve cows with a tri-axial accelerometer were observed for 136 ± 29 min each to detect five main behaviours: standing still, moving, feeding, ruminating and resting. For each 8 s time interval, 15 metrics were calculated, obtaining a dataset of 211,720 observation units and 15 columns. The entire dataset was randomly split into training (80%) and testing (20%) datasets. The DL accuracy, precision and sensitivity/recall were calculated and compared with the performance of classical ML models. The best predictive model was an 8-layer convolutional neural network (CNN) with an overall accuracy and F1 score equal to 0.96. The precision, sensitivity/recall and F1 score of single behaviours had the following ranges: 0.93-0.99. The CNN outperformed all the classical ML algorithms. The CNN used to monitor the cows' conditions showed an overall high performance in successfully predicting multiple behaviours using a single accelerometer.
Collapse
Affiliation(s)
| | | | | | | | | | - Giorgio Marchesini
- Dipartimento di Medicina Animale, Produzioni e Salute, Università degli Studi di Padova, 35020 Legnaro, Italy (L.S.)
| |
Collapse
|
15
|
Pakrashi A, Ryan C, Guéret C, Berry DP, Corcoran M, Keane MT, Mac Namee B. Early detection of subclinical mastitis in lactating dairy cows using cow-level features. J Dairy Sci 2023:S0022-0302(23)00297-7. [PMID: 37268591 DOI: 10.3168/jds.2022-22803] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 01/26/2023] [Indexed: 06/04/2023]
Abstract
Subclinical mastitis in cows affects their health, well-being, longevity, and performance, leading to reduced productivity and profit. Early prediction of subclinical mastitis can enable dairy farmers to perform interventions to mitigate its effect. The present study investigated how well predictive models built using machine learning techniques can detect subclinical mastitis up to 7 d before its occurrence. The data set used consisted of 1,346,207 milk-day (i.e., a day when milk was collected on both morning and evening) records spanning 9 yr from 2,389 cows producing on 7 Irish research farms. Individual cow composite milk yield and maximum milk flow were available twice daily, whereas milk composition (i.e., fat, lactose, protein) and somatic cell count (SCC) were collected once per week. Other features describing parity, calving dates, predicted transmitting ability for SCC, body weight, and history of subclinical mastitis were also available. The results of the study showed that a gradient boosting machine model trained to predict the onset of subclinical mastitis 7 d before a subclinical case occurs achieved a sensitivity and specificity of 69.45 and 95.64%, respectively. Reduced data collection frequency, where milk composition and SCC were recorded only every 15, 30, 45, and 60 d was simulated by masking data, to reflect the frequency of recording of this data on commercial dairy farms in Ireland. The sensitivity and specificity scores reduced as recording frequency reduced with respective scores of 66.93 and 80.43% when milk composition and SCC were recorded just every 60 d. Results demonstrate that models built on data that could be recorded routinely available on commercial dairy farms, can achieve useful predictive ability of subclinical mastitis even with reduced frequency of milk composition and SCC recording.
Collapse
Affiliation(s)
- A Pakrashi
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Co. Cork, P61 C996, Ireland; School of Computer Science, University College Dublin, Belfield, D04 V1W8, Ireland; Insight Centre for Data Analytics, University College Dublin, Belfield, Dublin 4, D04 N2E5, Ireland.
| | - C Ryan
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Co. Cork, P61 C996, Ireland; School of Computer Science, University College Dublin, Belfield, D04 V1W8, Ireland; Insight Centre for Data Analytics, University College Dublin, Belfield, Dublin 4, D04 N2E5, Ireland
| | - C Guéret
- Accenture Labs, Grand Canal Dock, Dublin, D02 YN32, Ireland
| | - D P Berry
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Co. Cork, P61 C996, Ireland; Teagasc, Animal & Grassland Research and Innovation Centre, Moorepark, Fermoy P61 P302, Co. Cork, Ireland
| | - M Corcoran
- Accenture Labs, Grand Canal Dock, Dublin, D02 YN32, Ireland
| | - M T Keane
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Co. Cork, P61 C996, Ireland; School of Computer Science, University College Dublin, Belfield, D04 V1W8, Ireland; Insight Centre for Data Analytics, University College Dublin, Belfield, Dublin 4, D04 N2E5, Ireland
| | - B Mac Namee
- VistaMilk SFI Research Centre, Teagasc Moorepark, Fermoy, Co. Cork, P61 C996, Ireland; School of Computer Science, University College Dublin, Belfield, D04 V1W8, Ireland; Insight Centre for Data Analytics, University College Dublin, Belfield, Dublin 4, D04 N2E5, Ireland
| |
Collapse
|
16
|
Gutiérrez-Reinoso MA, Aponte PM, García-Herreros M. Genomic and Phenotypic Udder Evaluation for Dairy Cattle Selection: A Review. Animals (Basel) 2023; 13:ani13101588. [PMID: 37238017 DOI: 10.3390/ani13101588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 05/05/2023] [Accepted: 05/08/2023] [Indexed: 05/28/2023] Open
Abstract
The traditional point of view regarding dairy cattle selection has been challenged by recent genomic studies indicating that livestock productivity prediction can be redefined based on the evaluation of genomic and phenotypic data. Several studies that included different genomic-derived traits only indicated that interactions among them or even with conventional phenotypic evaluation criteria require further elucidation. Unfortunately, certain genomic and phenotypic-derived traits have been shown to be secondary factors influencing dairy production. Thus, these factors, as well as evaluation criteria, need to be defined. Owing to the variety of genomic and phenotypic udder-derived traits which may affect the modern dairy cow functionality and conformation, a definition of currently important traits in the broad sense is indicated. This is essential for cattle productivity and dairy sustainability. The main objective of the present review is to elucidate the possible relationships among genomic and phenotypic udder evaluation characteristics to define the most relevant traits related to selection for function and conformation in dairy cattle. This review aims to examine the potential impact of various udder-related evaluation criteria on dairy cattle productivity and explore how to mitigate the adverse effects of compromised udder conformation and functionality. Specifically, we will consider the implications for udder health, welfare, longevity, and production-derived traits. Subsequently, we will address several concerns covering the application of genomic and phenotypic evaluation criteria with emphasis on udder-related traits in dairy cattle selection as well as its evolution from origins to the present and future prospects.
Collapse
Affiliation(s)
- Miguel A Gutiérrez-Reinoso
- Carrera de Medicina Veterinaria, Facultad de Ciencias Agropecuarias y Recursos Naturales, Universidad Técnica de Cotopaxi (UTC), Latacunga 0501491, Ecuador
- Laboratorio de Biotecnología Animal, Departamento de Ciencia Animal, Facultad de Ciencias Veterinarias, Universidad de Concepción (UdeC), Chillán 3780000, Chile
| | - Pedro M Aponte
- Colegio de Ciencias Biológicas y Ambientales (COCIBA), Universidad San Francisco de Quito USFQ, Quito 170157, Ecuador
- Colegio de Ciencias de la Salud, Escuela de Medicina Veterinaria, Universidad San Francisco de Quito USFQ, Quito 170157, Ecuador
- Campus Cumbayá, Instituto de Investigaciones en Biomedicina "One-Health", Universidad San Francisco de Quito USFQ, Quito 170157, Ecuador
| | - Manuel García-Herreros
- Instituto Nacional de Investigação Agrária e Veterinária (INIAV), 2005-048 Santarém, Portugal
| |
Collapse
|
17
|
Liu Y, Zhuang Y, Yu L, Li Q, Zhao C, Meng R, Zhu J, Guo X. A Machine Learning Framework Based on Extreme Gradient Boosting to Predict the Occurrence and Development of Infectious Diseases in Laying Hen Farms, Taking H9N2 as an Example. Animals (Basel) 2023; 13:1494. [PMID: 37174531 PMCID: PMC10177545 DOI: 10.3390/ani13091494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 04/26/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023] Open
Abstract
The H9N2 avian influenza virus has become one of the dominant subtypes of avian influenza virus in poultry and has been significantly harmful to chickens in China, with great economic losses in terms of reduced egg production or high mortality by co-infection with other pathogens. A prediction of H9N2 status based on easily available production data with high accuracy would be important and essential to prevent and control H9N2 outbreaks in advance. This study developed a machine learning framework based on the XGBoost classification algorithm using 3 months' laying rates and mortalities collected from three H9N2-infected laying hen houses with complete onset cycles. A framework was developed to automatically predict the H9N2 status of individual house for future 3 days (H9N2 status + 0, H9N2 status + 1, H9N2 status + 2) with five time frames (day + 0, day - 1, day - 2, day - 3, day - 4). It had been proven that a high accuracy rate > 90%, a recall rate > 90%, a precision rate of >80%, and an area under the curve of the receiver operator characteristic ≥ 0.85 could be achieved with the prediction models. Models with day + 0 and day - 1 were highly recommended to predict H9N2 status + 0 and H9N2 status + 1 for the direct or auxiliary monitoring of its occurrence and development. Such a framework could provide new insights into predicting H9N2 outbreaks, and other practical potential applications to assist in disease monitor were also considerable.
Collapse
Affiliation(s)
- Yu Liu
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| | - Yanrong Zhuang
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing 100083, China
| | - Ligen Yu
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| | - Qifeng Li
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| | - Chunjiang Zhao
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| | - Rui Meng
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| | - Jun Zhu
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| | - Xiaoli Guo
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
- National Innovation Center of Digital Technology in Animal Husbandry, Beijing 100097, China
| |
Collapse
|
18
|
Bobbo T, Matera R, Pedota G, Manunza A, Cotticelli A, Neglia G, Biffani S. Exploiting machine learning methods with monthly routine milk recording data and climatic information to predict subclinical mastitis in Italian Mediterranean buffaloes. J Dairy Sci 2023; 106:1942-1952. [PMID: 36586801 DOI: 10.3168/jds.2022-22292] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 09/27/2022] [Indexed: 12/31/2022]
Abstract
Mastitis has detrimental effects on the world's dairy industry, reducing animal health, milk production and quality, as well as income for farmers. In addition, consumers' growing interest in food safety and rational usage of antibiotics highlights the need to develop novel strategies to improve mastitis detection, prevention, and management. In the present study we applied machine learning (ML) analyses to predict presence or absence of subclinical mastitis in Italian Mediterranean buffaloes, exploiting information collected the previous month during routine milk recording procedures, as well as climatic data. The data set included 3,891 records of 1,038 buffaloes from 6 herds located in Basilicata Region (South Italy). Prediction models were developed using 4 different ML algorithms (Generalized Linear Model, Support Vector Machines, Random Forest, and Neural Network) and 2 data set splitting approaches for the creation of the training and test sets (by record or by animal ID number, always with 80% of the data used for model training and the remaining 20% for model testing). Support Vector Machine was the best method to predict high or low somatic cell count at the subsequent test-day record in the validation set, and therefore it was used to estimate the contribution of each feature to the best model. Independently from the data set splitting approach, the most important features were somatic cell score, differential somatic cell count, electrical conductivity, and milk production. Among climatic data, the most informative were temperature and relative humidity. When the data were split by animal ID, an improvement in models' predictive performance on the test set was observed, suggesting this as the most appropriate data splitting approach in data sets with repeated measures to avoid data leakage. According to different metrics, Neural Network was the best method for making predictions on the test set. Our findings confirmed the promising role of ML methods to improve prevention and surveillance of subclinical mastitis, exploiting the large amount of data currently available to identify animals that would possibly have high somatic cell count the subsequent month.
Collapse
Affiliation(s)
- T Bobbo
- Consiglio Nazionale delle Ricerche (CNR), Istituto di Biologia e Biotecnologia Agraria (IBBA), 20133 Milan, Italy; Department of Agricultural and Environmental Sciences, University of Milan, 20133 Milan, Italy
| | - R Matera
- Department of Veterinary Medicine and Animal Production, Federico II University, 80137 Naples, Italy
| | - G Pedota
- Associazione Regionale Allevatori della Basilicata, 85100 Potenza, Italy
| | - A Manunza
- Consiglio Nazionale delle Ricerche (CNR), Istituto di Biologia e Biotecnologia Agraria (IBBA), 20133 Milan, Italy
| | - A Cotticelli
- Department of Veterinary Medicine and Animal Production, Federico II University, 80137 Naples, Italy
| | - G Neglia
- Department of Veterinary Medicine and Animal Production, Federico II University, 80137 Naples, Italy.
| | - S Biffani
- Consiglio Nazionale delle Ricerche (CNR), Istituto di Biologia e Biotecnologia Agraria (IBBA), 20133 Milan, Italy
| |
Collapse
|
19
|
Invited Review: Examples and opportunities for artificial intelligence (AI) in dairy farms*. APPLIED ANIMAL SCIENCE 2023. [DOI: 10.15232/aas.2022-02345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
|
20
|
Thompson J, Everhart Nunn SL, Sarkar S, Clayton B. Diagnostic Screening of Bovine Mastitis Using MALDI-TOF MS Direct-Spotting of Milk and Machine Learning. Vet Sci 2023; 10:vetsci10020101. [PMID: 36851405 PMCID: PMC9962131 DOI: 10.3390/vetsci10020101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 01/02/2023] [Accepted: 01/26/2023] [Indexed: 02/04/2023] Open
Abstract
Novel strategies for diagnostic screening of animal and herd health are crucial to contain disease outbreaks, maintain animal health, and maximize production efficiency. Mastitis is an inflammation of the mammary gland in dairy cows, often resulting from infection from a microorganism. Mastitis outbreaks result in loss of production, degradation of milk quality, and the need to isolate and treat affected animals. In this work, we evaluate MALDI-TOF mass spectrometry as a diagnostic for the culture-less screening of mastitis state from raw milk samples collected from regional dairies. Since sample preparation requires only minutes per sample using microvolumes of reagents and no cell culture, the technique is promising for rapid sample turnaround and low-cost diagnosis. Machine learning algorithms have been used to detect patterns embedded within MALDI-TOF spectra using a training set of 226 raw milk samples. A separate scoring set of 100 raw milk samples has been used to assess the specificity (spc) and sensitivity (sens) of the approach. Of machine learning models tested, the gradient-boosted tree model gave global optimal results, with the Youden index of J = 0.7, sens = 0.89, and spc = 0.81 achieved for the given set of conditions. Random forest models also performed well, achieving J > 0.63, with sens = 0.83 and spc = 0.81. Naïve Bayes, generalized linear, fast large-margin, and deep learning models failed to produce diagnostic results that were as favorable. We conclude that MALDI-TOF MS combined with machine learning is an alternative diagnostic tool for detection of high somatic cell count (SCC) and subclinical mastitis in dairy herds.
Collapse
Affiliation(s)
- Jonathan Thompson
- School of Veterinary Medicine, Texas Tech University, 7671 Evans Dr., Amarillo, TX 79106, USA
- Correspondence:
| | - Savana L. Everhart Nunn
- School of Veterinary Medicine, Texas Tech University, 7671 Evans Dr., Amarillo, TX 79106, USA
| | - Sumon Sarkar
- School of Veterinary Medicine, Texas Tech University, 7671 Evans Dr., Amarillo, TX 79106, USA
| | - Beth Clayton
- Dairy Herd Improvement Association, 301 23rd St., 117B, Canyon, TX 79015, USA
| |
Collapse
|
21
|
Kaur U, Malacco VMR, Bai H, Price TP, Datta A, Xin L, Sen S, Nawrocki RA, Chiu G, Sundaram S, Min BC, Daniels KM, White RR, Donkin SS, Brito LF, Voyles RM. Invited review: integration of technologies and systems for precision animal agriculture-a case study on precision dairy farming. J Anim Sci 2023; 101:skad206. [PMID: 37335911 PMCID: PMC10370899 DOI: 10.1093/jas/skad206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 06/17/2023] [Indexed: 06/21/2023] Open
Abstract
Precision livestock farming (PLF) offers a strategic solution to enhance the management capacity of large animal groups, while simultaneously improving profitability, efficiency, and minimizing environmental impacts associated with livestock production systems. Additionally, PLF contributes to optimizing the ability to manage and monitor animal welfare while providing solutions to global grand challenges posed by the growing demand for animal products and ensuring global food security. By enabling a return to the "per animal" approach by harnessing technological advancements, PLF enables cost-effective, individualized care for animals through enhanced monitoring and control capabilities within complex farming systems. Meeting the nutritional requirements of a global population exponentially approaching ten billion people will likely require the density of animal proteins for decades to come. The development and application of digital technologies are critical to facilitate the responsible and sustainable intensification of livestock production over the next several decades to maximize the potential benefits of PLF. Real-time continuous monitoring of each animal is expected to enable more precise and accurate tracking and management of health and well-being. Importantly, the digitalization of agriculture is expected to provide collateral benefits of ensuring auditability in value chains while assuaging concerns associated with labor shortages. Despite notable advances in PLF technology adoption, a number of critical concerns currently limit the viability of these state-of-the-art technologies. The potential benefits of PLF for livestock management systems which are enabled by autonomous continuous monitoring and environmental control can be rapidly enhanced through an Internet of Things approach to monitoring and (where appropriate) closed-loop management. In this paper, we analyze the multilayered network of sensors, actuators, communication, networking, and analytics currently used in PLF, focusing on dairy farming as an illustrative example. We explore the current state-of-the-art, identify key shortcomings, and propose potential solutions to bridge the gap between technology and animal agriculture. Additionally, we examine the potential implications of advancements in communication, robotics, and artificial intelligence on the health, security, and welfare of animals.
Collapse
Affiliation(s)
- Upinder Kaur
- School of Engineering Technology, Purdue University, West Lafayette, IN, 47907, USA
| | - Victor M R Malacco
- Department of Animal Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Huiwen Bai
- School of Engineering Technology, Purdue University, West Lafayette, IN, 47907, USA
| | - Tanner P Price
- Department of Animal and Poultry Sciences, Virginia Tech, Blacksburg, VA, 24061, USA
| | - Arunashish Datta
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Lei Xin
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Shreyas Sen
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Robert A Nawrocki
- School of Engineering Technology, Purdue University, West Lafayette, IN, 47907, USA
| | - George Chiu
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
- School of Mechanical Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Shreyas Sundaram
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Byung-Cheol Min
- Department of Computer and Information Technology, West Lafayette, IN, 47907, USA
| | - Kristy M Daniels
- Department of Animal and Poultry Sciences, Virginia Tech, Blacksburg, VA, 24061, USA
| | - Robin R White
- Department of Animal and Poultry Sciences, Virginia Tech, Blacksburg, VA, 24061, USA
| | - Shawn S Donkin
- Department of Animal Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Richard M Voyles
- School of Engineering Technology, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
22
|
Sun G, Zhang Q, Dong Z, Dong D, Fang H, Wang C, Dong Y, Wu J, Tan X, Zhu P, Wan Y. Antibiotic resistant bacteria: A bibliometric review of literature. Front Public Health 2022; 10:1002015. [PMID: 36466520 PMCID: PMC9713414 DOI: 10.3389/fpubh.2022.1002015] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 10/20/2022] [Indexed: 11/18/2022] Open
Abstract
Antibiotic-resistant bacteria (ARB) are a serious threat to the health of people and the ecological environment. With this problem becoming more and more serious, more countries made research on the ARB, and the research number has been sharply increased particularly over the past decade. Therefore, it is quite necessary to globally retrace relevant researches on the ARB published from 2010 to 2020. This will help researchers to understand the current research situation, research trends and research hotspots in this field. This paper uses bibliometrics to examine publications in the field of ARB from 2010 to 2020 that were retrieved from the Web of Science (WOS). Our study performed a statistical analysis of the countries, institutions, journals, authors, research areas, author keywords, Essential Science Indicators (ESI) highly cited papers, and ESI hotspots papers to provide an overview of the ARB field as well as research trends, research hotspots, and future research directions in the field. The results showed that the number of related studies is increasing year by year; the USA is most published in the field of ARB; China is the most active in this field in the recent years; the Chinese Acad Sci published the most articles; Sci. Total Environ. published the greatest number of articles; CM Manaia has the most contributions; Environmental Sciences and Ecology is the most popular research area; and "antibiotic resistance," "antibiotics," and "antibiotic resistance genes" were the most frequently occurring author keywords. A citation analysis showed that aquatic environment-related antibiotic resistance is a key research area in this field, while antimicrobial nanomaterial-related research is a recent popular topic.
Collapse
Affiliation(s)
- Guojun Sun
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Qian Zhang
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Zuojun Dong
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Dashun Dong
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Hui Fang
- Institute of Information Resource, Zhejiang University of Technology, Hangzhou, China
| | - Chaojun Wang
- Hangzhou Aeronautical Sanatorium for Special Service of Chinese Air Force, Hangzhou, China
| | - Yichen Dong
- Department of Chinese Medicine, Macau University of Science and Technology, Taipa, Macau SAR, China
| | - Jiezhou Wu
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Xuanzhe Tan
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Peiyao Zhu
- College of Pharmaceutical Science, Zhejiang University of Technology, Hangzhou, China
| | - Yuehua Wan
- Institute of Information Resource, Zhejiang University of Technology, Hangzhou, China
| |
Collapse
|
23
|
Govic A, Nasser H, Levay EA, Zelko M, Ebrahimie E, Mohammadi Dehcheshmeh M, Kent S, Penman J, Hazi A. Long-Term Calorie Restriction Alters Anxiety-like Behaviour and the Brain and Adrenal Gland Transcriptomes of the Ageing Male Rat. Nutrients 2022; 14:nu14214670. [PMID: 36364936 PMCID: PMC9654051 DOI: 10.3390/nu14214670] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 10/28/2022] [Accepted: 11/01/2022] [Indexed: 11/06/2022] Open
Abstract
Further examination of the molecular regulators of long-term calorie restriction (CR), reported to have an anxiolytic effect, may highlight novel therapeutic targets for anxiety disorders. Here, adult male Hooded Wistar rats were exposed to a 25% CR whilst anxiety-like behaviour was assessed at 6-, 12-, and 18-months of age via the elevated plus maze, open field, and acoustic startle tests. Next-generation sequencing was then used to measure transcriptome-wide gene expression in the hypothalamus, amygdala, pituitary, and adrenal glands. Results showed an anxiolytic behavioural profile across early, middle, and late adulthood by CR, with the strongest effects noted at 6-months. Transcriptomic analysis by seven attribute weighting algorithms, including Info Gain Ratio, Rule, Chi Squared, Gini Index, Uncertainty, Relief, and Info Gain, led to the development of a signature of long-term CR, independent of region. Complement C1q A chain (C1qa), an extracellular protein, expression was significantly decreased by CR in most regions examined. Furthermore, text mining highlighted the positive involvement of C1qa in anxiety, depression, neurodegeneration, stress, and ageing, collectively identifying a suitable biomarker candidate for CR. Overall, the current study identified anxiety-related phenotypic changes and a novel transcriptome signature of long-term CR, indicating potential therapeutic targets for anxiety, depression, and neurodegeneration.
Collapse
Affiliation(s)
- Antonina Govic
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3010, Australia
- Epigenes Australia Pty Ltd., Melbourne, VIC 3010, Australia
- Correspondence: or ; Tel.: +61-3-9780-9996
| | - Helen Nasser
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3010, Australia
- Epigenes Australia Pty Ltd., Melbourne, VIC 3010, Australia
| | - Elizabeth A. Levay
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3010, Australia
- Epigenes Australia Pty Ltd., Melbourne, VIC 3010, Australia
| | - Matt Zelko
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3010, Australia
- Epigenes Australia Pty Ltd., Melbourne, VIC 3010, Australia
| | - Esmaeil Ebrahimie
- Genomics Research Platform, School of Agriculture, Biomedicine and Environment, La Trobe University, Melbourne, VIC 3000, Australia
- School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, SA 5371, Australia
- School of BioSciences, The University of Melbourne, Melbourne, VIC 3010, Australia
| | - Manijeh Mohammadi Dehcheshmeh
- Genomics Research Platform, School of Agriculture, Biomedicine and Environment, La Trobe University, Melbourne, VIC 3000, Australia
- School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, SA 5371, Australia
| | - Stephen Kent
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3010, Australia
| | - Jim Penman
- Epigenes Australia Pty Ltd., Melbourne, VIC 3010, Australia
| | - Agnes Hazi
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3010, Australia
| |
Collapse
|
24
|
Thompson JE. Matrix-assisted laser desorption ionization-time-of-flight mass spectrometry in veterinary medicine: Recent advances (2019-present). Vet World 2022; 15:2623-2657. [PMID: 36590115 PMCID: PMC9798047 DOI: 10.14202/vetworld.2022.2623-2657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 10/11/2022] [Indexed: 11/22/2022] Open
Abstract
Matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF) mass spectrometry (MS) has become a valuable laboratory tool for rapid diagnostics, research, and exploration in veterinary medicine. While instrument acquisition costs are high for the technology, cost per sample is very low, the method requires minimal sample preparation, and analysis is easily conducted by end-users requiring minimal training. Matrix-assisted laser desorption ionization-time-of-flight MS has found widespread application for the rapid identification of microorganisms, diagnosis of dermatophytes and parasites, protein/lipid profiling, molecular diagnostics, and the technique demonstrates significant promise for 2D chemical mapping of tissue sections collected postmortem. In this review, an overview of the MALDI-TOF technique will be reported and manuscripts outlining current uses of the technology for veterinary science since 2019 will be summarized. The article concludes by discussing gaps in knowledge and areas of future growth.
Collapse
Affiliation(s)
- Jonathan E. Thompson
- School of Veterinary Medicine, Texas Tech University, Amarillo, Texas 79106, United States,Corresponding author: Jonathan E. Thompson, e-mail:
| |
Collapse
|
25
|
Bausewein M, Mansfeld R, Doherr MG, Harms J, Sorge US. Sensitivity and Specificity for the Detection of Clinical Mastitis by Automatic Milking Systems in Bavarian Dairy Herds. Animals (Basel) 2022; 12:ani12162131. [PMID: 36009724 PMCID: PMC9405299 DOI: 10.3390/ani12162131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 08/11/2022] [Accepted: 08/14/2022] [Indexed: 11/20/2022] Open
Abstract
In automatic milking systems (AMSs), the detection of clinical mastitis (CM) and the subsequent separation of abnormal milk should be reliably performed by commercial AMSs. Therefore, the objectives of this cross-sectional study were (1) to determine the sensitivity (SN) and specificity (SP) of CM detection of AMS by the four most common manufacturers in Bavarian dairy farms, and (2) to identify routinely collected cow data (AMS and monthly test day data of the regional Dairy Herd Improvement Association (DHIA)) that could improve the SN and SP of clinical mastitis detection. Bavarian dairy farms with AMS from the manufacturers DeLaval, GEA Farm Technologies, Lely, and Lemmer-Fullwood were recruited with the aim of sampling at least 40 cows with clinical mastitis per AMS manufacturer in addition to clinically healthy ones. During a single farm visit, cow-level milking information was first electronically extracted from each AMS and then all lactating cows examined for their udder health status in the barn. Clinical mastitis was defined as at least the presence of visibly abnormal milk. In addition, available DHIA test results from the previous six months were collected. None of the manufacturers provided a definition for clinical mastitis (i.e., visually abnormal milk), therefore, the SN and SP of AMS warning lists for udder health were assessed for each manufacturer individually, based on the clinical evaluation results. Generalized linear mixed models (GLMMs) with herd as random effect were used to determine the potential influence of routinely recorded parameters on SN and SP. A total of 7411 cows on 114 farms were assessed; of these, 7096 cows could be matched to AMS data and were included in the analysis. The prevalence of clinical mastitis was 3.4% (239 cows). When considering the 95% confidence interval (95% CI), all but one manufacturer achieved the minimum SN limit of >80%: DeLaval (SN: 61.4% (95% CI: 49.0%−72.8%)), GEA (75.9% (62.4%−86.5%)), Lely (78.2% (67.4%−86.8%)), and Lemmer-Fullwood (67.6% (50.2%−82.0%)). However, none of the evaluated AMSs achieved the minimum SP limit of 99%: DeLaval (SP: 89.3% (95% CI: 87.7%−90.7%)), GEA (79.2% (77.1%−81.2%)), Lely (86.2% (84.6%−87.7%)), and Lemmer-Fullwood (92.2% (90.8%−93.5%)). All AMS manufacturers’ robots showed an association of SP with cow classification based on somatic cell count (SCC) measurement from the last two DHIA test results: cows that were above the threshold of 100,000 cells/mL for subclinical mastitis on both test days had lower chances of being classified as healthy by the AMS compared to cows that were below the threshold. In conclusion, the detection of clinical mastitis cases was satisfactory across AMS manufacturers. However, the low SP will lead to unnecessarily discarded milk and increased workload to assess potentially false-positive mastitis cases. Based on the results of our study, farmers must evaluate all available data (test day data, AMS data, and daily assessment of their cows in the barn) to make decisions about individual cows and to ultimately ensure animal welfare, food quality, and the economic viability of their farm.
Collapse
Affiliation(s)
- Mathias Bausewein
- Bavarian Animal Health Services, 85586 Poing-Grub, Germany
- Clinic for Ruminants with Ambulatory and Herd Health Services, Centre for Clinical Veterinary Medicine, LMU Munich, 85764 Oberschleissheim, Germany
- Correspondence:
| | - Rolf Mansfeld
- Clinic for Ruminants with Ambulatory and Herd Health Services, Centre for Clinical Veterinary Medicine, LMU Munich, 85764 Oberschleissheim, Germany
| | - Marcus G. Doherr
- Institute for Veterinary Epidemiology and Biostatistics, Freie Universität, 14163 Berlin, Germany
| | - Jan Harms
- Institute for Agricultural Engineering and Animal Husbandry, Bavarian State Research Centre for Agriculture, 85586 Poing-Grub, Germany
| | | |
Collapse
|
26
|
Cantor MC, Casella E, Silvestri S, Renaud DL, Costa JHC. Using Machine Learning and Behavioral Patterns Observed by Automated Feeders and Accelerometers for the Early Indication of Clinical Bovine Respiratory Disease Status in Preweaned Dairy Calves. FRONTIERS IN ANIMAL SCIENCE 2022. [DOI: 10.3389/fanim.2022.852359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The objective of this retrospective cohort study was to evaluate a K-nearest neighbor (KNN) algorithm to classify and indicate bovine respiratory disease (clinical BRD) status using behavioral patterns in preweaned dairy calves. Calves (N=106) were enrolled in this study, which occurred at one facility for the preweaning period. Precision dairy technologies were used to record feeding behavior with an automated feeder and activity behavior with a pedometer (automated features). Daily, calves were manually health-scored for bovine respiratory disease (clinical BRD; Wisconsin scoring system, WI, USA), and weights were taken twice weekly (manual features). All calves were also scored for ultrasonographic lung consolidation twice weekly. A clinical BRD bout (day 0) was defined as 2 scores classified as abnormal on the Wisconsin scoring system and an area of consolidated lung ≥3.0 cm2. There were 54 calves dignosed with a clinical BRD bout. Two scenarios were considered for KNN inference. In the first scenario (diagnosis scenario), the KNN algorithm classified calves as clinical BRD positive or as negative for respiratory infection. For the second scenario (preclinical BRD bout scenario), the 14 days before a clinical BRD bout was evaluated to determine if behavioral changes were indicative of calves destined for disease. Both scenarios investigated the use of automated features or manual features or both. For the diagnosis scenario, manual features had negligible improvements compared to automated features, with an accuracy of 0.95 ± 0.02 and 0.94 ± 0.02, respectively, for classifying calves as negative for respiratory infection. There was an equal accuracy of 0.98 ± 0.01 for classifying calves as sick using automated and manual features. For the preclinical BRD bout scenario, automated features were highly accurate at -6 days prior to diagnosis (0.90 ± 0.02), while manual features had low accuracy at -6 days (0.52 ± 0.03). Automated features were near perfectly accurate at -1 day before clinical BRD diagnosis compared to the high accuracy of manual features (0.86 ± 0.03). This research indicates that machine-learning algorithms accurately predict clinical BRD status at up to -6 days using a myriad of feeding behaviors and activity levels in calves. Precision dairy technologies hold the potential to indicate the BRD status in preweaned calves.
Collapse
|
27
|
Jafari O, Ebrahimi M, Hedayati SAA, Zeinalabedini M, Poorbagher H, Nasrolahpourmoghadam M, Fernandes JMO. Integration of Morphometrics and Machine Learning Enables Accurate Distinction between Wild and Farmed Common Carp. LIFE (BASEL, SWITZERLAND) 2022; 12:life12070957. [PMID: 35888047 PMCID: PMC9315565 DOI: 10.3390/life12070957] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 06/16/2022] [Accepted: 06/20/2022] [Indexed: 11/16/2022]
Abstract
Morphology and feature selection are key approaches to address several issues in fisheries science and stock management, such as the hypothesis of admixture of Caspian common carp (Cyprinus carpio) and farmed carp stocks in Iran. The present study was performed to investigate the population classification of common carp in the southern Caspian basin using data mining algorithms to find the most important characteristic(s) differing between Iranian and farmed common carp. A total of 74 individuals were collected from three locations within the southern Caspian basin and from one farm between November 2015 and April 2016. A dataset of 26 traditional morphometric (TMM) attributes and a dataset of 14 geometric landmark points were constructed and then subjected to various machine learning methods. In general, the machine learning methods had a higher prediction rate with TMM datasets. The highest decision tree accuracy of 77% was obtained by rule and decision tree parallel algorithms, and “head height on eye area” was selected as the best marker to distinguish between wild and farmed common carp. Various machine learning algorithms were evaluated, and we found that the linear discriminant was the best method, with 81.1% accuracy. The results obtained from this novel approach indicate that Darwin’s domestication syndrome is observed in common carp. Moreover, they pave the way for automated detection of farmed fish, which will be most beneficial to detect escapees and improve restocking programs.
Collapse
Affiliation(s)
- Omid Jafari
- International Sturgeon Research Institute, Iranian Fisheries Science Research Institute, Agricultural Research, Education and Extension Organization, Rasht 416353464, Iran
- Correspondence: (O.J.); (J.M.O.F.)
| | - Mansour Ebrahimi
- Department of Biology, School of Basic Science, University of Qom, Qom 3716146611, Iran;
| | - Seyed Ali-Akbar Hedayati
- Department of Fisheries, Faculty of Fisheries and Environmental Sciences, Gorgan University of Agricultural Sciences and Natural Resources, Gorgan 4913815739, Iran;
| | - Mehrshad Zeinalabedini
- Department of Genomics, Agricultural Biotechnology Research Institute of Iran (ABRII), Karaj 3135933151, Iran;
| | - Hadi Poorbagher
- Department of Fisheries Sciences, Faculty of Natural Resources, University of Tehran, Karaj 3158777871, Iran; (H.P.); (M.N.)
| | - Maryam Nasrolahpourmoghadam
- Department of Fisheries Sciences, Faculty of Natural Resources, University of Tehran, Karaj 3158777871, Iran; (H.P.); (M.N.)
| | - Jorge M. O. Fernandes
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
- Correspondence: (O.J.); (J.M.O.F.)
| |
Collapse
|
28
|
Body Weight Prediction from Linear Measurements of Icelandic Foals: A Machine Learning Approach. Animals (Basel) 2022; 12:ani12101234. [PMID: 35625080 PMCID: PMC9137917 DOI: 10.3390/ani12101234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Revised: 04/20/2022] [Accepted: 05/09/2022] [Indexed: 12/02/2022] Open
Abstract
Simple Summary Knowing the body weight of a growing horse is important both for horse breeders and veterinarians because this information helps to identify abnormalities of the growing process, determine adequate feeding rations or choose an appropriate drug treatment regimen. It is not always possible to measure accurately a horse’s body weight using special scales, and a visual assessment, which is the easiest method for finding out a horse’s body weight, produces heavily biased results. Simple formulas are being sought to allow making accurate estimates of body weight in horses based on their body measurements. This study relates the estimation of body weight in Icelandic foals with the use of models relying on machine learning methods. Based on their evaluation, two of the models are recommended for use in practical applications. Abstract Knowledge of the body weight of horses permits breeders to provide appropriate feeding and care regimen and allows veterinarians to monitor the animals’ health. It is not always possible to perform an accurate measurement of the body weight of horses using horse weighbridges, and therefore, new body weight formulas based on biometric measurements are required. The objective of this study is to develop and validate models for estimating body weight in Icelandic foals using machine learning methods. The study was conducted using 312 data records of body measurements on 24 Icelandic foals (12 colts and 12 fillies) from birth to 404 days of age. The best performing model was the polynomial model that included features such as heart girth, body circumference and cannon bone circumference. The mean percentage error for this model was 4.1% based on cross-validation and 3.8% for a holdout dataset. The body weight of Icelandic foals can also be estimated using a less complex model taking a single trait defined as the square of heart girth multiplied by body circumference. The mean percentage error for this model was up to 5% both for the training and the holdout datasets. The study results suggest that machine learning methods can be considered a useful tool for designing models for the estimation of body weight in horses.
Collapse
|
29
|
Evaluation of tree-based ensemble algorithms for predicting the big five personality traits based on social media photos: Evidence from an Iranian sample. PERSONALITY AND INDIVIDUAL DIFFERENCES 2022. [DOI: 10.1016/j.paid.2021.111479] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
30
|
Machine learning models effectively distinguish attention-deficit/hyperactivity disorder using event-related potentials. Cogn Neurodyn 2022; 16:1335-1349. [PMID: 36408064 PMCID: PMC9666608 DOI: 10.1007/s11571-021-09746-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 07/18/2021] [Accepted: 10/29/2021] [Indexed: 11/30/2022] Open
Abstract
Accurate diagnosis of Attention-Deficit/Hyperactivity Disorder (ADHD) is a significant challenge. Misdiagnosis has significant negative medical side effects. Due to the complex nature of this disorder, there is no computational expert system for diagnosis. Recently, automatic diagnosis of ADHD by machine learning analysis of brain signals has received an increased attention. This paper aimed to achieve an accurate model to discriminate between ADHD patients and healthy controls by pattern discovery. Event-Related Potentials (ERP) data were collected from ADHD patients and healthy controls. After pre-processing, ERP signals were decomposed and features were calculated for different frequency bands. The classification was carried out based on each feature using seven machine learning algorithms. Important features were then selected and combined. To find specific patterns for each model, the classification was repeated using the proposed patterns. Results indicated that the combination of complementary features can significantly improve the performance of the predictive models. The newly developed features, defined based on band power, were able to provide the best classification using the Generalized Linear Model, Logistic Regression, and Deep Learning with the average accuracy and Receiver operating characteristic curve > %99.85 and > 0.999, respectively. High and low frequencies (Beta, Delta) performed better than the mid, frequencies in the discrimination of ADHD from control. Altogether, this study developed a machine learning expert system that minimises misdiagnosis of ADHD and is beneficial for the evaluation of treatment efficacy.
Collapse
|
31
|
Bidimensional and Tridimensional Poincaré Maps in Cardiology: A Multiclass Machine Learning Study. ELECTRONICS 2022. [DOI: 10.3390/electronics11030448] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Heart rate is a nonstationary signal and its variation may contain indicators of current disease or warnings about impending cardiac diseases. Hence, heart rate variation analysis has become a noninvasive tool to further study the activities of the autonomic nervous system. In this scenario, the Poincaré plot analysis has proven to be a valuable tool to support cardiac diseases diagnosis. The study’s aim is a preliminary exploration of the feasibility of machine learning to classify subjects belonging to five cardiac states (healthy, hypertension, myocardial infarction, congestive heart failure and heart transplanted) using ten unconventional quantitative parameters extracted from bidimensional and three-dimensional Poincaré maps. Knime Analytic Platform was used to implement several machine learning algorithms: Gradient Boosting, Adaptive Boosting, k-Nearest Neighbor and Naïve Bayes. Accuracy, sensitivity and specificity were computed to assess the performances of the predictive models using the leave-one-out cross-validation. The Synthetic Minority Oversampling technique was previously performed for data augmentation considering the small size of the dataset and the number of features. A feature importance, ranked on the basis of the Information Gain values, was computed. Preliminarily, a univariate statistical analysis was performed through one-way Kruskal Wallis plus post-hoc for all the features. Machine learning analysis achieved interesting results in terms of evaluation metrics, such as demonstrated by Adaptive Boosting and k-Nearest Neighbor (accuracies greater than 90%). Gradient Boosting and k-Nearest Neighbor reached even 100% score in sensitivity and specificity, respectively. The most important features according to information gain are in line with the results obtained from the statistical analysis confirming their predictive power. The study shows the proposed combination of unconventional features extracted from Poincaré maps and well-known machine learning algorithms represents a valuable approach to automatically classify patients with different cardiac diseases. Future investigations on enriched datasets will further confirm the potential application of this methodology in diagnostic.
Collapse
|
32
|
Sharifi S, Lotfi Shahreza M, Pakdel A, Reecy JM, Ghadiri N, Atashi H, Motamedi M, Ebrahimie E. Systems Biology–Derived Genetic Signatures of Mastitis in Dairy Cattle: A New Avenue for Drug Repurposing. Animals (Basel) 2021; 12:ani12010029. [PMID: 35011134 PMCID: PMC8749881 DOI: 10.3390/ani12010029] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 11/21/2021] [Accepted: 12/17/2021] [Indexed: 02/07/2023] Open
Abstract
Simple Summary Therapeutic success of bovine mastitis depends mainly on accurately diagnosing the type of pathogen involved. Despite the development prospects for bovine mastitis diagnosis, including new biomarker discovery to target specific pathogens with high sensitivity and specificity, treatment studies have shown controversial results, and the most efficient, safe, and economical treatments for mastitis are still topics of scientific debate. The goal of this research is the integration of different levels of systems biology data to predict candidate drugs for the control and management of E. coli mastitis. We propose that the novel drugs could be used by pharmaceutical scientists or veterinarians to find commercially efficacious medicines. Abstract Mastitis, a disease with high incidence worldwide, is the most prevalent and costly disease in the dairy industry. Gram-negative bacteria such as Escherichia coli (E. coli) are assumed to be among the leading agents causing acute severe infection with clinical signs. E. Coli, environmental mastitis pathogens, are the primary etiological agents of bovine mastitis in well-managed dairy farms. Response to E. Coli infection has a complex pattern affected by genetic and environmental parameters. On the other hand, the efficacy of antibiotics and/or anti-inflammatory treatment in E. coli mastitis is still a topic of scientific debate, and studies on the treatment of clinical cases show conflicting results. Unraveling the bio-signature of mastitis in dairy cattle can open new avenues for drug repurposing. In the current research, a novel, semi-supervised heterogeneous label propagation algorithm named Heter-LP, which applies both local and global network features for data integration, was used to potentially identify novel therapeutic avenues for the treatment of E. coli mastitis. Online data repositories relevant to known diseases, drugs, and gene targets, along with other specialized biological information for E. coli mastitis, including critical genes with robust bio-signatures, drugs, and related disorders, were used as input data for analysis with the Heter-LP algorithm. Our research identified novel drugs such as Glibenclamide, Ipratropium, Salbutamol, and Carbidopa as possible therapeutics that could be used against E. coli mastitis. Predicted relationships can be used by pharmaceutical scientists or veterinarians to find commercially efficacious medicines or a combination of two or more active compounds to treat this infectious disease.
Collapse
Affiliation(s)
- Somayeh Sharifi
- Department of Animal Sciences, College of Agriculture, Isfahan University of Technology, Isfahan 84156-83111, Iran;
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA;
- Correspondence: (S.S.); (E.E.)
| | - Maryam Lotfi Shahreza
- Department of Computer Engineering, Shahreza Campus, University of Isfahan, Isfahan 86149-56841, Iran;
| | - Abbas Pakdel
- Department of Animal Sciences, College of Agriculture, Isfahan University of Technology, Isfahan 84156-83111, Iran;
| | - James M. Reecy
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA;
| | - Nasser Ghadiri
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan 84156-83111, Iran;
| | - Hadi Atashi
- Department of Animal Science, Shiraz University, Shiraz 71946-84334, Iran;
| | - Mahmood Motamedi
- Department of Animal Sciences, University of Tehran, Tehran 1417935840, Iran;
| | - Esmaeil Ebrahimie
- Genomics Research Platform, School of Life Sciences, College of Science, Health and Engineering, La Trobe University, Melbourne, VIC 3086, Australia
- School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, SA 5371, Australia
- School of BioSciences, The University of Melbourne, Melbourne, VIC 3010, Australia
- Correspondence: (S.S.); (E.E.)
| |
Collapse
|
33
|
Evaluation of the Effectiveness of Herbal Components Based on Their Regulatory Signature on Carcinogenic Cancer Cells. Cells 2021; 10:cells10113139. [PMID: 34831362 PMCID: PMC8621084 DOI: 10.3390/cells10113139] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/06/2021] [Accepted: 11/09/2021] [Indexed: 12/28/2022] Open
Abstract
Predicting cancer cells’ response to a plant-derived agent is critical for the drug discovery process. Recently transcriptomes advancements have provided an opportunity to identify regulatory signatures to predict drug activity. Here in this study, a combination of meta-analysis and machine learning models have been used to determine regulatory signatures focusing on differentially expressed transcription factors (TFs) of herbal components on cancer cells. In order to increase the size of the dataset, six datasets were combined in a meta-analysis from studies that had evaluated the gene expression in cancer cell lines before and after herbal extract treatments. Then, categorical feature analysis based on the machine learning methods was applied to examine transcription factors in order to find the best signature/pattern capable of discriminating between control and treated groups. It was found that this integrative approach could recognize the combination of TFs as predictive biomarkers. It was observed that the random forest (RF) model produced the best combination rules, including AIP/TFE3/VGLL4/ID1 and AIP/ZNF7/DXO with the highest modulating capacity. As the RF algorithm combines the output of many trees to set up an ultimate model, its predictive rules are more accurate and reproducible than other trees. The discovered regulatory signature suggests an effective procedure to figure out the efficacy of investigational herbal compounds on particular cells in the drug discovery process.
Collapse
|
34
|
Lasser J, Matzhold C, Egger-Danner C, Fuerst-Waltl B, Steininger F, Wittek T, Klimek P. Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach. J Anim Sci 2021; 99:6400292. [PMID: 34662372 DOI: 10.1093/jas/skab294] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Accepted: 10/15/2021] [Indexed: 12/25/2022] Open
Abstract
Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1 = 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
Collapse
Affiliation(s)
- Jana Lasser
- Section for Science of Complex Systems, Center for Medical Statistics, Informatics, and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria.,Institute for Interactive Systems and Data Science, Graz University of Technology, 8010 Graz, Austria.,Complexity Science Hub Vienna, 1080 Vienna, Austria
| | - Caspar Matzhold
- Section for Science of Complex Systems, Center for Medical Statistics, Informatics, and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria.,Complexity Science Hub Vienna, 1080 Vienna, Austria
| | | | - Birgit Fuerst-Waltl
- Division of Livestock Sciences, University of Natural Resources and Life Sciences, 1180 Vienna, Austria
| | | | - Thomas Wittek
- Vetmeduni Vienna, University Clinic for Ruminants, 1210 Vienna, Austria
| | - Peter Klimek
- Section for Science of Complex Systems, Center for Medical Statistics, Informatics, and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria.,Complexity Science Hub Vienna, 1080 Vienna, Austria
| |
Collapse
|
35
|
Ebrahimie E, Zamansani F, Alanazi IO, Sabi EM, Khazandi M, Ebrahimi F, Mohammadi-Dehcheshmeh M, Ebrahimi M. Advances in understanding the specificity function of transporters by machine learning. Comput Biol Med 2021; 138:104893. [PMID: 34598069 DOI: 10.1016/j.compbiomed.2021.104893] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 09/20/2021] [Accepted: 09/22/2021] [Indexed: 11/25/2022]
Abstract
Understanding the underlying molecular mechanism of transporter activity is one of the major discussions in structural biology. A transporter can exclusively transport one ion (specific transporter) or multiple ions (general transporter). This study compared categorical and numerical features of general and specific calcium transporters using machine learning and attribute weighting models. To this end, 444 protein features, such as the frequency of dipeptides, organism, and subcellular location, were extracted for general (n = 103) and specific calcium transporters (n = 238). Aliphatic index, subcellular location, organism, Ile-Leu frequency, Glycine frequency, hydrophobic frequency, and specific dipeptides such as Ile-Leu, Phe-Val, and Tyr-Gln were the key features in differentiating general from specific calcium transporters. Calcium transporters in the cell outer membranes were specific, while the inner ones were general; additionally, when the hydrophobic frequency or Aliphatic index is increased, the calcium transporter act as a general transporter. Random Forest with accuracy criterion showed the highest accuracy (88.88% ±5.75%) and high AUC (0.964 ± 0.020), based on 5-fold cross-validation. Decision Tree with accuracy criterion was able to predict the specificity of calcium transporter irrespective of the organism and subcellular location. This study demonstrates the precise classification of transporter function based on sequence-derived physicochemical features.
Collapse
Affiliation(s)
- Esmaeil Ebrahimie
- Genomics Research Platform, School of Life Sciences, College of Science, Health and Engineering, La Trobe University, Melbourne, Victoria, 3086, Australia; School of Animal and Veterinary Sciences, The University of Adelaide, South Australia, 5371, Australia.
| | - Fatemeh Zamansani
- Department of Crop Production and Plant Breeding, College of Agriculture, Shiraz University, Shiraz, Iran.
| | - Ibrahim O Alanazi
- National Center for Biotechnology, Life Science and Environment Research Institute, King Abdulaziz City for Science and Technology (KACST), Riyadh, 6086, Saudi Arabia.
| | - Essa M Sabi
- Department of Pathology, Clinical Biochemistry Unit, College of Medicine, King Saud University, Riyadh, 11461, Saudi Arabia.
| | - Manouchehr Khazandi
- UniSA Clinical and Health Sciences, The University of South Australia, Adelaide, 5000, Australia.
| | - Faezeh Ebrahimi
- Faculty of Life Sciences and Biotechnology, Department of Microbiology and Microbial Biotechnology, Shahid Beheshti University, Tehran, Iran.
| | | | - Mansour Ebrahimi
- School of Animal and Veterinary Sciences, The University of Adelaide, South Australia, 5371, Australia; Department of Biology, School of Basic Sciences, University of Qom, Qom, Iran.
| |
Collapse
|
36
|
MasPA: A Machine Learning Application to Predict Risk of Mastitis in Cattle from AMS Sensor Data. AGRIENGINEERING 2021. [DOI: 10.3390/agriengineering3030037] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Mastitis is a common disease that prevails in cattle owing mainly to environmental pathogens; they are also the most expensive disease for cattle in dairy farms. Several prevention and treatment methods are available, although most of these options are quite expensive, especially for small farms. In this study, we utilized a dataset of 6600 cattle along with several of their sensory parameters (collected via inexpensive sensors) and their prevalence to mastitis. Supervised machine learning approaches were deployed to determine the most effective parameters that could be utilized to predict the risk of mastitis in cattle. To achieve this goal, 26 classification models were built, among which the best performing model (the highest accuracy in the shortest time) was selected. Hyper parameter tuning and K-fold cross validation were applied to further boost the top model’s performance, while at the same time avoiding bias and overfitting of the model. The model was then utilized to build a GUI application that could be used online as a web application. The application can predict the risk of mastitis in cattle from the inhale and exhale limits of their udder and their temperature with an accuracy of 98.1% and sensitivity and specificity of 99.4% and 98.8%, respectively. The full potential of this application can be utilized via the standalone version, which can be easily integrated into an automatic milking system to detect the risk of mastitis in real time.
Collapse
|
37
|
Fadul-Pacheco L, Delgado H, Cabrera VE. Exploring machine learning algorithms for early prediction of clinical mastitis. Int Dairy J 2021. [DOI: 10.1016/j.idairyj.2021.105051] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
38
|
Predicting Arsenic (As) Exposure on Human Health for Better Management of Drinking Water Sources. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18157997. [PMID: 34360286 PMCID: PMC8345792 DOI: 10.3390/ijerph18157997] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 07/22/2021] [Accepted: 07/24/2021] [Indexed: 11/17/2022]
Abstract
Chemical pollution in the transboundary Langat River in Malaysia is common both from point and non-point sources. Therefore, the water treatment plants (WTPS) at the Langat River Basin have experienced frequent shutdown incidents. However, the Langat River is one of the main sources of drinking water to almost one-third of the population in Selangor state. Meanwhile, several studies have reported a high concentration of Arsenic (As) in the Langat River that is toxic if ingested via drinking water. However, this is a pioneer study that predicts the As concentration in the Langat River based on time-series data from 2005-2014 to estimate the health risk associated with As ingestion via drinking water at the Langat River Basin. Several time-series prediction models were tested and Gradient Boosted Tree (GBT) gained the best result. This GBT model also fits better to predict the As concentration until December 2024. The mean concentration of As in the Langat River for both 2014 and 2024, as well as the carcinogenic and non-carcinogenic health risks of As ingestion via drinking water, were within the drinking water quality standards proposed by the World Health Organization and Ministry of Health Malaysia. However, the ingestion of trace amounts of As over a long period might be detrimental to human health because of its non-biodegradable characteristics. Therefore, it is important to manage the drinking water sources to minimise As exposure risks to human health.
Collapse
|
39
|
Zhao Z, Cheng X, Sun X, Ma S, Feng H, Zhao L. Prediction Model of Anastomotic Leakage Among Esophageal Cancer Patients After Receiving an Esophagectomy: Machine Learning Approach. JMIR Med Inform 2021; 9:e27110. [PMID: 34313597 PMCID: PMC8367102 DOI: 10.2196/27110] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 04/10/2021] [Accepted: 06/16/2021] [Indexed: 12/13/2022] Open
Abstract
Background Anastomotic leakage (AL) is one of the severe postoperative adverse events (5%-30%), and it is related to increased medical costs in cancer patients who undergo esophagectomies. Machine learning (ML) methods show good performance at predicting risk for AL. However, AL risk prediction based on ML models among the Chinese population is unavailable. Objective This study uses ML techniques to develop and validate a risk prediction model to screen patients with emerging AL risk factors. Methods Analyses were performed using medical records from 710 patients who underwent esophagectomies at the National Clinical Research Center for Cancer between January 2010 and May 2015. We randomly split (9:1) the data set into a training data set of 639 patients and a testing data set of 71 patients using a computer algorithm. We assessed multiple classification tools to create a multivariate risk prediction model. Our ML algorithms contained decision tree, random forest, naive Bayes, and logistic regression with least absolute shrinkage and selection operator. The optimal AL prediction model was selected based on model evaluation metrics. Results The final risk panel included 36 independent risk features. Of those, 10 features were significantly identified by the logistic model, including aortic calcification (OR 2.77, 95% CI 1.32-5.81), celiac trunk calcification (OR 2.79, 95% CI 1.20-6.48), forced expiratory volume 1% (OR 0.51, 95% CI 0.30-0.89); TLco (OR 0.56, 95% CI 0.27-1.18), peripheral vascular disease (OR 4.97, 95% CI 1.44-17.07), laparoscope (OR 3.92, 95% CI 1.23-12.51), postoperative length of hospital stay (OR 1.17, 95% CI 1.13-1.21), vascular permeability activity (OR 0.46, 95% CI 0.14-1.48), and fat liquefaction of incisions (OR 4.36, 95% CI 1.86-10.21). Logistic regression with least absolute shrinkage and selection operator offered the highest prediction quality with an area under the receiver operator characteristic of 72% in the training data set. The testing model also achieved similar high performance. Conclusions Our model offered a prediction of AL with high accuracy, assisting in AL prevention and treatment. A personalized ML prediction model with a purely data-driven selection of features is feasible and effective in predicting AL in patients who underwent esophagectomy.
Collapse
Affiliation(s)
- Ziran Zhao
- Thoracic Surgery Department, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Xi Cheng
- Department of Global Health Management, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, United States
| | - Xiao Sun
- Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, United States
| | - Shanrui Ma
- Thoracic Surgery Department, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hao Feng
- Thoracic Surgery Department, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Liang Zhao
- Thoracic Surgery Department, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| |
Collapse
|
40
|
van der Voort M, Jensen D, Kamphuis C, Athanasiadis IN, De Vries A, Hogeveen H. Invited review: Toward a common language in data-driven mastitis detection research. J Dairy Sci 2021; 104:10449-10461. [PMID: 34304870 DOI: 10.3168/jds.2021-20311] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 05/30/2021] [Indexed: 11/19/2022]
Abstract
Sensor technologies for mastitis detection have resulted in the collection and availability of a large amount of data. As a result, scientific publications reporting mastitis detection research have become less driven by approaches based on biological assumptions and more by data-driven modeling. Most of these approaches try to predict mastitis events from (combinations of) raw sensor data to which a wide variety of methods are applied originating from machine learning and classical statistical approaches. However, an even wider variety in terminologies is used by researchers for methods that are similar in nature. This makes it difficult for readers from other disciplines to understand the specific methods that are used and how these differ from each other. The aim of this paper was to provide a framework (filtering, transformation, and classification) for describing the different methods applied in sensor data-based clinical mastitis detection research and use this framework to review and categorize the approaches and underlying methods described in the scientific literature on mastitis detection. We identified 40 scientific publications between 1992 and 2020 that applied methods to detect clinical mastitis from sensor data. Based on these publications, we developed and used the framework and categorized these scientific publications into the 2 data processing techniques of filtering and transformation. These data processing techniques make raw data more amendable to be used for the third step in our framework, that of classification, which is used to distinguish between healthy and nonhealthy (mastitis) cows. Most publications (n = 34) used filtering or transformation, or a combination of these 2, for data processing before classification, whereas the remaining publications (n = 6) classified the observations directly from raw data. Concerning classification, applying a simple threshold was the most used method (n = 19 publications). Our work identified that within approaches several different methods and terminologies for similar methods were used. Not all publications provided a clear description of the method used, and therefore it seemed that different methods were used between publications, whereas in fact just a different terminology was used, or the other way around. This paper is intended to serve as a reference for people from various research disciplines who need to collaborate and communicate efficiently about the topic of sensor-based mastitis detection and the methods used in this context. The framework used in this paper can support future research to correctly classify approaches and methods, which can improve the understanding of scientific publication. We encourage future research on sensor-based animal disease detection, including that of mastitis detection, to use a more coherent terminology for methods, and clearly state which technique (e.g., filtering) and approach (e.g., moving average) are used. This paper, therefore, can serve as a starting point and further stimulates the interdisciplinary cooperation in sensor-based mastitis research.
Collapse
Affiliation(s)
- M van der Voort
- Business Economics Group, Wageningen University & Research, 6706 KN Wageningen, the Netherlands.
| | - D Jensen
- Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg C, Denmark
| | - C Kamphuis
- Animal Breeding & Genomics, Wageningen University & Research, 6708 PB Wageningen, the Netherlands
| | - I N Athanasiadis
- Geo-Information Science and Remote Sensing Laboratory, Wageningen University & Research, 6706 KN Wageningen, the Netherlands
| | - A De Vries
- Department of Animal Sciences, University of Florida, Gainesville 32611
| | - H Hogeveen
- Business Economics Group, Wageningen University & Research, 6706 KN Wageningen, the Netherlands
| |
Collapse
|
41
|
Satoła A, Bauer EA. Predicting Subclinical Ketosis in Dairy Cows Using Machine Learning Techniques. Animals (Basel) 2021; 11:ani11072131. [PMID: 34359259 PMCID: PMC8300340 DOI: 10.3390/ani11072131] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 07/15/2021] [Accepted: 07/17/2021] [Indexed: 11/16/2022] Open
Abstract
Simple Summary The maintenance of cows in good health and physical condition is an important component of dairy cattle management. One of the major metabolic disorders in dairy cows is subclinical ketosis. Due to financial and organizational reasons it is often impossible to test all cows in a herd for ketosis using standard blood examination method. Using milk data from test-day records, obtained without additional costs for breeders, we found diagnostic models identifying cows-at-risk of subclinical ketosis. In addition, to select the best models, we present a general scoring approach for various machine learning models. With our models, breeders can identify dairy cows-at-risk of subclinical ketosis and implement appropriate management strategies and prevent losses in milk production. Abstract The diagnosis of subclinical ketosis in dairy cows based on blood ketone bodies is a challenging and costly procedure. Scientists are searching for tools based on results of milk performance assessment that would allow monitoring the risk of subclinical ketosis. The objective of the study was (1) to design a scoring system that would allow choosing the best machine learning models for the identification of cows-at-risk of subclinical ketosis, (2) to select the best performing models, and (3) to validate them using a testing dataset containing unseen data. The scoring system was developed using two machine learning modeling pipelines, one for regression and one for classification. As part of the system, different feature selections, outlier detection, data scaling and oversampling methods were used. Various linear and non-linear models were fit using training datasets and evaluated on holdout, testing the datasets. For the assessment of suitability of individual models for predicting subclinical ketosis, three β-hydroxybutyrate concentration in blood (bBHB) thresholds were defined: 1.0, 1.2 and 1.4 mmol/L. Considering the thresholds of 1.2 and 1.4, the logistic regression model was found to be the best fitted model, which included independent variables such as fat-to-protein ratio, acetone and β-hydroxybutyrate concentrations in milk, lactose percentage, lactation number and days in milk. In the cross-validation, this model showed an average sensitivity of 0.74 or 0.75 and specificity of 0.76 or 0.78, at the pre-defined bBHB threshold 1.2 or 1.4 mmol/L, respectively. The values of these metrics were also similar in the external validation on the testing dataset (0.72 or 0.74 for sensitivity and 0.80 or 0.81 for specificity). For the bBHB threshold at 1.0 mmol/L, the best classification model was the model based on the SVC (Support Vector Classification) machine learning method, for which the sensitivity in the cross-validation was 0.74 and the specificity was 0.73. These metrics had lower values for the testing dataset (0.57 and 0.72 respectively). Regression models were characterized by poor fitness to data (R2 < 0.4). The study results suggest that the prediction of subclinical ketosis based on data from test-day records using classification methods and machine learning algorithms can be a useful tool for monitoring the incidence of this metabolic disorder in dairy cattle herds.
Collapse
Affiliation(s)
- Alicja Satoła
- Department of Genetics, Animal Breeding and Ethology, Faculty of Animal Science, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
- Correspondence:
| | - Edyta Agnieszka Bauer
- Department of Animal Reproduction, Anatomy and Genomics, Faculty of Animal Science, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland;
| |
Collapse
|
42
|
Comparison of machine learning methods to predict udder health status based on somatic cell counts in dairy cows. Sci Rep 2021; 11:13642. [PMID: 34211046 PMCID: PMC8249463 DOI: 10.1038/s41598-021-93056-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/21/2021] [Indexed: 11/29/2022] Open
Abstract
Bovine mastitis is one of the most important economic and health issues in dairy farms. Data collection during routine recording procedures and access to large datasets have shed the light on the possibility to use trained machine learning algorithms to predict the udder health status of cows. In this study, we compared eight different machine learning methods (Linear Discriminant Analysis, Generalized Linear Model with logit link function, Naïve Bayes, Classification and Regression Trees, k-Nearest Neighbors, Support Vector Machines, Random Forest and Neural Network) to predict udder health status of cows based on somatic cell counts. Prediction accuracies of all methods were above 75%. According to different metrics, Neural Network, Random Forest and linear methods had the best performance in predicting udder health classes at a given test-day (healthy or mastitic according to somatic cell count below or above a predefined threshold of 200,000 cells/mL) based on the cow’s milk traits recorded at previous test-day. Our findings suggest machine learning algorithms as a promising tool to improve decision making for farmers. Machine learning analysis would improve the surveillance methods and help farmers to identify in advance those cows that would possibly have high somatic cell count in the subsequent test-day.
Collapse
|
43
|
Ahsan R, Tahsili MR, Ebrahimi F, Ebrahimie E, Ebrahimi M. Image processing unravels the evolutionary pattern of SARS-CoV-2 against SARS and MERS through position-based pattern recognition. Comput Biol Med 2021; 134:104471. [PMID: 34004573 PMCID: PMC8106241 DOI: 10.1016/j.compbiomed.2021.104471] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Revised: 04/27/2021] [Accepted: 05/02/2021] [Indexed: 12/16/2022]
Abstract
SARS-COV-2, Severe Acute Respiratory Syndrome (SARS), and the Middle East respiratory syndrome-related coronavirus (MERS) viruses are from the coronaviridae family; the former became a global pandemic (with low mortality rate) while the latter were confined to a limited region (with high mortality rates). To investigate the possible structural differences at basic levels for the three viruses, genomic and proteomic sequences were downloaded and converted to polynomial datasets. Seven attribute weighting (feature selection) models were employed to find the key differences in their genome's nucleotide sequence. Most attribute weighting models selected the final nucleotide sequences (from 29,000th nucleotide positions to the end of the genome) as significantly different among the three virus classes. The genome and proteome sequences of this hot zone area (which corresponds to the 3'UTR region and encodes for nucleoprotein (N)) and Spike (S) protein sequences (as the most important viral protein) were converted into binary images and were analyzed by image processing techniques and Convolutional deep Neural Network (CNN). Although the predictive accuracy of CNN for Spike (S) proteins was low (0.48%), the machine-based learning algorithms were able to classify the three members of coronaviridae viruses with 100% accuracy based on 3'UTR region. For the first time ever, the relationship between the possible structural differences of coronaviruses at the sequential levels and their pathogenesis are being reported, which paves the road to deciphering the high pathogenicity of the SARS-COV-2 virus.
Collapse
Affiliation(s)
- Reza Ahsan
- Department of Computer Engineering, Qom Branch, Islamic Azad University, Qom, Iran
| | | | - Faezeh Ebrahimi
- Faculty of Life Sciences and Biotechnology, Department of Microbiology and Microbial Biotechnology, Shahid Beheshti University, Tehran, Iran
| | - Esmaeil Ebrahimie
- Genomics Research Platform, School of Life Sciences, College of Science, Health and Engineering, La Trobe University, Melbourne, Victoria, 3086, Australia,School of Animal and Veterinary Sciences, The University of Adelaide, South Australia, 5371, Australia
| | - Mansour Ebrahimi
- Department of Biology, School of Basic Sciences, University of Qom, Qom, Iran,School of Animal and Veterinary Sciences, The University of Adelaide, South Australia, 5371, Australia,Corresponding author. Department of Biology, School of Basic Sciences, University of Qom, Qom, Iran
| |
Collapse
|
44
|
Rule Discovery in Milk Content towards Mastitis Diagnosis: Dealing with Farm Heterogeneity over Multiple Years through Classification Based on Associations. Animals (Basel) 2021; 11:ani11061638. [PMID: 34205858 PMCID: PMC8227403 DOI: 10.3390/ani11061638] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 05/21/2021] [Accepted: 05/28/2021] [Indexed: 12/02/2022] Open
Abstract
Simple Summary Invisible (subclinical) mastitis decreases milk quality and production. Invisible mastitis is linked to an increased use of antimicrobials. The risk of the emergence of antimicrobial-resistant bacteria is a major public health concern worldwide. Therefore, early detection of infected cows is of great importance. Machine learning has opened a new avenue for early mastitis prediction based on simple and accessible milking parameters, such as milk volume, fat, protein, lactose, electrical conductivity (EC), milking time, and milking peak flow. However, farm heterogeneity is a major challenge where multiple patterns can predict mastitis. Here, we employed a classification based on associations and scaling approach for multiple pattern discovery over multiple years. The approach we have developed helps to address farm heterogeneity and generalise machine learning-based diagnosis of mastitis worldwide. Abstract Subclinical mastitis, an economically challenging disease of dairy cattle, is associated with an increased use of antimicrobials which reduces milk quantity and quality. It is more common than clinical mastitis and far more difficult to detect. Recently, much attention has been paid to the development of machine-learning expert systems for early detection of subclinical mastitis from milking features. However, differences between animals within a farm as well as between farms, particularly across multiple years, are major obstacles to the generalisation of machine learning models. Here, for the first time, we integrated scaling by quartiling with classification based on associations in a multi-year study to deal with farm heterogeneity by discovery of multiple patterns towards mastitis. The data were obtained from one farm comprising Holstein Friesian cows in Ongaonga, New Zealand, using an electronic automated monitoring system. The data collection was repeated annually over 3 consecutive years. Some discovered rules, such as when the milking peak flow is low, electrical conductivity (EC) of milk is low, milk lactose is low, milk fat is high, and milk volume is low, the cow has subclinical mastitis, reached high confidence (>70%) in multiple years. On averages, over 3 years, low level of milk lactose and high value of milk EC were part of 93% and 83.8% of all subclinical mastitis detecting rules, offering a reproducible pattern of subclinical mastitis detection. The scaled year-independent combinational rules provide an easy-to-apply and cost-effective machine-learning expert system for early detection of hidden mastitis using milking parameters.
Collapse
|
45
|
|
46
|
Prediction of Streptococcus uberis clinical mastitis treatment success in dairy herds by means of mass spectrometry and machine-learning. Sci Rep 2021; 11:7736. [PMID: 33833319 PMCID: PMC8032699 DOI: 10.1038/s41598-021-87300-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 03/26/2021] [Indexed: 12/26/2022] Open
Abstract
Streptococcus uberis is one of the leading pathogens causing mastitis worldwide. Identification of S. uberis strains that fail to respond to treatment with antibiotics is essential for better decision making and treatment selection. We demonstrate that the combination of supervised machine learning and matrix-assisted laser desorption ionization/time of flight (MALDI-TOF) mass spectrometry can discriminate strains of S. uberis causing clinical mastitis that are likely to be responsive or unresponsive to treatment. Diagnostics prediction systems trained on 90 individuals from 26 different farms achieved up to 86.2% and 71.5% in terms of accuracy and Cohen’s kappa. The performance was further increased by adding metadata (parity, somatic cell count of previous lactation and count of positive mastitis cases) to encoded MALDI-TOF spectra, which increased accuracy and Cohen’s kappa to 92.2% and 84.1% respectively. A computational framework integrating protein–protein networks and structural protein information to the machine learning results unveiled the molecular determinants underlying the responsive and unresponsive phenotypes.
Collapse
|
47
|
Robles I, Nolan DT, Fendley CA, Stokley HL, France TL, Ferrell JL, Costa JHC. Technical note: Evaluation of a commercial on-farm milk leukocyte differential tester to identify subclinical mastitis cases in dairy cows. J Dairy Sci 2021; 104:4942-4949. [PMID: 33612234 DOI: 10.3168/jds.2020-19299] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 11/27/2020] [Indexed: 01/16/2023]
Abstract
The objective of this study was to validate the precision and accuracy of a milk leukocyte differential tester to identify subclinical mastitis cases in dairy cows. Milk samples from individual quarters (n = 320) of 80 Holstein cows were aseptically collected and analyzed in this study. Each sample was divided into 2 replicate samples after mixing. One replicate was analyzed for somatic cell count (SCC) using the current gold standard of flow cytometry immediately after milking. The second sample was evaluated using the on-farm milk leukocyte differential tester directly after milking, where total leukocyte count (TLC; cells/mL) was obtained. The SCC and TLC were used to calculate somatic cell score (SCS) and TLC score [TLS = log2 (TLC/100,000) + 3]. Two subclinical mastitis thresholds were set: >200,000 (low) and >400,000 (high) cells/mL. First, precision was determined between the 2 methods. Total leukocyte count and calculated TLS from the milk leukocyte differential device were compared with the gold standard using correlation and regression coefficient of determination analyses. Correlation coefficients (r) were 0.97 for TLC and SCC and 0.90 for TLS and SCS. The coefficient of determination for regression (R2) was 0.94 for TLC and SCC and 0.80 for TLS and SCS. Slopes of regression for scores and measures were 0.36 [95% confidence interval (CI): 0.35-0.37] and 0.69 (CI: 0.65-0.73), respectively; both were significantly different from 1. Sensitivity, specificity, and diagnostic accuracy were calculated for correct diagnosis of the 2 SCC thresholds using the gold standard as reference. The sensitivity of the on-farm test was 58% (95% CI: 44 to 71%) and 73% (95% CI: 56 to 86%) for the low and high thresholds, respectively. The specificities for the on-farm test were 100% (95% CI: 99 to 100%) and 100% (95% CI: 98 to 100%) for the low and high thresholds, respectively. Subclinical diagnosis accuracies were 93% (95% CI: 89 to 95%) and 96% (95% CI: 92 to 98%) for the low and high thresholds, respectively. The on-farm milk leukocyte differential tester was precise but not overall accurate for total cell counts; it had high specificity and accuracy for diagnosis compared with a standard diagnostic tool. These results suggest that the tested system is a promising technology to detect subclinical mastitis on-farm.
Collapse
Affiliation(s)
- I Robles
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215
| | - D T Nolan
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215
| | - C A Fendley
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215
| | - H L Stokley
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215
| | - T L France
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215
| | - J L Ferrell
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215
| | - J H C Costa
- Dairy Science Program, Department of Animal and Food Sciences, University of Kentucky, Lexington 40546-0215.
| |
Collapse
|
48
|
Water Demand Prediction Using Machine Learning Methods: A Case Study of the Beijing–Tianjin–Hebei Region in China. WATER 2021. [DOI: 10.3390/w13030310] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Predicting water demand helps decision-makers allocate regional water resources efficiently, thereby preventing water waste and shortage. The aim of this study is to predict water demand in the Beijing–Tianjin–Hebei region of North China. The explanatory variables associated with economy, community, water use, and resource availability were identified. Eleven statistical and machine learning models were built, which used data covering the 2004–2019 period. Interpolation and extrapolation scenarios were conducted to find the most suitable predictive model. The results suggest that the gradient boosting decision tree (GBDT) model demonstrates the best prediction performance in the two scenarios. The model was further tested for three other regions in China, and its robustness was validated. The water demand in 2020–2021 was provided. The results show that the identified explanatory variables were effective in water demand prediction. The machine learning models outperformed the statistical models, with the ensemble models being superior to the single predictor models. The best predictive model can also be applied to other regions to help forecast water demand to ensure sustainable water resource management.
Collapse
|
49
|
Panahi R, Ebrahimie E, Niazi A, Afsharifar A. Integration of meta-analysis and supervised machine learning for pattern recognition in breast cancer using epigenetic data. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100629] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
50
|
Application of machine learning to improve dairy farm management: A systematic literature review. Prev Vet Med 2020; 187:105237. [PMID: 33418514 DOI: 10.1016/j.prevetmed.2020.105237] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 11/20/2020] [Accepted: 12/13/2020] [Indexed: 11/22/2022]
Abstract
In recent years, several researchers and practitioners applied machine learning algorithms in the dairy farm context and discussed several solutions to predict various variables of interest, most of which were related to incipient diseases. The objective of this article is to identify, assess, and synthesize the papers that discuss the application of machine learning in the dairy farm management context. Using a systematic literature review (SLR) protocol, we retrieved 427 papers, of which 38 papers were determined as primary studies and thus were analysed in detail. More than half of the papers (55 %) addressed disease detection. The other two categories of problems addressed were milk production and milk quality. Seventy-one independent variables were identified and grouped into seven categories. The two prominent categories that were used in more than half of the papers were milking parameters and milk properties. The other categories of independent variables were milk content, pregnancy/calving information, cow characteristics, lactation, and farm characteristics. Twenty-three algorithms were identified, which we grouped into four categories. Decision tree-based algorithms are by far the most used followed by artificial neural network-based algorithms. Regression-based algorithms and other algorithms that do not belong to the previous categories were used in 13 papers. Twenty-three evaluation parameters were identified of which 7 were used 3 or more times. The three evaluation parameters that were used by more than half of the papers are sensitivity, specificity, RMSE. The challenges most encountered were feature selection and unbalanced data and together with problem size, overfitting/estimating, and parameter tuning account for three-quarters of the challenges identified. To the best of our knowledge, this is the first SLR study on the use of machine learning to improve dairy farm management, and to this end, this study will be valuable not only for researchers but also practitioners in dairy farms.
Collapse
|