1
|
Qasrawi R, Badrasawi M, Al-Halawa DA, Polo SV, Khader RA, Al-Taweel H, Alwafa RA, Zahdeh R, Hahn A, Schuchardt JP. Identification and prediction of association patterns between nutrient intake and anemia using machine learning techniques: results from a cross-sectional study with university female students from Palestine. Eur J Nutr 2024; 63:1635-1649. [PMID: 38512358 PMCID: PMC11329411 DOI: 10.1007/s00394-024-03360-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 02/26/2024] [Indexed: 03/23/2024]
Abstract
PURPOSE This study utilized data mining and machine learning (ML) techniques to identify new patterns and classifications of the associations between nutrient intake and anemia among university students. METHODS We employed K-means clustering analysis algorithm and Decision Tree (DT) technique to identify the association between anemia and vitamin and mineral intakes. We normalized and balanced the data based on anemia weighted clusters for improving ML models' accuracy. In addition, t-tests and Analysis of Variance (ANOVA) were performed to identify significant differences between the clusters. We evaluated the models on a balanced dataset of 755 female participants from the Hebron district in Palestine. RESULTS Our study found that 34.8% of the participants were anemic. The intake of various micronutrients (i.e., folate, Vit A, B5, B6, B12, C, E, Ca, Fe, and Mg) was below RDA/AI values, which indicated an overall unbalanced malnutrition in the present cohort. Anemia was significantly associated with intakes of energy, protein, fat, Vit B1, B5, B6, C, Mg, Cu and Zn. On the other hand, intakes of protein, Vit B2, B5, B6, C, E, choline, folate, phosphorus, Mn and Zn were significantly lower in anemic than in non-anemic subjects. DT classification models for vitamins and minerals (accuracy rate: 82.1%) identified an inverse association between intakes of Vit B2, B3, B5, B6, B12, E, folate, Zn, Mg, Fe and Mn and prevalence of anemia. CONCLUSIONS Besides the nutrients commonly known to be linked to anemia-like folate, Vit B6, C, B12, or Fe-the cluster analyses in the present cohort of young female university students have also found choline, Vit E, B2, Zn, Mg, Mn, and phosphorus as additional nutrients that might relate to the development of anemia. Further research is needed to elucidate if the intake of these nutrients might influence the risk of anemia.
Collapse
Affiliation(s)
- Radwan Qasrawi
- Department of Computer Science, Al-Quds University, Jerusalem, Palestine
- Department of Computer Engineering, Istinye University, Istanbul, Turkey
| | - Manal Badrasawi
- Department of Nutrition and Food Technology, Faculty of Agriculture and Veterinary Medicine, An-Najah National University, Nablus, West Bank, Palestine
| | | | | | - Rami Abu Khader
- Department of Computer Science, Al-Quds University, Jerusalem, Palestine
| | - Haneen Al-Taweel
- Department of Computer Science, Al-Quds University, Jerusalem, Palestine
| | - Reem Abu Alwafa
- Department of Nutrition and Food Technology, Faculty of Agriculture and Veterinary Medicine, An-Najah National University, Nablus, West Bank, Palestine
| | - Rana Zahdeh
- Department of Applied Chemistry and Biology, College of Applied Sciences, Palestine Polytechnic University, Hebron, West Bank, Palestine
| | - Andreas Hahn
- Institute of Food Science and Human Nutrition, Leibniz University Hannover, Hannover, Germany
| | - Jan Philipp Schuchardt
- Institute of Food Science and Human Nutrition, Leibniz University Hannover, Hannover, Germany.
| |
Collapse
|
2
|
Darshan BSD, Sampathila N, Bairy MG, Belurkar S, Prabhu S, Chadaga K. Detection of anemic condition in patients from clinical markers and explainable artificial intelligence. Technol Health Care 2024; 32:2431-2444. [PMID: 38339945 DOI: 10.3233/thc-231207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2024]
Abstract
BACKGROUND Anaemia is a commonly known blood illness worldwide. Red blood cell (RBC) count or oxygen carrying capability being insufficient are two ways to describe anaemia. This disorder has an impact on the quality of life. If anaemia is detected in the initial stage, appropriate care can be taken to prevent further harm. OBJECTIVE This study proposes a machine learning approach to identify anaemia from clinical markers, which will help further in clinical practice. METHODS The models are designed with a dataset of 364 samples and 12 blood test attributes. The developed algorithm is expected to provide decision support to the clinicians based on blood markers. Each model is trained and validated on several performance metrics. RESULTS The accuracy obtained by the random forest, K nearest neighbour, support vector machine, Naive Bayes, xgboost, and catboost are 97%, 98%, 95%, 95%, 98% and 97% respectively. Four explainers such as Shapley Additive Values (SHAP), QLattice, Eli5 and local interpretable model-agnostic explanations (LIME) are explored for interpreting the model predictions. CONCLUSION The study provides insights into the potential of machine learning algorithms for classification and may help in the development of automated and accurate diagnostic tools for anaemia.
Collapse
Affiliation(s)
- B S Dhruva Darshan
- Department of Biomedical Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Karnataka, India
| | - Niranjana Sampathila
- Department of Biomedical Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Karnataka, India
| | - Muralidhar G Bairy
- Department of Biomedical Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Karnataka, India
| | - Sushma Belurkar
- Haematology and Clinical Pathology Lab, Kasturba Medical College, Manipal Academy of Higher Education, Karnataka, India
| | - Srikanth Prabhu
- Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Karnataka, India
| | - Krishnaraj Chadaga
- Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Karnataka, India
| |
Collapse
|
3
|
A New Artificial Intelligence Approach Using Extreme Learning Machine as the Potentially Effective Model to Predict and Analyze the Diagnosis of Anemia. Healthcare (Basel) 2023; 11:healthcare11050697. [PMID: 36900702 PMCID: PMC10000789 DOI: 10.3390/healthcare11050697] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 02/09/2023] [Accepted: 02/16/2023] [Indexed: 03/02/2023] Open
Abstract
The procedure to diagnose anemia is time-consuming and resource-intensive due to the existence of a multitude of symptoms that can be felt physically or seen visually. Anemia also has several forms, which can be distinguished based on several characteristics. It is possible to diagnose anemia through a quick, affordable, and easily accessible laboratory test known as the complete blood count (CBC), but the method cannot directly identify different kinds of anemia. Therefore, further tests are required to establish a gold standard for the type of anemia in a patient. These tests are uncommon in settings that offer healthcare on a smaller scale because they require expensive equipment. Moreover, it is also difficult to discern between beta thalassemia trait (BTT), iron deficiency anemia (IDA), hemoglobin E (HbE), and combination anemias despite the presence of multiple red blood cell (RBC) formulas and indices with differing optimal cutoff values. This is due to the existence of several varieties of anemia in individuals, making it difficult to distinguish between BTT, IDA, HbE, and combinations. Therefore, a more precise and automated prediction model is proposed to distinguish these four types to accelerate the identification process for doctors. Historical data were retrieved from the Laboratory of the Department of Clinical Pathology and Laboratory Medicine, Faculty of Medicine, Public Health, and Nursing, Universitas Gadjah Mada, Yogyakarta, Indonesia for this purpose. Furthermore, the model was developed using the algorithm for the extreme learning machine (ELM). This was followed by the measurement of the performance using the confusion matrix and 190 data representing the four classes, and the results showed 99.21% accuracy, 98.44% sensitivity, 99.30% precision, and an F1 score of 98.84%.
Collapse
|
4
|
Identification of Anemia and Its Severity Level in a Peripheral Blood Smear Using 3-Tier Deep Neural Network. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12105030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The automatic detection of blood cell elements for identifying morphological deformities is still a challenging research domain. It has a pivotal role in cognition and detecting the severity level of disease. Using a simple microscope, manual disease detection, and morphological disorders in blood cells is mostly time-consuming and erroneous. Due to the overlapped structure of RBCs, pathologists face challenges in differentiating between normal and abnormal cell shape and size precisely. Currently, convolutional neural network-based algorithms are effective tools for addressing this issue. Existing techniques fail to provide effective anemia detection, and severity level prediction due to RBCs’ dense and overlapped structure, non-availability of standard datasets related to blood diseases, and severity level detection techniques. This work proposed a three tier deep convolutional fused network (3-TierDCFNet) to extract optimum morphological features and identify anemic images to predict the severity of anemia. The proposed model comprises two modules: Module-I classifies the input image into two classes, i.e., Healthy and Anemic, while Module-II detects the anemia severity level and categorizes it into Mild or Chronic. After each tier’s training, a validation function is employed to reduce the inappropriate feature selection. To authenticate the proposed model for healthy, anemic RBC classification and anemia severity level detection, a state-of-the-art anemic and healthy RBC dataset was developed in collaboration with Shaukat Khanum Hospital and Research Center (SKMCH&RC), Pakistan. To evaluate the proposed model, the training, validation, and test accuracies were computed along with recall, F1-Score, and specificity. The global results reveal that the proposed model achieved 91.37%, 88.85%, and 86.06% training, validation, and test accuracies with 98.95%, 98.12%, and 98.12% recall F1-Score and specificity, respectively.
Collapse
|
5
|
Qasrawi R, Abu Al-Halawa D. Cluster Analysis and Classification Model of Nutritional Anemia Associated Risk Factors Among Palestinian Schoolchildren, 2014. Front Nutr 2022; 9:838937. [PMID: 35619964 PMCID: PMC9127973 DOI: 10.3389/fnut.2022.838937] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 03/21/2022] [Indexed: 11/13/2022] Open
Abstract
Nutritional inadequacy has been a major health problem worldwide. One of the many health problems that result from it is anemia. Anemia is considered a health concern among all ages, particularly children, as it has been associated with cognitive and developmental delays. Researchers have investigated the association between nutritional deficiencies and anemia through various methods. As novel analytical methods are needed to ascertain the association and reveal indirect ones, we aimed to classify nutritional anemia using the cluster analysis approach. In this study, we included 4,762 students aged between 10 and 17 years attending public and UNRWA schools in the West Bank. Students' 24-h food recall and blood sample data were collected for nutrient intake and hemoglobin analysis. The K-means cluster analysis was used to cluster the hemoglobin levels into two groups. Vitamin B12, folate, and iron intakes were used as the indicators of nutrient intake associated with anemia and were classified as per the Recommended Dietary Allowance (RDA) values. We applied the Classification and Regression Tree (CRT) model for studying the association between hemoglobin clusters and vitamin B12, folate, and iron intakes, sociodemographic variables, and health-related risk factors, accounting for grade and age. Results indicated that 46.4% of the students were classified into the low hemoglobin cluster, and 60.7, 72.5, and 30.3% of vitamin B12, folate, and iron intakes, respectively, were below RDA. The CRT analysis indicated that vitamin B12, iron, and folate intakes are important factors related to anemia in girls associated with age, locality, food consumption patterns, and physical activity levels, while iron and folate intakes were significant factors related to anemia in boys associated with the place of residence and the educational level of their mothers. The deployment of clustering and classification techniques for identifying the association between anemia and nutritional factors might facilitate the development of nutritional anemia prevention and intervention programs that will improve the health and wellbeing of schoolchildren.
Collapse
Affiliation(s)
- Radwan Qasrawi
- Department of Computer Science, Al-Quds University, Jerusalem, Palestine
- Department of Computer Engineering, Istinye University, Istanbul, Turkey
| | | |
Collapse
|
6
|
Meitei AJ, Saini A, Mohapatra BB, Singh KJ. Predicting child anaemia in the North-Eastern states of India: a machine learning approach. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT 2022; 13:2949-2962. [PMCID: PMC9441193 DOI: 10.1007/s13198-022-01765-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 07/14/2022] [Accepted: 08/08/2022] [Indexed: 01/07/2024]
Abstract
Child anaemia is a serious global health issue and India is one of the highest contributors among the developing nations. Researchers identify many harmful effects of anaemia, which include psychomotor retardation, which in turn decreases the learning ability and causes low intelligence among pre-school children. The effects also include behavioural delays, low immunity, and susceptibility to frequent infections, increased mortality, and disability. The present study aims to predict anaemia among children in North-East India by applying Machine Learning (ML) algorithms to latest available National Family Health Survey (NFHS)-4 data. Out of the total 29,312 eligible children (6–59 months) in North-East India, a total of 21,000 children with demographic variables without any missing observations, wherein 10,460 are anaemic, is considered for this study. Machine learning (ML) algorithms have been applied through 3 different types of penalized regression methods—ridge, least absolute shrinkage and selection operator, and elastic net for predicting anaemia. A systematic assessment of algorithms is performed in terms of accuracy, sensitivity, specificity, F1-Score, and Cohen’s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k$$\end{document} k -Statistics. Having achieved the receiver operating characteristic value of over 70% in training and accuracy of above 64% while testing, it can be safely asserted that factors like mother’s anaemic status, age of the child, social status, mother’s age, mother’s education, religion are important in identifying the child as anaemic.
Collapse
Affiliation(s)
- A. Jiran Meitei
- Department of Mathematics, Maharaja Agrasen College, University of Delhi, New Delhi, India
| | - Akanksha Saini
- Department of Operational Research, University of Delhi, New Delhi, Delhi 110007 India
| | | | | |
Collapse
|
7
|
Haas O, Maier A, Rothgang E. Rule-Based Models for Risk Estimation and Analysis of In-hospital Mortality in Emergency and Critical Care. Front Med (Lausanne) 2021; 8:785711. [PMID: 34820408 PMCID: PMC8606583 DOI: 10.3389/fmed.2021.785711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 10/18/2021] [Indexed: 11/14/2022] Open
Abstract
We propose a novel method that uses associative classification and odds ratios to predict in-hospital mortality in emergency and critical care. Manual mortality risk scores have previously been used to assess the care needed for each patient and their need for palliative measures. Automated approaches allow providers to get a quick and objective estimation based on electronic health records. We use association rule mining to find relevant patterns in the dataset. The odds ratio is used instead of classical association rule mining metrics as a quality measure to analyze association instead of frequency. The resulting measures are used to estimate the in-hospital mortality risk. We compare two prediction models: one minimal model with socio-demographic factors that are available at the time of admission and can be provided by the patients themselves, namely gender, ethnicity, type of insurance, language, and marital status, and a full model that additionally includes clinical information like diagnoses, medication, and procedures. The method was tested and validated on MIMIC-IV, a publicly available clinical dataset. The minimal prediction model achieved an area under the receiver operating characteristic curve value of 0.69, while the full prediction model achieved a value of 0.98. The models serve different purposes. The minimal model can be used as a first risk assessment based on patient-reported information. The full model expands on this and provides an updated risk assessment each time a new variable occurs in the clinical case. In addition, the rules in the models allow us to analyze the dataset based on data-backed rules. We provide several examples of interesting rules, including rules that hint at errors in the underlying data, rules that correspond to existing epidemiological research, and rules that were previously unknown and can serve as starting points for future studies.
Collapse
Affiliation(s)
- Oliver Haas
- Department of Industrial Engineering and Health, Institute of Medical Engineering, Technical University Amberg-Weiden, Weiden, Germany.,Pattern Recognition Lab, Department of Computer Science, Technical Faculty, Friedrich-Alexander University, Erlangen, Germany
| | - Andreas Maier
- Pattern Recognition Lab, Department of Computer Science, Technical Faculty, Friedrich-Alexander University, Erlangen, Germany
| | - Eva Rothgang
- Department of Industrial Engineering and Health, Institute of Medical Engineering, Technical University Amberg-Weiden, Weiden, Germany
| |
Collapse
|
8
|
KILICARSLAN S, CELIK M, SAHIN Ş. Hybrid models based on genetic algorithm and deep learning algorithms for nutritional Anemia disease classification. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2020.102231] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
9
|
Sosa-Moreno A, Reinoso-González S, Mendez MA. Anemia in women of reproductive age in Ecuador: Data from a national survey. PLoS One 2020; 15:e0239585. [PMID: 32970743 PMCID: PMC7514054 DOI: 10.1371/journal.pone.0239585] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Accepted: 09/09/2020] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Anemia is a condition characterized by a decline in the number of functional red blood cells or hemoglobin. Women of reproductive age from low- and middle-income countries are at higher risk of anemia, which could lead to prenatal, obstetric and perinatal complications. The aim of our study was to explore associations between anemia status and a set of demographic, socio-economic and reproductive factors, among Ecuadorian women of reproductive age (WRA). METHODS We used data from non-pregnant, WRA (≥12 and ≤49 years) women enrolled in the nationally representative cross-sectional Ecuadorian National Health and Nutrition Survey 2012 (ENSANUT-ECU 2012). Anemia and moderate-severe anemia were assessed using hemoglobin concentrations cutoffs of <12 g/dL and <11 g/dL, respectively. Logistic regression was used to obtain unadjusted and adjusted prevalence odds ratios (aOR). All analyzes were adjusted for multi-stage sampling, stratification and clustering. RESULTS The study population included a subset of 7415 non-pregnant WRA. Mean hemoglobin concentration was 12.84 g/dL (95% CI = 12.8-12.9). The overall prevalence of anemia and moderate-severe anemia was 16.8% and 5.0%, respectively. Some factors were associated with an increase in anemia prevalence odds: living in Guayaquil (aOR 1.82, 95% CI 1.16-2.84) and Quito (aOR 1.84, 95% CI 1.17-2.90) compared to living in the rural Amazon, having given birth to more than four alive children compared with being nulliparous (aOR 1.85, 95% CI 1.00-3.43), currently taking contraceptives compared with former use (aOR 1.46, 95% CI 1.09-1.97). In addition, moderate-severe anemia was associated with age and region of residence. CONCLUSION In 2012, the prevalence of anemia among Ecuadorian WRA was considered a mild public health concern. However, we identified groups with higher anemia prevalence. Thus, emphasizing the importance of analyzing the prevalence in sub-populations of WRA and identifying populations where more frequent surveillance may be helpful.
Collapse
Affiliation(s)
- Andrea Sosa-Moreno
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, United States of America
| | - Sharon Reinoso-González
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Colegio de Ciencias e Ingenierías, Politécnico, Universidad San Francisco de Quito, Quito, Ecuador
| | - Miguel Angel Mendez
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Colegio de Ciencias e Ingenierías, Politécnico, Universidad San Francisco de Quito, Quito, Ecuador
| |
Collapse
|
10
|
Schwalbe N, Wahl B. Artificial intelligence and the future of global health. Lancet 2020; 395:1579-1586. [PMID: 32416782 PMCID: PMC7255280 DOI: 10.1016/s0140-6736(20)30226-9] [Citation(s) in RCA: 224] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/21/2019] [Revised: 01/21/2020] [Accepted: 01/22/2020] [Indexed: 02/07/2023]
Abstract
Concurrent advances in information technology infrastructure and mobile computing power in many low and middle-income countries (LMICs) have raised hopes that artificial intelligence (AI) might help to address challenges unique to the field of global health and accelerate achievement of the health-related sustainable development goals. A series of fundamental questions have been raised about AI-driven health interventions, and whether the tools, methods, and protections traditionally used to make ethical and evidence-based decisions about new technologies can be applied to AI. Deployment of AI has already begun for a broad range of health issues common to LMICs, with interventions focused primarily on communicable diseases, including tuberculosis and malaria. Types of AI vary, but most use some form of machine learning or signal processing. Several types of machine learning methods are frequently used together, as is machine learning with other approaches, most often signal processing. AI-driven health interventions fit into four categories relevant to global health researchers: (1) diagnosis, (2) patient morbidity or mortality risk assessment, (3) disease outbreak prediction and surveillance, and (4) health policy and planning. However, much of the AI-driven intervention research in global health does not describe ethical, regulatory, or practical considerations required for widespread use or deployment at scale. Despite the field remaining nascent, AI-driven health interventions could lead to improved health outcomes in LMICs. Although some challenges of developing and deploying these interventions might not be unique to these settings, the global health community will need to work quickly to establish guidelines for development, testing, and use, and develop a user-driven research agenda to facilitate equitable and ethical use.
Collapse
Affiliation(s)
- Nina Schwalbe
- Heilbrunn Department of Population and Family Health, Columbia Mailman School of Public Health, New York, NY, USA; Spark Street Advisors, New York, NY, USA.
| | - Brian Wahl
- Spark Street Advisors, New York, NY, USA; Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| |
Collapse
|