Wu D, Cui G, Huang X, Chen Y, Liu G, Ren L, Li Y. An accurate and explainable ensemble learning method for carotid plaque prediction in an asymptomatic population.
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022;
221:106842. [PMID:
35569238 DOI:
10.1016/j.cmpb.2022.106842]
[Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 04/17/2022] [Accepted: 04/26/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE
The identification of carotid plaque, one of the most crucial tasks in stroke screening, is of great significance in the assessment of subclinical atherosclerosis and preventing the onset of stroke. However, traditional ultrasound examination is not prevalent or cost-effective for asymptomatic people, particularly low-income individuals in rural areas. Thus, it is necessary to develop an accurate and explainable model for early identification of the risk of plaque prevalence that can help in the primary prevention of stroke.
METHODS
We developed an ensemble learning method to predict the occurrence of carotid plaques. A dataset comprising 1440 subjects (50% with plaques and 50% without plaques) and ten-fold cross-validation were utilized to evaluate the model performance. Four machine learning methods (extreme gradient boosting (XGBoost), gradient boosting decision tree, random forest, and support vector machine) were evaluated. Subsequently, the interpretability of the XGBoost model, which provided the best performance, was analyzed from three aspects: feature importance, feature effect on prediction model, and feature effect on prediction decision for a specific subject.
RESULTS
The XGBoost algorithm provided the best performance (sensitivity: 0.8678, specificity: 0.8592, accuracy: 0.8632, F1 score: 0.8621, area under the curve: 0.8635) in carotid plaque prediction and also had excellent performance under missing data circumstances. Further, interpretability analysis showed that the decisions of the XGBoost model were highly congruent with clinical knowledge.
CONCLUSION
The model results are superior to those of state-of-the-art methods. Thus, it is a promising carotid plaque prediction tool that could be used in the primary prevention of stroke.
Collapse