1
|
Li G, Wu N, Zhang J, Song Y, Ye T, Zhang Y, Zhao D, Yu P, Wang L, Zhuang C. Proximal humeral bone density assessment and prediction analysis using machine learning techniques: An innovative approach in medical research. Heliyon 2024; 10:e35451. [PMID: 39166094 PMCID: PMC11334883 DOI: 10.1016/j.heliyon.2024.e35451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 07/28/2024] [Accepted: 07/29/2024] [Indexed: 08/22/2024] Open
Abstract
Background Patients with fractures of the proximal humerus often local complications and failures attributed to osteoporosis. Currently, there is a lack of straight forward screening methods for assessing the extent of local osteoporosis in the proximal humerus. This study utilizes machine learning techniques to establish a diagnostic approach for evaluating local osteoporosis by analyzing the patient's demographic data, bone density, and X-ray ratio of the proximal humerus. Methods A cohort comprising a total of 102 hospitalized patients admitted during the period spanning from 2021 to 2023 underwent random selection procedures. Resulting in exclusion of 5 patients while enrolling 97 patients for analysis encompassing patient demographics, shoulder joint anteroposterior radiographs, and bone density information. Using the modified Tingart index methodology involving multiple measurements denoted as M1 through M4 obtained from humeral shafts. Within this cohort comprised 76 females (78.4 %) and 21 males (21.6 %), with an average age of 73.0 years (range: 43-98 years). There were 25 cases with normal bone density, 35 with osteopenia, and 37 with osteoporosis. Machine learning techniques were used to randomly divide the 97 cases into training (n = 59) and validation (n = 38) sets with a ratio of 6:4 using stratified random sampling. A decision tree model was built in the training set, and significant diagnostic indicators were selected, with the performance of the decision tree evaluated using the validation set. Multinomial logistic regression methods were used to verify the strength of the relationship between the selected indicators and osteoporosis. Results The decision tree identified significant diagnostic indicators as the humeral shaft medullary cavity ratio M2/M4, age, and gender. M2/M4 ≥ 1.13 can be used as an important screening criterion; M2/M4 < 1.13 was predicted as local osteoporosis; M2/M4 ≥ 1.13 and age ≥83 years were also predicted as osteoporosis. M2/M4 ≥ 1.13 and age <64 years or males aged between 64 and 83 years were predicted as the normal population; M2/M4 ≥ 1.13 and females aged between 64 and 83 years were predicted as having osteopenia. The decision tree's accuracy in the training set was 0.7627 (95 % CI (0.6341, 0.8638)), and its accuracy in the test set was 0.7895 (95 % CI (0.6268, 0.9045)). Multinomial logistic regression results showed that humeral shaft medullary cavity ratios M2/M4, age, and gender in X-ray images were significantly associated with the occurrence of osteoporosis. Conclusion Utilizing X-ray data of the proximal humerus in conjunction with demographic information such as gender and age enable the prediction of localized osteoporosis, facilitating physicians' rapid comprehension of osteoporosis in patients and optimization of clinical treatment plans. Level of evidence Level IV retrospective case study.
Collapse
Affiliation(s)
- Gen Li
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Nienju Wu
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Jiong Zhang
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Yanyan Song
- Department of Biostatistics, Clinical research institute, Shanghai JiaoTong University School of medicine, Shanghai, PR China
| | - Tingjun Ye
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Yin Zhang
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Dahang Zhao
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Pei Yu
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Lei Wang
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| | - Chengyu Zhuang
- Department of Orthopedics, Ruijin Hospital, Shanghai JiaoTong University School of Medicine, 197 Ruijin 2nd Road, Shanghai, 200025, PR China
| |
Collapse
|
2
|
Liu Y, Zhao H. Predicting synergistic effects between compounds through their structural similarity and effects on transcriptomes. Bioinformatics 2016; 32:3782-3789. [PMID: 27540269 DOI: 10.1093/bioinformatics/btw509] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2016] [Revised: 07/16/2016] [Accepted: 07/28/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Combinatorial therapies have been under intensive research for cancer treatment. However, due to the large number of possible combinations among candidate compounds, exhaustive screening is prohibitive. Hence, it is important to develop computational tools that can predict compound combination effects, prioritize combinations and limit the search space to facilitate and accelerate the development of combinatorial therapies. RESULTS In this manuscript we consider the NCI-DREAM Drug Synergy Prediction Challenge dataset to identify features informative about combination effects. Through systematic exploration of differential expression profiles after single compound treatments and comparison of molecular structures of compounds, we found that synergistic levels of combinations are statistically significantly associated with compounds' dissimilarity in structure and similarity in induced gene expression changes. These two types of features offer complementary information in predicting experimentally measured combination effects of compound pairs. Our findings offer insights on the mechanisms underlying different combination effects and may help prioritize promising combinations in the very large search space. AVAILABILITY AND IMPLEMENTATION The R code for the analysis is available on https://github.com/YiyiLiu1/DrugCombination CONTACT: hongyu.zhao@yale.eduSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yiyi Liu
- Department of Biostatistics, School of Public Health, Yale University New Haven, CT, 06520, USA
| | - Hongyu Zhao
- Department of Biostatistics, School of Public Health, Yale University New Haven, CT, 06520, USA.,Program of Computational Biology and Bioinformatics, CT0610, Yale University, New Haven, CT, 06511, USA
| |
Collapse
|
3
|
Li L, Yu S, Xiao W, Li Y, Huang L, Zheng X, Zhou S, Yang H. Sequence-based identification of recombination spots using pseudo nucleic acid representation and recursive feature extraction by linear kernel SVM. BMC Bioinformatics 2014; 15:340. [PMID: 25409550 PMCID: PMC4289199 DOI: 10.1186/1471-2105-15-340] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 09/29/2014] [Indexed: 02/08/2023] Open
Abstract
Background Identification of the recombination hot/cold spots is critical for understanding the mechanism of recombination as well as the genome evolution process. However, experimental identification of recombination spots is both time-consuming and costly. Developing an accurate and automated method for reliably and quickly identifying recombination spots is thus urgently needed. Results Here we proposed a novel approach by fusing features from pseudo nucleic acid composition (PseNAC), including NAC, n-tier NAC and pseudo dinucleotide composition (PseDNC). A recursive feature extraction by linear kernel support vector machine (SVM) was then used to rank the integrated feature vectors and extract optimal features. SVM was adopted for identifying recombination spots based on these optimal features. To evaluate the performance of the proposed method, jackknife cross-validation test was employed on a benchmark dataset. The overall accuracy of this approach was 84.09%, which was higher (from 0.37% to 3.79%) than those of state-of-the-art tools. Conclusions Comparison results suggested that linear kernel SVM is a useful vehicle for identifying recombination hot/cold spots.
Collapse
Affiliation(s)
| | | | | | | | | | - Xiaoqi Zheng
- Department of General Surgery, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, China.
| | | | | |
Collapse
|