1
|
Rojas-López AG, Rodríguez-Molina A, Uriarte-Arcia AV, Villarreal-Cervantes MG. Vertebral Column Pathology Diagnosis Using Ensemble Strategies Based on Supervised Machine Learning Techniques. Healthcare (Basel) 2024; 12:1324. [PMID: 38998860 PMCID: PMC11241707 DOI: 10.3390/healthcare12131324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Revised: 06/25/2024] [Accepted: 06/28/2024] [Indexed: 07/14/2024] Open
Abstract
One expanding area of bioinformatics is medical diagnosis through the categorization of biomedical characteristics. Automatic medical strategies to boost the diagnostic through machine learning (ML) methods are challenging. They require a formal examination of their performance to identify the best conditions that enhance the ML method. This work proposes variants of the Voting and Stacking (VC and SC) ensemble strategies based on diverse auto-tuning supervised machine learning techniques to increase the efficacy of traditional baseline classifiers for the automatic diagnosis of vertebral column orthopedic illnesses. The ensemble strategies are created by first combining a complete set of auto-tuned baseline classifiers based on different processes, such as geometric, probabilistic, logic, and optimization. Next, the three most promising classifiers are selected among k-Nearest Neighbors (kNN), Naïve Bayes (NB), Logistic Regression (LR), Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), Support Vector Machine (SVM), Artificial Neural Networks (ANN), and Decision Tree (DT). The grid-search K-Fold cross-validation strategy is applied to auto-tune the baseline classifier hyperparameters. The performances of the proposed ensemble strategies are independently compared with the auto-tuned baseline classifiers. A concise analysis evaluates accuracy, precision, recall, F1-score, and ROC-ACU metrics. The analysis also examines the misclassified disease elements to find the most and least reliable classifiers for this specific medical problem. The results show that the VC ensemble strategy provides an improvement comparable to that of the best baseline classifier (the kNN). Meanwhile, when all baseline classifiers are included in the SC ensemble, this strategy surpasses 95% in all the evaluated metrics, standing out as the most suitable option for classifying vertebral column diseases.
Collapse
Affiliation(s)
- Alam Gabriel Rojas-López
- Optimal Mechatronic Design Laboratory, Postgraduate Department, Instituto Politécnico Nacional—Centro de Innovación y Desarrollo Tecnológico en Cómputo, Mexico City 07700, Mexico; (A.G.R.-L.); (A.V.U.-A.)
| | | | - Abril Valeria Uriarte-Arcia
- Optimal Mechatronic Design Laboratory, Postgraduate Department, Instituto Politécnico Nacional—Centro de Innovación y Desarrollo Tecnológico en Cómputo, Mexico City 07700, Mexico; (A.G.R.-L.); (A.V.U.-A.)
| | - Miguel Gabriel Villarreal-Cervantes
- Optimal Mechatronic Design Laboratory, Postgraduate Department, Instituto Politécnico Nacional—Centro de Innovación y Desarrollo Tecnológico en Cómputo, Mexico City 07700, Mexico; (A.G.R.-L.); (A.V.U.-A.)
| |
Collapse
|
2
|
Masseran N, Safari MAM, Tajuddin RRM. Probabilistic classification of the severity classes of unhealthy air pollution events. ENVIRONMENTAL MONITORING AND ASSESSMENT 2024; 196:523. [PMID: 38717514 DOI: 10.1007/s10661-024-12700-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 04/30/2024] [Indexed: 06/21/2024]
Abstract
Air pollution events can be categorized as extreme or non-extreme on the basis of their magnitude of severity. High-risk extreme air pollution events will exert a disastrous effect on the environment. Therefore, public health and policy-making authorities must be able to determine the characteristics of these events. This study proposes a probabilistic machine learning technique for predicting the classification of extreme and non-extreme events on the basis of data features to address the above issue. The use of the naïve Bayes model in the prediction of air pollution classes is proposed to leverage its simplicity as well as high accuracy and efficiency. A case study was conducted on the air pollution index data of Klang, Malaysia, for the period of January 01, 1997, to August 31, 2020. The trained naïve Bayes model achieves high accuracy, sensitivity, and specificity on the training and test datasets. Therefore, the naïve Bayes model can be easily applied in air pollution analysis while providing a promising solution for the accurate and efficient prediction of extreme or non-extreme air pollution events. The findings of this study provide reliable information to public authorities for monitoring and managing sustainable air quality over time.
Collapse
Affiliation(s)
- Nurulkamal Masseran
- Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.
| | - Muhammad Aslam Mohd Safari
- Department of Mathematics and Statistics, Faculty of Science, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
| | - Razik Ridzuan Mohd Tajuddin
- Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia
| |
Collapse
|
3
|
Santos DA, Zhang L, Do KA, Bednarski BK, Robinson Ledet C, Limmer A, Gibson H, You YN. Chemotherapy and Abdominal Wall Closure Technique Increase the Probability of Postoperative Ventral Incisional Hernia in Patients With Colon Cancer. Am Surg 2023; 89:98-107. [PMID: 33877925 DOI: 10.1177/00031348211011149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
BACKGROUND Chemotherapy is associated with postoperative ventral incisional hernia (PVIH) after right hemicolectomy (RHC) for colon cancer, and abdominal wall closure technique may affect PVIH. We sought to identify clinical predictors of PVIH. METHODS We retrospectively analyzed patients who underwent RHC for colon cancer from 2008-2018 and later developed PVIH. Time to PVIH was analyzed with Kaplan-Meier analysis, clinical predictors were identified with multivariable Cox proportional hazards modeling, and the probability of PVIH given chemotherapy and the suture technique was estimated with Bayesian analysis. RESULTS We identified 399 patients (209 no adjuvant chemotherapy and 190 adjuvant chemotherapy), with an overall PVIH rate of 38%. The 5-year PVIH rate was 55% for adjuvant chemotherapy, compared with 38% for none (log-rank P < .05). Adjuvant chemotherapy (hazard ratio [HR] 1.65, 95% confidence interval [CI] 1.18-2.31, P < .01), age (HR .99, 95% CI .97-1.00, P < .01), body mass index (HR 1.02, 95% CI 1.00-1.04, P < .01), and neoadjuvant chemotherapy (HR 1.92, 95% CI 1.21-3.00, P < .01) were independently associated with PVIH. Postoperative ventral incisional hernia was more common overall in patients who received adjuvant chemotherapy (46% compared with 30%, P < .01). In patients who received adjuvant chemotherapy, the probability of PVIH for incision closure with #1 running looped polydioxanone was 42%, compared with 59% for incision closure with #0 single interrupted polyglactin 910. DISCUSSION Exposure to chemotherapy increases the probability of PVIH after RHC, and non-short stitch incision closure further increases this probability, more so than age or body mass index. The suture technique deserves further study as a modifiable factor in this high-risk population.
Collapse
Affiliation(s)
- David A Santos
- Department of Surgical Oncology, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Liangliang Zhang
- Department of Biostatistics, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Kim-Anh Do
- Department of Biostatistics, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Brian K Bednarski
- Department of Surgical Oncology, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Celia Robinson Ledet
- Department of Surgical Oncology, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Angela Limmer
- Department of Surgical Oncology, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Heather Gibson
- Department of Surgical Oncology, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Y Nancy You
- Department of Surgical Oncology, 4002The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| |
Collapse
|
4
|
Huang Z, Kang M, Li G, Xiong P, Chen H, Kang L, Li S, Lu C, Li Q, Bai M. Predictive effect of Bayes discrimination in the level of serum protein factors and cognitive dysfunction in schizophrenia. J Psychiatr Res 2022; 151:539-545. [PMID: 35636029 DOI: 10.1016/j.jpsychires.2022.05.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 03/23/2022] [Accepted: 05/09/2022] [Indexed: 01/19/2023]
Abstract
Finding molecular biomarkers that can be related to the degree of cognitive dysfunction in schizophrenia remains a challenge. The levels of 6 Serum Protein Factors (NGF, BDNF, IL-6, TNF-α, S100β, GFAP) in peripheral blood of patients with schizophrenia were measured. The cognitive function of patients with schizophrenia was assessed by MATRICS Consensus Cognitive Battery (MCCB), a systematic assessment tool of international gold standard for cognitive function assessment of schizophrenia. To explore the correlation between these 6 biomarkers and the degree of cognitive dysfunction in schizophrenia,78 schizophrenic patients and 71 healthy controls were included in the study. The serum concentrations of BDNF and GFAP were lower in the patient group, but the concentrations of IL-6, TNF-α and S100β were higher. The speed of information processing, word learning, reasoning and problem solving, visual learning T-score of the patient group were lower than the control group. Bayes discriminant function model has a high correct discriminant rate for the severity of cognitive dysfunction in schizophrenia. The level of serum protein factor and clinical symptom score of schizophrenia may forecast the degree of cognitive dysfunction, which is expected to be a potential biomarker to identify the degree of cognitive dysfunction of schizophrenia, and provide objective basis for the clinical diagnosis and treatment of patients with schizophrenia.
Collapse
Affiliation(s)
- Zhengyuan Huang
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Minmin Kang
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China; Department of Psychiatry, The Affiliated Hospital of Hubei Minzu University, #39 Xueyuan Road, Enshi, Hubei,445000, China
| | - Guangyu Li
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Peng Xiong
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China; Yunnan Clinical Research Center for Mental Health, Kunming, Yunnan, 650032, China.
| | - Hongxu Chen
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Lin Kang
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Shan Li
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Cailian Lu
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Qianqian Li
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| | - Meiyan Bai
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical University, #295 Xichang, Road, Kunming, Yunnan, 650032, China
| |
Collapse
|
5
|
Tarimo CS, Bhuyan SS, Li Q, Mahande MJJ, Wu J, Fu X. Validating machine learning models for the prediction of labour induction intervention using routine data: a registry-based retrospective cohort study at a tertiary hospital in northern Tanzania. BMJ Open 2021; 11:e051925. [PMID: 34857568 PMCID: PMC8647548 DOI: 10.1136/bmjopen-2021-051925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
OBJECTIVES We aimed at identifying the important variables for labour induction intervention and assessing the predictive performance of machine learning algorithms. SETTING We analysed the birth registry data from a referral hospital in northern Tanzania. Since July 2000, every birth at this facility has been recorded in a specific database. PARTICIPANTS 21 578 deliveries between 2000 and 2015 were included. Deliveries that lacked information regarding the labour induction status were excluded. PRIMARY OUTCOME Deliveries involving labour induction intervention. RESULTS Parity, maternal age, body mass index, gestational age and birth weight were all found to be important predictors of labour induction. Boosting method demonstrated the best discriminative performance (area under curve, AUC=0.75: 95% CI (0.73 to 0.76)) while logistic regression presented the least (AUC=0.71: 95% CI (0.70 to 0.73)). Random forest and boosting algorithms showed the highest net-benefits as per the decision curve analysis. CONCLUSION All of the machine learning algorithms performed well in predicting the likelihood of labour induction intervention. Further optimisation of these classifiers through hyperparameter tuning may result in an improved performance. Extensive research into the performance of other classifier algorithms is warranted.
Collapse
Affiliation(s)
- Clifford Silver Tarimo
- College of Public Health, Zhengzhou University, Zhengzhou, China
- Science and Laboratory Technology, Dar es Salaam Institute of Technology, Dar es Salaam, Tanzania, United Republic of
| | - Soumitra S Bhuyan
- School of Planning and Public Policy, Rutgers University-New Brunswick, New York, New York, USA
| | - Quanman Li
- College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Michael Johnson J Mahande
- Institute of Public Health, Kilimanjaro Christian Medical University College, Moshi, Tanzania, United Republic of
| | - Jian Wu
- College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Xiaoli Fu
- College of Public Health, Zhengzhou University, Zhengzhou, China
| |
Collapse
|
6
|
Shen Y, Li Y, Zheng HT, Tang B, Yang M. Enhancing ontology-driven diagnostic reasoning with a symptom-dependency-aware Naïve Bayes classifier. BMC Bioinformatics 2019; 20:330. [PMID: 31196129 PMCID: PMC6567606 DOI: 10.1186/s12859-019-2924-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 05/31/2019] [Indexed: 11/10/2022] Open
Abstract
Background Ontology has attracted substantial attention from both academia and industry. Handling uncertainty reasoning is important in researching ontology. For example, when a patient is suffering from cirrhosis, the appearance of abdominal vein varices is four times more likely than the presence of bitter taste. Such medical knowledge is crucial for decision-making in various medical applications but is missing from existing medical ontologies. In this paper, we aim to discover medical knowledge probabilities from electronic medical record (EMR) texts to enrich ontologies. First, we build an ontology by identifying meaningful entity mentions from EMRs. Then, we propose a symptom-dependency-aware naïve Bayes classifier (SDNB) that is based on the assumption that there is a level of dependency among symptoms. To ensure the accuracy of the diagnostic classification, we incorporate the probability of a disease into the ontology via innovative approaches. Results We conduct a series of experiments to evaluate whether the proposed method can discover meaningful and accurate probabilities for medical knowledge. Based on over 30,000 deidentified medical records, we explore 336 abdominal diseases and 81 related symptoms. Among these 336 gastrointestinal diseases, the probabilities of 31 diseases are obtained via our method. These 31 probabilities of diseases and 189 conditional probabilities between diseases and the symptoms are added into the generated ontology. Conclusion In this paper, we propose a medical knowledge probability discovery method that is based on the analysis and extraction of EMR text data for enriching a medical ontology with probability information. The experimental results demonstrate that the proposed method can effectively identify accurate medical knowledge probability information from EMR data. In addition, the proposed method can efficiently and accurately calculate the probability of a patient suffering from a specified disease, thereby demonstrating the advantage of combining an ontology and a symptom-dependency-aware naïve Bayes classifier.
Collapse
Affiliation(s)
- Ying Shen
- School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, People's Republic of China
| | | | - Hai-Tao Zheng
- School of Information Science and Technology, Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, People's Republic of China
| | - Buzhou Tang
- Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, People's Republic of China
| | - Min Yang
- SIAT, Chinese Academy of Sciences, Shenzhen, 518055, People's Republic of China.
| |
Collapse
|
7
|
Garosi Y, Sheklabadi M, Conoscenti C, Pourghasemi HR, Van Oost K. Assessing the performance of GIS- based machine learning models with different accuracy measures for determining susceptibility to gully erosion. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 664:1117-1132. [PMID: 30901785 DOI: 10.1016/j.scitotenv.2019.02.093] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Revised: 02/05/2019] [Accepted: 02/05/2019] [Indexed: 05/27/2023]
Abstract
The main purpose was to compare discrimination and reliability of four machine learning models to create gully erosion susceptibility map (GESM) in a part of Ekbatan Dam Basin, Hamedan, western Iran. Extensive field surveys using GPS, and the visual interpretation of satellite images, used to prepare a digital map of the spatial distribution of gullies. 130 locations were sampled to elucidate the spatial distribution of the soil surface properties. Topographic attributes were provided from digital elevation model (DEM). The land use and normalized difference vegetation index (NDVI) maps were created by satellite imagery. The functional relationships between gully erosion and controlling factors were calculated using the random forest (RF), support vector machine (SVM), Naïve Bayes (NB), and generalized additive model (GAM) models. The performance of models was evaluated by 10-fold cross-validation based on efficiency, Kappa coefficient, receiver operating characteristic curve (ROC), mean absolute error (MAE), and root mean square error (RMSE). The results showed that the RF model had the highest amount of efficiency, Kappa coefficient, and AUC and the lowest amounts of MAE and RMSE compared with SVM, NB, and GAM. The RF model showed the highest predictive performance (mean AUC = 92.4%), followed by SVM (mean AUC = 90.9%), GAM (mean AUC = 89.9%), and NB (mean AUC = 87.2%) models. Overall accuracy of the models ranged from excellent (NB, GAM) to outstanding (RF, SVM) classes. The capacity of all models for creating GESM was quite stable when the calibration and validation samples were changed through10-fold cross-validation technique. According to variable importance analysis performed by RF model, the most important variables are distance from rivers, calcium carbonate equivalent (CCE), and topographic position index (TPI). The obtained maps can help identifying areas at risk of gully erosion and facilitate the implementation of plans for soil conservation and sustainable management.
Collapse
Affiliation(s)
- Younes Garosi
- Faculty of Agriculture, Department of Soil Science, Bu Ali Sina University, Ahmadi Roshan Avenue, 6517838695 Hamedan, Iran
| | - Mohsen Sheklabadi
- Faculty of Agriculture, Department of Soil Science, Bu Ali Sina University, Ahmadi Roshan Avenue, 6517838695 Hamedan, Iran.
| | - Christian Conoscenti
- Department of Earth and Sea Sciences (DISTEM), University of Palermo, Via Archirafi 22, 90123 Palermo, Italy
| | - Hamid Reza Pourghasemi
- College of Marine Sciences and Engineering, Nanjing Normal University, Nanjing, 210023, China; Department of Natural Resources and Environmental Engineering, College of Agriculture, Shiraz University, Shiraz, Iran
| | - Kristof Van Oost
- A- Fonds de la recherche Scientifique, FNRS Rue d'Egmont 5, 1000 Brussels, Belgium; B-TECLIM - Georges Lemaître Centre for Earth and Climate Research, Université catholique de Louvain, BE-1348 Louvain-la-Neuve, Belgium
| |
Collapse
|
8
|
Guan M, Cho S, Petro R, Zhang W, Pasche B, Topaloglu U. Natural language processing and recurrent network models for identifying genomic mutation-associated cancer treatment change from patient progress notes. JAMIA Open 2019; 2:139-149. [PMID: 30944913 PMCID: PMC6435007 DOI: 10.1093/jamiaopen/ooy061] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2018] [Revised: 11/26/2018] [Accepted: 12/21/2018] [Indexed: 01/16/2023] Open
Abstract
Objectives Natural language processing (NLP) and machine learning approaches were used to build classifiers to identify genomic-related treatment changes in the free-text visit progress notes of cancer patients. Methods We obtained 5889 deidentified progress reports (2439 words on average) for 755 cancer patients who have undergone a clinical next generation sequencing (NGS) testing in Wake Forest Baptist Comprehensive Cancer Center for our data analyses. An NLP system was implemented to process the free-text data and extract NGS-related information. Three types of recurrent neural network (RNN) namely, gated recurrent unit, long short-term memory (LSTM), and bidirectional LSTM (LSTM_Bi) were applied to classify documents to the treatment-change and no-treatment-change groups. Further, we compared the performances of RNNs to 5 machine learning algorithms including Naive Bayes, K-nearest Neighbor, Support Vector Machine for classification, Random forest, and Logistic Regression. Results Our results suggested that, overall, RNNs outperformed traditional machine learning algorithms, and LSTM_Bi showed the best performance among the RNNs in terms of accuracy, precision, recall, and F1 score. In addition, pretrained word embedding can improve the accuracy of LSTM by 3.4% and reduce the training time by more than 60%. Discussion and Conclusion NLP and RNN-based text mining solutions have demonstrated advantages in information retrieval and document classification tasks for unstructured clinical progress notes.
Collapse
Affiliation(s)
- Meijian Guan
- Department of Computer Science, Wake Forest University, Winston-Salem, North Carolina, USA.,Wake Forest Baptist Comprehensive Cancer Center, Winston Salem, North Carolina, USA
| | - Samuel Cho
- Department of Computer Science, Wake Forest University, Winston-Salem, North Carolina, USA.,Department of Physics, Wake Forest University, Winston-Salem, North Carolina, USA
| | - Robin Petro
- Wake Forest Baptist Comprehensive Cancer Center, Winston Salem, North Carolina, USA
| | - Wei Zhang
- Wake Forest Baptist Comprehensive Cancer Center, Winston Salem, North Carolina, USA.,Department of Cancer Biology, Wake Forest School of Medicine, Winston Salem, North Carolina, USA
| | - Boris Pasche
- Wake Forest Baptist Comprehensive Cancer Center, Winston Salem, North Carolina, USA.,Department of Cancer Biology, Wake Forest School of Medicine, Winston Salem, North Carolina, USA
| | - Umit Topaloglu
- Wake Forest Baptist Comprehensive Cancer Center, Winston Salem, North Carolina, USA.,Department of Cancer Biology, Wake Forest School of Medicine, Winston Salem, North Carolina, USA
| |
Collapse
|
9
|
Verma L, Srivastava S, Negi PC. An intelligent noninvasive model for coronary artery disease detection. COMPLEX INTELL SYST 2017. [DOI: 10.1007/s40747-017-0048-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
10
|
Pruengkarn R, Wong KW, Fung CC. A Review of Data Mining Techniques and Applications. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2017. [DOI: 10.20965/jaciii.2017.p0031] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Data mining is the analytics and knowledge discovery process of analyzing large volumes of data from various sources and transforming the data into useful information. Various disciplines have contributed to its development and is becoming increasingly important in the scientific and industrial world. This article presents a review of data mining techniques and applications from 1996 to 2016. Techniques are divided into two main categories: predictive methods and descriptive methods. Due to the huge number of publications available on this topic, only a selected number are used in this review to highlight the developments of the past 20 years. Applications are included to provide some insights into how each data mining technique has evolved over the last two decades. Recent research trends focus more on large data sets and big data. Recently there have also been more applications in area of health informatics with the advent of newer algorithms.
Collapse
|
11
|
Langarizadeh M, Moghbeli F. Applying Naive Bayesian Networks to Disease Prediction: a Systematic Review. Acta Inform Med 2016; 24:364-369. [PMID: 28077895 PMCID: PMC5203736 DOI: 10.5455/aim.2016.24.364-369] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2016] [Accepted: 10/11/2016] [Indexed: 12/15/2022] Open
Abstract
INTRODUCTION Naive Bayesian networks (NBNs) are one of the most effective and simplest Bayesian networks for prediction. OBJECTIVE This paper aims to review published evidence about the application of NBNs in predicting disease and it tries to show NBNs as the fundamental algorithm for the best performance in comparison with other algorithms. METHODS PubMed was electronically checked for articles published between 2005 and 2015. For characterizing eligible articles, a comprehensive electronic searching method was conducted. Inclusion criteria were determined based on NBN and its effects on disease prediction. A total of 99 articles were found. After excluding the duplicates (n= 5), the titles and abstracts of 94 articles were skimmed according to the inclusion criteria. Finally, 38 articles remained. They were reviewed in full text and 15 articles were excluded. Eventually, 23 articles were selected which met our eligibility criteria and were included in this study. RESULT In this article, the use of NBN in predicting diseases was described. Finally, the results were reported in terms of Accuracy, Sensitivity, Specificity and Area under ROC curve (AUC). The last column in Table 2 shows the differences between NBNs and other algorithms. DISCUSSION This systematic review (23 studies, 53,725 patients) indicates that predicting diseases based on a NBN had the best performance in most diseases in comparison with the other algorithms. Finally in most cases NBN works better than other algorithms based on the reported accuracy. CONCLUSION The method, termed NBNs is proposed and can efficiently construct a prediction model for disease.
Collapse
Affiliation(s)
- Mostafa Langarizadeh
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Fateme Moghbeli
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
12
|
Zhou M, Chaudhury B, Hall LO, Goldgof DB, Gillies RJ, Gatenby RA. Identifying spatial imaging biomarkers of glioblastoma multiforme for survival group prediction. J Magn Reson Imaging 2016; 46:115-123. [PMID: 27678245 DOI: 10.1002/jmri.25497] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 09/14/2016] [Indexed: 11/06/2022] Open
Abstract
PURPOSE Glioblastoma multiforme (GBM) is the most common malignant brain tumor in adults. Most GBMs exhibit extensive regional heterogeneity at tissue, cellular, and molecular scales, but the clinical relevance of the observed spatial imaging characteristics remains unknown. We investigated pretreatment magnetic resonance imaging (MRI) scans of GBMs to identify tumor subregions and quantify their image-based spatial characteristics that are associated with survival time. MATERIALS AND METHODS We quantified tumor subregions (termed habitats) in GBMs, which are hypothesized to capture intratumoral characteristics using multiple MRI sequences. For proof-of-concept, we developed a computational framework that used intratumoral grouping and spatial mapping to identify GBM tumor subregions and yield habitat-based features. Using a feature selector and three classifiers, experimental results from two datasets are reported, including Dataset1 with 32 GBM patients (594 tumor slices) and Dataset2 with 22 GBM patients, who did not undergo resection (261 tumor slices) for survival group prediction. RESULTS In both datasets, we show that habitat-based features achieved 87.50% and 86.36% accuracies for survival group prediction, respectively, using leave-one-out cross-validation. Experimental results revealed that spatially correlated features between signal-enhanced subregions were effective for predicting survival groups (P < 0.05 for all three machine-learning classifiers). CONCLUSION The quantitative spatial-correlated features derived from MRI-defined tumor subregions in GBM could be effectively used to predict the survival time of patients. LEVEL OF EVIDENCE 2 J. MAGN. RESON. IMAGING 2017;46:115-123.
Collapse
Affiliation(s)
- Mu Zhou
- Stanford Center for Biomedical Informatics, Stanford University, Stanford, California, USA
| | - Baishali Chaudhury
- Department of Radiology, H. Lee Moffitt Cancer and Research Institute, Tampa, Florida, USA
| | - Lawrence O Hall
- Department of Computer Science and Engineering, University of South Florida, Tampa, Florida, USA
| | - Dmitry B Goldgof
- Department of Computer Science and Engineering, University of South Florida, Tampa, Florida, USA
| | - Robert J Gillies
- Department of Radiology, H. Lee Moffitt Cancer and Research Institute, Tampa, Florida, USA
| | - Robert A Gatenby
- Department of Radiology, H. Lee Moffitt Cancer and Research Institute, Tampa, Florida, USA
| |
Collapse
|
13
|
Predicting distant failure in early stage NSCLC treated with SBRT using clinical parameters. Radiother Oncol 2016; 119:501-4. [PMID: 27156652 DOI: 10.1016/j.radonc.2016.04.029] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Revised: 03/30/2016] [Accepted: 04/16/2016] [Indexed: 01/02/2023]
Abstract
PURPOSE/OBJECTIVE The aim of this study is to predict early distant failure in early stage non-small cell lung cancer (NSCLC) treated with stereotactic body radiation therapy (SBRT) using clinical parameters by machine learning algorithms. MATERIALS/METHODS The dataset used in this work includes 81 early stage NSCLC patients with at least 6months of follow-up who underwent SBRT between 2006 and 2012 at a single institution. The clinical parameters (n=18) for each patient include demographic parameters, tumor characteristics, treatment fraction schemes, and pretreatment medications. Three predictive models were constructed based on different machine learning algorithms: (1) artificial neural network (ANN), (2) logistic regression (LR) and (3) support vector machine (SVM). Furthermore, to select an optimal clinical parameter set for the model construction, three strategies were adopted: (1) clonal selection algorithm (CSA) based selection strategy; (2) sequential forward selection (SFS) method; and (3) statistical analysis (SA) based strategy. 5-cross-validation is used to validate the performance of each predictive model. The accuracy was assessed by area under the receiver operating characteristic (ROC) curve (AUC), sensitivity and specificity of the system was also evaluated. RESULTS The AUCs for ANN, LR and SVM were 0.75, 0.73, and 0.80, respectively. The sensitivity values for ANN, LR and SVM were 71.2%, 72.9% and 83.1%, while the specificity values for ANN, LR and SVM were 59.1%, 63.6% and 63.6%, respectively. Meanwhile, the CSA based strategy outperformed SFS and SA in terms of AUC, sensitivity and specificity. CONCLUSIONS Based on clinical parameters, the SVM with the CSA optimal parameter set selection strategy achieves better performance than other strategies for predicting distant failure in lung SBRT patients.
Collapse
|
14
|
Interactive Decision-Support Tool for Risk-Based Radiation Therapy Plan Comparison for Hodgkin Lymphoma. Int J Radiat Oncol Biol Phys 2015; 91:683. [DOI: 10.1016/j.ijrobp.2014.10.051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Revised: 10/27/2014] [Accepted: 10/27/2014] [Indexed: 11/22/2022]
|
15
|
Bandyopadhyay S, Wolfson J, Vock DM, Vazquez-Benitez G, Adomavicius G, Elidrisi M, Johnson PE, O’Connor PJ. Data mining for censored time-to-event data: a Bayesian network model for predicting cardiovascular risk from electronic health record data. Data Min Knowl Discov 2014. [DOI: 10.1007/s10618-014-0386-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
16
|
Piotrowski T, Jodda A. How to compare treatment plans? Personalized perspective. Rep Pract Oncol Radiother 2014; 20:77-8. [PMID: 25859395 DOI: 10.1016/j.rpor.2014.04.011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Indexed: 11/27/2022] Open
Affiliation(s)
- Tomasz Piotrowski
- Department of Medical Physics, Greater Poland Cancer Centre, University of Medical Sciences, Garbary 15, 61-866 Poznan, Poland ; Department of Electroradiology, University of Medical Sciences, Garbary 15, 61-866 Poznan, Poland
| | - Agata Jodda
- Department of Medical Physics, Greater Poland Cancer Centre, University of Medical Sciences, Garbary 15, 61-866 Poznan, Poland
| |
Collapse
|
17
|
Tsolaki E, Svolos P, Kousi E, Kapsalaki E, Fezoulidis I, Fountas K, Theodorou K, Kappas C, Tsougos I. Fast spectroscopic multiple analysis (FASMA) for brain tumor classification: a clinical decision support system utilizing multi-parametric 3T MR data. Int J Comput Assist Radiol Surg 2014; 10:1149-66. [PMID: 25024116 DOI: 10.1007/s11548-014-1088-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 05/05/2014] [Indexed: 01/14/2023]
Abstract
INTRODUCTION A clinical decision support system (CDSS) for brain tumor classification can be used to assist in the diagnosis and grading of brain tumors. A Fast Spectroscopic Multiple Analysis (FASMA) system that uses combinations of multiparametric MRI data sets was developed as a CDSS for brain tumor classification. METHODS MRI metabolic ratios and spectra, from long and short TE, respectively, as well as diffusion and perfusion data were acquired from the intratumoral and peritumoral area of 126 patients with untreated intracranial tumors. These data were categorized based on the pathology, and different machine learning methods were evaluated regarding their classification performance for glioma grading and differentiation of infiltrating versus non-infiltrating lesions. Additional databases were embedded to the system, including updated literature values of the related MR parameters and typical tumor characteristics (imaging and histological), for further comparisons. Custom Graphical User Interface (GUI) layouts were developed to facilitate classification of the unknown cases based on the user's available MR data. RESULTS The highest classification performance was achieved with a support vector machine (SVM) using the combination of all MR features. FASMA correctly classified 89 and 79% in the intratumoral and peritumoral area, respectively, for cases from an independent test set. FASMA produced the correct diagnosis, even in the misclassified cases, since discrimination between infiltrative versus non-infiltrative cases was possible. CONCLUSIONS FASMA is a prototype CDSS, which integrates complex quantitative MR data for brain tumor characterization. FASMA was developed as a diagnostic assistant that provides fast analysis, representation and classification for a set of MR parameters. This software may serve as a teaching tool on advanced MRI techniques, as it incorporates additional information regarding typical tumor characteristics derived from the literature.
Collapse
Affiliation(s)
- Evangelia Tsolaki
- Medical Physics Department, Medical School, University of Thessaly, 41110 , Biopolis, Larissa, Greece
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Beam orientation in stereotactic radiosurgery using an artificial neural network. Radiother Oncol 2014; 111:296-300. [DOI: 10.1016/j.radonc.2014.03.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Revised: 03/16/2014] [Accepted: 03/19/2014] [Indexed: 01/06/2023]
|
19
|
Svolos P, Tsolaki E, Theodorou K, Fountas K, Kapsalaki E, Fezoulidis I, Tsougos I. Classification methods for the differentiation of atypical meningiomas using diffusion and perfusion techniques at 3-T MRI. Clin Imaging 2013; 37:856-64. [PMID: 23849831 DOI: 10.1016/j.clinimag.2013.03.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 03/21/2013] [Indexed: 10/26/2022]
Abstract
The purpose was to investigate the contribution of machine learning algorithms using diffusion and perfusion techniques in the differentiation of atypical meningiomas from glioblastomas and metastases. Apparent diffusion coefficient, fractional anisotropy, and relative cerebral blood volume were measured in different tumor regions. Naive Bayes, k-Nearest Neighbor, and Support Vector Machine classifiers were used in the classification procedure. The application of classification methods adds incremental differential diagnostic value. Differentiation is mainly achieved using diffusion metrics, while perfusion measurements may provide significant information for the peritumoral regions.
Collapse
Affiliation(s)
- Patricia Svolos
- Medical Physics Department, University of Thessaly, Biopolis, 41110, Larissa, Greece
| | | | | | | | | | | | | |
Collapse
|
20
|
Tsolaki E, Svolos P, Kousi E, Kapsalaki E, Fountas K, Theodorou K, Tsougos I. Automated differentiation of glioblastomas from intracranial metastases using 3T MR spectroscopic and perfusion data. Int J Comput Assist Radiol Surg 2013; 8:751-61. [PMID: 23334798 DOI: 10.1007/s11548-012-0808-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Accepted: 12/17/2012] [Indexed: 01/14/2023]
Abstract
PURPOSE Differentiation of glioblastomas from metastases is clinical important, but may be difficult even for expert observers. To investigate the contribution of machine learning algorithms in the differentiation of glioblastomas multiforme (GB) from metastases, we developed and tested a pattern recognition system based on 3T magnetic resonance (MR) data. MATERIALS AND METHODS Single and multi-voxel proton magnetic resonance spectroscopy (1H-MRS) and dynamic susceptibility contrast (DSC) MRI scans were performed on 49 patients with solitary brain tumors (35 glioblastoma multiforme and 14 metastases). Metabolic (NAA/Cr, Cho/Cr, (Lip [Formula: see text] Lac)/Cr) and perfusion (rCBV) parameters were measured in both intratumoral and peritumoral regions. The statistical significance of these parameters was evaluated. For the classification procedure, three datasets were created to find the optimum combination of parameters that provides maximum differentiation. Three machine learning methods were utilized: Naïve-Bayes, Support Vector Machine (SVM) and [Formula: see text]-nearest neighbor (KNN). The discrimination ability of each classifier was evaluated with quantitative performance metrics. RESULTS Glioblastoma and metastases were differentiable only in the peritumoral region of these lesions ([Formula: see text]). SVM achieved the highest overall performance (accuracy 98%) for both the intratumoral and peritumoral areas. Naïve-Bayes and KNN presented greater variations in performance. The proper selection of datasets plays a very significant role as they are closely correlated to the underlying pathophysiology. CONCLUSION The application of pattern recognition techniques using 3T MR-based perfusion and metabolic features may provide incremental diagnostic value in the differentiation of common intraaxial brain tumors, such as glioblastoma versus metastasis.
Collapse
Affiliation(s)
- Evangelia Tsolaki
- Medical Physics Department, Medical School, University of Thessaly, 41110 , Biopolis, Larissa, Greece,
| | | | | | | | | | | | | |
Collapse
|
21
|
Piotrowski T, Ryczkowski A, Kazmierska J. B-Spline Registration Based on New Concept of an Intelligent Masking Procedure and GPU Computations for the Head and Neck Adaptive Tomotherapy. Technol Cancer Res Treat 2012; 11:257-66. [DOI: 10.7785/tcrt.2012.500294] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The deformable image registration (DIR) procedure has been optimized for helical tomotherapy. The data on registration shifts obtained on matching planning image with pre-treatment megavoltage CT are used in our software for acceleration of the first step (rigid registration) of the DIR procedure and for implementation of the B-Spline algorithm with intelligent masking. Priorities of the masks were automatically calculated based on disagreement detected during rigid registration. Evaluation tasks included: (a) comparison of accuracy and rate for schemes of pre-registered and non-registered images; (b) qualification of the effectiveness of the intelligent masking process, and (c) determination of acceleration of achievable with GPU computing. A specially designed head and neck phantom used for evaluation included structures with controlled changes of position, volume, density, and shape. Re-contouring procedures were performed with an Adaptive Planning software (Tomotherapy Inc.). No statistical difference was observed in accuracy of DIR based on structure position match on the tomotherapy unit and non pre-registered images (p > 0.7). Using pre-registered data reduces the total time required for execution of the elastic registration procedure by 5%. These data are also necessary for intelligent masking procedure during B-Spine registration. Intelligent masking procedure increases accuracy of the registration for a masked structure (p < 0.04) without decreasing the accuracy in non-masked tissues and additionally reduces the total time by 13%. GPU computations speed up procedure 30 times. GPU computing of the DIR in current status of our investigation could be realized in a relatively short time after pre-treatment imaging. The proposed approach can be used in the routine assessment of anatomic changes occurring in healthy tissue during the course of radiotherapy. Further developments will be concentrated on the full integration of DIR computations in the imaging and treatment process of helical tomotherapy.
Collapse
Affiliation(s)
- T. Piotrowski
- Department of Medical Physics, Greater Poland Cancer Center, Garbary 15 st, 61-866, Poznan, Poland
- Department of Electroradiology, University of Medical Sciences, Poznan, Garbary 15 st, 61-866, Poznan, Poland
| | - A. Ryczkowski
- Department of Medical Physics, Greater Poland Cancer Center, Garbary 15 st, 61-866, Poznan, Poland
| | - J. Kazmierska
- II Radiotherapy Department, Greater Poland Cancer Center, Garbary 15 st, 61-866, Poznan, Poland
- Department of Electroradiology, University of Medical Sciences, Poznan, Garbary 15 st, 61-866, Poznan, Poland
| |
Collapse
|
22
|
Niedoszytko M, Bruinenberg M, van Doormaal JJ, de Monchy JGR, Nedoszytko B, Koppelman GH, Nawijn MC, Wijmenga C, Jassem E, Elberink JNGO. Gene expression analysis predicts insect venom anaphylaxis in indolent systemic mastocytosis. Allergy 2011; 66:648-57. [PMID: 21143240 DOI: 10.1111/j.1398-9995.2010.02521.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
BACKGROUND Anaphylaxis to insect venom (Hymenoptera) is most severe in patients with mastocytosis and may even lead to death. However, not all patients with mastocytosis suffer from anaphylaxis. The aim of the study was to analyze differences in gene expression between patients with indolent systemic mastocytosis (ISM) and a history of insect venom anaphylaxis (IVA) compared to those patients without a history of anaphylaxis, and to determine the predictive use of gene expression profiling. METHODS Whole-genome gene expression analysis was performed in peripheral blood cells. RESULTS Twenty-two adults with ISM were included: 12 with a history of IVA and 10 without a history of anaphylaxis of any kind. Significant differences in single gene expression corrected for multiple testing were found for 104 transcripts (P < 0.05). Gene ontology analysis revealed that the differentially expressed genes were involved in pathways responsible for the development of cancer and focal and cell adhesion suggesting that the expression of genes related to the differentiation state of cells is higher in patients with a history of anaphylaxis. Based on the gene expression profiles, a naïve Bayes prediction model was built identifying patients with IVA. CONCLUSIONS In ISM, gene expression profiles are different between patients with a history of IVA and those without. These findings might reflect a more pronounced mast cells dysfunction in patients without a history of anaphylaxis. Gene expression profiling might be a useful tool to predict the risk of anaphylaxis on insect venom in patients with ISM. Prospective studies are needed to substantiate any conclusions.
Collapse
Affiliation(s)
- M Niedoszytko
- Department of Allergology, Medical University of Gdansk, Gdansk, Poland.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Niedoszytko M, Oude Elberink JNG, Bruinenberg M, Nedoszytko B, de Monchy JGR, te Meerman GJ, Weersma RK, Mulder AB, Jassem E, van Doormaal JJ. Gene expression profile, pathways, and transcriptional system regulation in indolent systemic mastocytosis. Allergy 2011; 66:229-37. [PMID: 21208217 DOI: 10.1111/j.1398-9995.2010.02477.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
BACKGROUND Mastocytosis is an uncommon disease resulting from proliferation of abnormal mast cells infiltrating skin, bone marrow, liver, and other tissues. The aim of this study was to find differences in gene expression in peripheral blood cells of patients with indolent systemic mastocytosis compared to healthy controls. The second aim was to define a specific gene expression profile in patients with mastocytosis. METHODS Twenty-two patients with indolent systemic mastocytosis and 43 healthy controls were studied. Whole genome gene expression analysis was performed on RNA samples isolated from the peripheral blood. For amplification and labelling of the RNA, the Illumina TotalPrep 96 RNA Amplification Kit was used. Human HT-12_V3_expression arrays were processed. Data analysis was performed using GeneSpring, Genecodis, and Transcriptional System Regulators. RESULTS Comparison of gene expression between patients and controls revealed a significant difference (P < 0.05 corrected for multiple testing) and the fold change difference >2 in gene expression in 2303 of the 48.794 analysed transcripts. Functional annotation indicated that the main pathways in which the differently expressed genes were involved are ubiquitin-mediated proteolysis, MAPK signalling pathway, pathways in cancer, and Jak-STAT signalling. The expression distributions for both groups did not overlap at all, indicating that many genes are highly differentially expressed in both groups. CONCLUSION We were able to find abnormalities in gene expression in peripheral blood cells of patients with indolent systemic mastocytosis and to construct a gene expression profile which may be useful in clinical practice to predict the presence of mastocytosis and in further research of novel drugs.
Collapse
Affiliation(s)
- M Niedoszytko
- Department of Allergology Medical University of Gdansk, Debinki 7, Gdansk, Poland.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Rajendran P, Madheswaran M. An improved brain image classification technique with mining and shape prior segmentation procedure. J Med Syst 2010; 36:747-64. [PMID: 20703655 DOI: 10.1007/s10916-010-9542-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2010] [Accepted: 06/06/2010] [Indexed: 10/19/2022]
Abstract
The shape prior segmentation procedure and pruned association rule with ImageApriori algorithm has been used to develop an improved brain image classification system are presented in this paper. The CT scan brain images have been classified into three categories namely normal, benign and malignant, considering the low-level features extracted from the images and high level knowledge from specialists to enhance the accuracy in decision process. The experimental results on pre-diagnosed brain images showed 97% sensitivity, 91% specificity and 98.5% accuracy. The proposed algorithm is expected to assist the physicians for efficient classification with multiple key features per image.
Collapse
Affiliation(s)
- P Rajendran
- Department of Computer Science and Engineering, K. S. Rangasamy College of Technology, Tamilnadu, India.
| | | |
Collapse
|
25
|
Niedoszytko M, Bruinenberg M, de Monchy J, Wijmenga C, Platteel M, Jassem E, Oude Elberink JN. Gene expression analysis in predicting the effectiveness of insect venom immunotherapy. J Allergy Clin Immunol 2010; 125:1092-7. [DOI: 10.1016/j.jaci.2010.01.021] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2009] [Revised: 12/29/2009] [Accepted: 01/06/2010] [Indexed: 12/01/2022]
|