1
|
Kumar S, Balaya RDA, Kanekar S, Raju R, Prasad TSK, Kandasamy RK. Computational tools for exploring peptide-membrane interactions in gram-positive bacteria. Comput Struct Biotechnol J 2023; 21:1995-2008. [PMID: 36950221 PMCID: PMC10025024 DOI: 10.1016/j.csbj.2023.02.051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/27/2023] [Accepted: 02/27/2023] [Indexed: 03/05/2023] Open
Abstract
The vital cellular functions in Gram-positive bacteria are controlled by signaling molecules known as quorum sensing peptides (QSPs), considered promising therapeutic interventions for bacterial infections. In the bacterial system QSPs bind to membrane-coupled receptors, which then auto-phosphorylate and activate intracellular response regulators. These response regulators induce target gene expression in bacteria. One of the most reliable trends in drug discovery research for virulence-associated molecular targets is the use of peptide drugs or new functionalities. In this perspective, computational methods act as auxiliary aids for biologists, where methodologies based on machine learning and in silico analysis are developed as suitable tools for target peptide identification. Therefore, the development of quick and reliable computational resources to identify or predict these QSPs along with their receptors and inhibitors is receiving considerable attention. The databases such as Quorumpeps and Quorum Sensing of Human Gut Microbes (QSHGM) provide a detailed overview of the structures and functions of QSPs. The tools and algorithms such as QSPpred, QSPred-FL, iQSP, EnsembleQS and PEPred-Suite have been used for the generic prediction of QSPs and feature representation. The availability of compiled key resources for utilizing peptide features based on amino acid composition, positional preferences, and motifs as well as structural and physicochemical properties, including biofilm inhibitory peptides, can aid in elucidating the QSP and membrane receptor interactions in infectious Gram-positive pathogens. Herein, we present a comprehensive survey of diverse computational approaches that are suitable for detecting QSPs and QS interference molecules. This review highlights the utility of these methods for developing potential biomarkers against infectious Gram-positive pathogens.
Collapse
Key Words
- 3-HBA, 3–Hydroxybenzoic Acid
- AAC, Amino Acid Composition
- ABC, ATP-binding cassette
- ACD, Available Chemicals Database
- AIP, Autoinducing Peptide
- AMP, Anti-Microbial Peptide
- ATP, Adenosine Triphosphate
- Agr, Accessory gene regulator
- BFE, Binding Free Energy
- BIP Inhibitors
- BIP, Biofilm Inhibitory Peptides
- BLAST, Basic Local Alignment Search Tool
- BNB, Bernoulli Naïve-Bayes
- CADD, Computer-Aided Drug Design
- CSP, Competence Stimulating Peptide
- CTD, Composition-Transition-Distribution
- D, Aspartate
- DCH, 3,3′-(3,4-dichlorobenzylidene)-bis-(4-hydroxycoumarin)
- DT, Decision Tree
- FDA, Food and Drug Administration
- GBM, Gradient Boosting Machine
- GDC, g-gap Dipeptide
- GNB, Gaussian NB
- Gram-positive bacteria
- H, Histidine
- H-Kinase, Histidine Kinase
- H-phosphotransferase, Histidine Phosphotransferase
- HAM, Hamamelitannin
- HGM, Human Gut Microbiota
- HNP, Human Neutrophil Peptide
- IT, Information Theory Features
- In silico approaches
- KNN, K-Nearest Neighbors
- MCC, Mathew Co-relation Coefficient
- MD, Molecular Dynamics
- MDR, Multiple Drug Resistance
- ML, Machine Learning
- MRSA, Methicillin Resistant S. aureus
- MSL, Multiple Sequence Alignment
- OMR, Omargliptin
- OVP, Overlapping Property Features
- PCP, Physicochemical Properties
- PDB, Protein Data Bank
- PPIs, Protein-Protein Interactions
- PSM, Phenol-Soluble Modulin
- PTM, Post Translational Modification
- QS, Quorum Sensing
- QSCN, QS communication network
- QSHGM, Quorum Sensing of Human Gut Microbes
- QSI, QS Inhibitors
- QSIM, QS Interference Molecules
- QSP inhibitors
- QSP predictors
- QSP, QS Peptides
- QSPR, Quantitative Structure Property Relationship
- Quorum sensing peptides
- RAP, RNAIII-activating protein
- RF, Random Forest
- RIP, RNAIII-inhibiting peptide
- ROC, Receiver Operating Characteristic
- SAR, Structure-Activity Relationship
- SFS, Sequential Forward Search
- SIT, Sitagliptin
- SVM, Support Vector Machine
- TCS, Two-Component Sensory
- TRAP, Target of RAP
- TRG, Trelagliptin
- WHO, World Health Organization
- mRMR, minimum Redundancy and Maximum Relevance
Collapse
Affiliation(s)
- Shreya Kumar
- Centre for Integrative Omics Data Science, Yenepoya (Deemed to be University), Mangalore 575018, India
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | | | - Saptami Kanekar
- Centre for Integrative Omics Data Science, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science, Yenepoya (Deemed to be University), Mangalore 575018, India
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | | | - Richard K. Kandasamy
- Centre of Molecular Inflammation Research (CEMIR), and Department of Clinical and Molecular Medicine (IKOM), Norwegian University of Science and Technology, 7491 Trondheim, Norway
- Department of Laboratory Medicine and Pathology, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA
| |
Collapse
|
2
|
Lau LCM, Chui ECS, Man GCW, Xin Y, Ho KKW, Mak KKK, Ong MTY, Law SW, Cheung WH, Yung PSH. A novel image-based machine learning model with superior accuracy and predictability for knee arthroplasty loosening detection and clinical decision making. J Orthop Translat 2022; 36:177-83. [PMID: 36263380 DOI: 10.1016/j.jot.2022.07.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 06/13/2022] [Accepted: 07/04/2022] [Indexed: 11/08/2022] Open
Abstract
Background Loosening is the leading cause of total knee arthroplasty (TKA) revision. This is a heavy burden toward the healthcare system owing to the difficulty in diagnosis and complications occurring from the delay management. Based on automatic analytical model building, machine learning, may potentially help to automatically recognize the risk of loosening based on radiographs alone. The aim of this study was to build an image-based machine-learning model for detecting TKA loosening. Methods Image-based machine-learning model was developed based on ImageNet, Xception model and a TKA patient X-ray image dataset. Based on a dataset with TKA patient clinical parameters, another system was then created for developing the clinical-information-based machine learning model with random forest classifier. In addition, the Xception Model was pre-trained on the ImageNet database with python and TensorFlow deep learning library for the prediction of loosening. Class activation maps were also used to interpret the prediction decision made by model. Two senior orthopaedic specialists were invited to assess loosening from X-ray images for 3 attempts in setting up comparison benchmark. Result In the image-based machine learning loosening model, the precision rate and recall rate were 0.92 and 0.96, respectively. While for the accuracy rate, 96.3% for visualization classification was observed. However, the addition of clinical-information-based model, with precision rate of 0.71 and recall rate of 0.20, did not further showed improvement on the accuracy. Moreover, as class activation maps showed corresponding signals over bone-implant interface that is loosened radiographically, this confirms that the current model utilized a similar image recognition pattern as that of inspection by clinical specialists. Conclusion The image-based machine learning model developed demonstrated high accuracy and predictability of knee arthroplasty loosening. And the class activation heatmap matched well with the radiographic features used clinically to detect loosening, which highlighting its potential role in assisting clinicians in their daily practice. However, addition of clinical-information-based machine-learning model did not offer further improvement in detection. As far as we know, this is the first report of pure image-based machine learning model with high detection accuracy. Importantly, this is also the first model to show relevant class activation heatmap corresponding to loosening location. Translational potential The finding in this study indicated image-based machine learning model can detect knee arthroplasty loosening with high accuracy and predictability, which the class activation heatmap can potentially assist surgeons to identify the sites of loosening.
Collapse
|
3
|
Sykes AL, Larrieu E, Poggio TV, Céspedes MG, Mujica GB, Basáñez MG, Prada JM. Modelling diagnostics for Echinococcus granulosus surveillance in sheep using Latent Class Analysis: Argentina as a case study. One Health 2022; 14:100359. [PMID: 34977321 PMCID: PMC8683760 DOI: 10.1016/j.onehlt.2021.100359] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 11/30/2021] [Accepted: 12/03/2021] [Indexed: 11/03/2022] Open
Abstract
Echinococcus granulosus sensu lato is a globally prevalent zoonotic parasitic cestode leading to cystic echinococcosis (CE) in both humans and sheep with both medical and financial impacts, whose reduction requires the application of a One Health approach to its control. Regarding the animal health component of this approach, lack of accurate and practical diagnostics in livestock impedes the assessment of disease burden and the implementation and evaluation of control strategies. We use of a Bayesian Latent Class Analysis (LCA) model to estimate ovine CE prevalence in sheep samples from the Río Negro province of Argentina accounting for uncertainty in the diagnostics. We use model outputs to evaluate the performance of a novel recombinant B8/2 antigen B subunit (rEgAgB8/2) indirect enzyme-linked immunosorbent assay (ELISA) for detecting E. granulosus in sheep. Necropsy (as a partial gold standard), western blot (WB) and ELISA diagnostic data were collected from 79 sheep within two Río Negro slaughterhouses, and used to estimate individual infection status (assigned as a latent variable within the model). Using the model outputs, the performance of the novel ELISA at both individual and flock levels was evaluated, respectively, using a receiver operating characteristic (ROC) curve, and simulating a range of sample sizes and prevalence levels within hypothetical flocks. The estimated (mean) prevalence of ovine CE was 27.5% (95%Bayesian credible interval (95%BCI): 13.8%-58.9%) within the sample population. At the individual level, the ELISA had a mean sensitivity and specificity of 55% (95%BCI: 46%-68%) and 68% (95%BCI: 63%-92%), respectively, at an optimal optical density (OD) threshold of 0.378. At the flock level, the ELISA had an 80% probability of correctly classifying infection at an optimal cut-off threshold of 0.496. These results suggest that the novel ELISA could play a useful role as a flock-level diagnostic for CE surveillance in the region, supplementing surveillance activities in the human population and thus strengthening a One Health approach. Importantly, selection of ELISA cut-off threshold values must be tailored according to the epidemiological situation.
Collapse
Key Words
- Argentina
- BCI, Bayesian Credible Interval
- Bayesian inference
- CE, Cystic Echinococcosis
- CI, Confidence Interval
- DALY, Disability-adjusted life year
- Diagnostics
- ELISA, Enzyme-Linked Immunosorbent Assay
- Echinococcosis
- JAGS, Just Another Gibbs Sampler
- LCA, Latent class analysis
- Latent class analysis
- MCAR, Missing completely at random
- MCMC, Markov Chain Monte Carlo
- OD, Optical density
- ROC, Receiver Operating Characteristic
- SD, Standard deviation
- Surveillance
- USD, United States Dollar
- WB, Western blot
- WHO, World Health Organization.
Collapse
Affiliation(s)
- Abagael L. Sykes
- London Centre for Neglected Tropical Disease Research and MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, UK
| | - Edmundo Larrieu
- Facultad de Ciencias Veterinarias, Universidad Nacional de La Pampa, General Pico, Argentina
- Escuela de Veterinaria, Universidad Nacional de Río Negro, Choele Choel, Argentina
| | | | | | | | - Maria-Gloria Basáñez
- London Centre for Neglected Tropical Disease Research and MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, UK
| | - Joaquin M. Prada
- Faculty of Health and Medical Sciences, University of Surrey, Guildford, UK
| |
Collapse
|
4
|
Karampela I, Dalamaga M. Serum bilirubin to fetuin-A ratio as a prognostic biomarker in critically ill patients with sepsis. Metabol Open 2021; 10:100094. [PMID: 34027380 PMCID: PMC8131912 DOI: 10.1016/j.metop.2021.100094] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Accepted: 04/26/2021] [Indexed: 12/15/2022] Open
Abstract
Liver dysfunction during sepsis is associated with increased bilirubin and decreased fetuin-A, a major hepatokine. We aimed to explore the association of bilirubin to fetuin-A (B/F) ratio early in sepsis with severity and outcome in critically ill patients. Based on a previous prospective study, we analyzed data of 90 critically ill patients (52 males, age: 65 ± 15 years, APACHE II: 24 ± 7 and SOFA: 10 ± 3) with sepsis. Bilirubin and fetuin-A increased during the first week of sepsis, (median (IQR) 0.45 (0.32-1) vs 0.55 (0.29-0.78) mg/dL, p = 0.03 and 302 (248-336) vs 358 (307-399) μg/mL, p < 0.001, respectively) while the B/F ratio did not change significantly. However, the B/F ratio at baseline and one week later was significantly higher in patients with septic shock (N = 38) and nonsurvivors (N = 28) compared to patients with sepsis (N = 52) and survivors (N = 62), respectively. The B/F ratio was positively associated with severity scores and outperformed bilirubin as a predictor of mortality in ROC curve analysis (AUC 0.78 (0.69-0.88), p < 0.001 and 0.69 (0.57-0.8), p = 0.003 respectively). The B/F ratio may be a promising sepsis biomarker with possible predictive value in critically ill patients.
Collapse
Affiliation(s)
- Irene Karampela
- Second Department of Critical Care, Attikon General University Hospital, Medical School, National and Kapodistrian University of Athens, Chaidari, Greece
- Corresponding author. .Second Department of Critical Care, Attikon General University Hospital, Medical School, National and Kapodistrian University of Athens, 1 Rimini Street, 12462, Ηaidari, Greece.
| | - Maria Dalamaga
- Department of Biological Chemistry, Medical School, National and Kapodistrian University of Athens, Athens, Greece
- Corresponding author. Biological Chemistry, Clinical Biochemistry Department of Biological Chemistry, Medical School, National and Kapodistrian University of Athens, 27 Mikras Asias, 11527, Goudi, Athens, Greece.
| |
Collapse
|
5
|
Montalbo FJP. Diagnosing Covid-19 chest x-rays with a lightweight truncated DenseNet with partial layer freezing and feature fusion. Biomed Signal Process Control 2021; 68:102583. [PMID: 33828610 PMCID: PMC8015405 DOI: 10.1016/j.bspc.2021.102583] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 03/23/2021] [Accepted: 03/26/2021] [Indexed: 12/26/2022]
Abstract
Due to the unforeseen turn of events, our world has undergone another global pandemic from a highly contagious novel coronavirus named COVID-19. The novel virus inflames the lungs similarly to Pneumonia, making it challenging to diagnose. Currently, the common standard to diagnose the virus's presence from an individual is using a molecular real-time Reverse-Transcription Polymerase Chain Reaction (rRT-PCR) test from fluids acquired through nasal swabs. Such a test is difficult to acquire in most underdeveloped countries with a few experts that can perform the test. As a substitute, the widely available Chest X-Ray (CXR) became an alternative to rule out the virus. However, such a method does not come easy as the virus still possesses unknown characteristics that even experienced radiologists and other medical experts find difficult to diagnose through CXRs. Several studies have recently used computer-aided methods to automate and improve such diagnosis of CXRs through Artificial Intelligence (AI) based on computer vision and Deep Convolutional Neural Networks (DCNN), which some require heavy processing costs and other tedious methods to produce. Therefore, this work proposed the Fused-DenseNet-Tiny, a lightweight DCNN model based on a densely connected neural network (DenseNet) truncated and concatenated. The model trained to learn CXR features based on transfer learning, partial layer freezing, and feature fusion. Upon evaluation, the proposed model achieved a remarkable 97.99 % accuracy, with only 1.2 million parameters and a shorter end-to-end structure. It has also shown better performance than some existing studies and other massive state-of-the-art models that diagnosed COVID-19 from CXRs.
Collapse
Key Words
- AP, Average Pooling
- AUC, Area Under the Curve
- BN, Batch Normalization
- BS, Batch Size
- CAD, Computer-Aided Diagnosis
- CCE, Categorical Cross-Entropy
- CNN, Convolutional Neural Networks
- CT, Computer Tomography
- CV, Computer Vision
- CXR, Chest X-Rays
- Chest x-rays
- Computer-aided diagnosis
- Covid-19
- DCNN, Deep Convolutional Neural Networks
- DL, Deep Learning
- DR, Dropout Rate
- Deep learning
- Densely connected neural networks
- GAP, Global Average Pooling
- GRAD-CAM, Gradient-Weighted Class Activation Maps
- JPG, Joint Photographic Group
- LR, Learning Rate
- MP, Max-Pooling
- P-R, Precision-Recall
- PEPX, Projection-Expansion-Projection-Extension
- ROC, Receiver Operating Characteristic
- ReLU, Rectified Linear Unit
- SGD, Stochastic Gradient Descent
- WHO, World Health Organization
- rRT-PCR, real-time Reverse-Transcription Polymerase Chain Reaction
Collapse
|
6
|
Wang Q, Zhang Y, Zhang E, Xing X, Chen Y, Su MY, Lang N. Prediction of the early recurrence in spinal giant cell tumor of bone using radiomics of preoperative CT: Long-term outcome of 62 consecutive patients. J Bone Oncol 2021; 27:100354. [PMID: 33850701 PMCID: PMC8039834 DOI: 10.1016/j.jbo.2021.100354] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 02/26/2021] [Accepted: 02/28/2021] [Indexed: 12/27/2022] Open
Abstract
Characteristics of 62 patients with spinal GCTB who underwent surgery. A prognostic classification model was built based on features selected by SVM. The combined histogram and texture features could predict recurrence of GCTB.
Objectives To determine if radiomics analysis based on preoperative computed tomography (CT) can predict early postoperative recurrence of giant cell tumor of bone (GCTB) in the spine. Methods In a retrospective review, 62 patients with pathologically confirmed spinal GCTB from March 2008 to February 2018, with a minimum follow-up of 24 months, were identified. The mean follow-up was 73.7 months (range, 28.7–152.1 months). The clinical information including age, gender, lesion location, multi-vertebral involvement, and surgical methods, were obtained. CT images acquired before the operation were retrieved for radiomics analysis. For each case, the tumor regions of interest (ROI) was manually outlined, and a total of 107 radiomics features were extracted. The features were selected via the sequential selection process by using the support vector machine (SVM), then used to construct classification models with Gaussian kernels. The differentiation between recurrence and non-recurrence groups was evaluated by ROC analysis, using 10-fold cross-validation. Results Of the 62 patients, 17 had recurrence with a recurrence rate of 27.4%. None of the clinical information was significantly different between the two groups. Patients receiving curettage had a higher recurrence rate (6/16 = 37.5%) compared to patients receiving TES (6/26 = 23.1%) or intralesional spondylectomy (5/20 = 25%). The final radiomics model was built using 10 selected features, which achieved an accuracy of 89% with AUC of 0.78. Conclusions The radiomics model developed based on pre-operative CT can achieve a high accuracy to predict the recurrence of spinal GCTB. Patients who have a high risk of early recurrence should be treated more aggressively to minimize recurrence.
Collapse
Key Words
- CT texture analysis
- CT, Computed Tomography
- DICOM, Digital Imaging and Communications in Medicine
- GCTB, Giant Cell Tumor of Bone
- GLCM, Gray Level Co-occurrence Matrix
- GLDM, Gray Level Dependence Matrix
- GLRLM, Gray Level Run Length Matrix
- GLSZM, Gray Level Size Zone Matrix
- Giant cell tumor of bone
- MRI, Magnetic Resonance Imaging
- NGTDM, Neighborhood Gray Tone Difference Matrix
- OPG, Osteoprotegerin
- PACS, Picture Archiving and Communication System
- Prognosis
- RANK, Receptor Activator of Nuclear factor Kappa-Β
- RANKL, Receptor Activator of Nuclear factor Kappa-Β Ligand
- ROC, Receiver Operating Characteristic
- ROI, Regions of Interest
- Radiomics
- SVM, Support Vector Machine
- Spine
Collapse
Affiliation(s)
- Qizheng Wang
- Department of Radiology, Peking University Third Hospital, 49 North Garden Road, Haidian District, Beijing 100191, China
| | - Yang Zhang
- 164 Irvine Hall, Center for Functional Onco-Imaging, University of California, Irvine, CA 92697-5020, USA.,Department of Radiation Oncology, Rutgers-Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, NJ, United States
| | - Enlong Zhang
- Department of Radiology, Peking University International Hospital, Life Park Road No.1 Life Science Park of Zhong Guancun, Chang Ping District, Beijing 100191, China
| | - Xiaoying Xing
- Department of Radiology, Peking University Third Hospital, 49 North Garden Road, Haidian District, Beijing 100191, China
| | - Yongye Chen
- Department of Radiology, Peking University Third Hospital, 49 North Garden Road, Haidian District, Beijing 100191, China
| | - Min-Ying Su
- 164 Irvine Hall, Center for Functional Onco-Imaging, University of California, Irvine, CA 92697-5020, USA.,Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Ning Lang
- Department of Radiology, Peking University Third Hospital, 49 North Garden Road, Haidian District, Beijing 100191, China
| |
Collapse
|
7
|
Ferdinand S, Mondal M, Mallik S, Goswami J, Das S, Manir KS, Sen A, Palit S, Sarkar P, Mondal S, Das S, Pal B. Dosimetric analysis of Deep Inspiratory Breath-hold technique (DIBH) in left-sided breast cancer radiotherapy and evaluation of pre-treatment predictors of cardiac doses for guiding patient selection for DIBH. Tech Innov Patient Support Radiat Oncol 2021; 17:25-31. [PMID: 33681484 PMCID: PMC7930610 DOI: 10.1016/j.tipsro.2021.02.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 02/10/2021] [Accepted: 02/10/2021] [Indexed: 12/25/2022] Open
Abstract
Introduction The risk of radiotherapy-associated cardiovascular disease has been a concern for decades in breast cancer survivors. The objective of our study is to evaluate the dosimetric benefit of Deep Inspiratory Breath-hold technique (DIBH) on organs-at-risk (OAR) sparing in left-sided breast cancer radiotherapy and to find out pre-treatment predictors of cardiac doses for guiding patient selection for DIBH. Material and methods Pre-radiotherapy planning CT scans were done in Free Breathing (FB) and in DIBH [using Active Breathing Coordinator system (ABC™)] in 31 left sided breast cancer patients. 3DCRT plans were generated for both scans. Comparison of anatomical and dosimetric variables were done using paired t test and correlation was evaluated using Pearson correlation. Linear regression was used to get independent predictors of cardiac sparing and Receiver Operating Characteristic (ROC) curve analysis was done to find out the specific threshold of the predictors. Results There was a 39.15% reduction in mean heart dose in DIBH compared to FB (2.4 Gy vs 4.01 Gy) (p < 0.001), 19% reduction in maximum Left Anterior Descending (LAD) dose and a 9.9% reduction in ipsilateral lung mean dose (p = 0.036) with DIBH. A significant correlation was observed between reduction in Heart Volume in Field (HVIF) and Maximum Heart Depth (MHD) with reduction in mean heart dose. Reduction in HVIF (ΔHVIF) independently predicted cardiac sparing. Conclusion DIBH leads to significant reduction in OAR doses and is suggested for all patients of left-sided breast cancer undergoing radiotherapy. However, HVIF and MHD predicted for cardiac sparing and threshold criteria of ΔHVIF and ΔMHD may be used by centres with high workload to select patients for DIBH.
Collapse
Key Words
- 3DCRT, Three-Dimensional Conformal Radiation Therapy
- ABC™, Active Breathing Coordinator™
- AUC, Area under the curve
- BCS, Breast Conservation Surgery
- BMI, Body Mass Index
- Breast cancer
- CCD, Cardiac Contact Distance
- CD, Chest Depth
- CLD, Central Lung Distance
- CS, Chest Separation
- CT, Computer Tomography
- DIBH, Deep Inspiratory Breath-hold
- DVH, Dose Volume Histograms
- Deep inspiratory breath-hold
- Dosimetric predictors
- EORTC, European Organization for Research and Treatment of Cancer
- FB, Free Breathing
- HCWD, Heart Chest Wall Distance
- HCWL, Heart Chest Wall Length
- HH, Heart Height
- HV, Heart Volume
- HVIF, Heart Volume in Field
- IMC, Internal Mammary Chain
- LAD, Left Anterior Descending
- LOD, Lung Orthogonal Distance
- LV, Lung Volume
- MHD, Maximum Heart Depth
- MRM, Modified Radical Mastectomy
- NTCP, Normal Tissue Complications Probability
- OAR, Organs-at-risk
- PTV, Planning target volume
- RNI, Regional Nodal Irradiation
- ROC, Receiver Operating Characteristic
- RPM, Real-time Position Management
- RTOG, Radiation Therapy Oncology Group
- Radiotherapy
- SCF, Supraclavicular Fossa
Collapse
Affiliation(s)
- Soujanya Ferdinand
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Monidipa Mondal
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Suman Mallik
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Jyotirup Goswami
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Sayan Das
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Kazi S Manir
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Arijit Sen
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Soura Palit
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Papai Sarkar
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Subhayan Mondal
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Suresh Das
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| | - Bipasha Pal
- Radiation Oncology, Narayana Superspeciality Hospital, Andul Road, Howrah, West Bengal 711103, India
| |
Collapse
|
8
|
Lopez-Pais J, Otero DL, Ferreiro TG, Antonio CEC, Muiños PJA, Perez-Poza M, García ÓO, Ramos VJ, Fernández MS, Fernandez MB, Pena XCS, Roman AV, Romero MP, Lago AL, Escudero JÁ, Román AS, Gonzalez-Juanatey JR. Fast track triage for COVID-19 based on a population study: The soda score. Prev Med Rep 2021; 21:101298. [PMID: 33489725 PMCID: PMC7809432 DOI: 10.1016/j.pmedr.2020.101298] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2020] [Revised: 10/28/2020] [Accepted: 12/07/2020] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Healthcare systems are under prominent stress due to the COVID-19 pandemic. A fast and simple triage is mandatory to screen patients who will benefit from early hospitalization, from those that can be managed as outpatients. There is a lack of all-comers scores, and no score has been proposed for western-world population. AIMS To develop a fast-track risk score valid for every COVID-19 patient at diagnosis. METHODS Single-center, retrospective study based on all the inhabitants of a healthcare area. Logistic regression was used to identify simple and wide-available risk factors for adverse events (death, intensive care admission, invasive mechanical ventilation, bleeding > BARC3, acute renal injury, respiratory insufficiency, myocardial infarction, acute heart failure, pulmonary emboli, or stroke). RESULTS Of the total healthcare area population, 447.979 inhabitants, 965 patients (0.22%), were diagnosed with COVID-19. A total of 124 patients (12.85%) experienced adverse events. The novel SODA score (based on sex, peripheral O2 saturation, presence of diabetes, and age) demonstrated good accuracy for adverse events prediction (area under ROC curve 0.858, CI: 0.82-0.98). A cut-off value of ≤2 points identifies patients with low risk (positive predictive value [PPV] for absence of events: 98.9%) and a cut-off of ≥5 points, high-risk patients (PPV 58.8% for adverse events). CONCLUSIONS This quick and easy score allows fast-track triage at the moment of diagnosis for COVID-19 using four simple variables: age, sex, SpO2, and diabetes. SODA score could improve preventive measures taken at diagnosis in high-risk patients and also relieve resources by identifying very low-risk patients.
Collapse
Affiliation(s)
- Javier Lopez-Pais
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
| | - Diego López Otero
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
| | - Teba Gonzalez Ferreiro
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Carla Eugenia Cacho Antonio
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Pablo José Antúnez Muiños
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Marta Perez-Poza
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Óscar Otero García
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Victor Jimenez Ramos
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Manuela Sestayo Fernández
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - María Bastos Fernandez
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
- Instituto de Investigación Sanitaria IDICHUS, Spain
| | - Xoan Carlos Sanmartin Pena
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
- Instituto de Investigación Sanitaria IDICHUS, Spain
| | - Alfonso Varela Roman
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
- Instituto de Investigación Sanitaria IDICHUS, Spain
| | - Manuel Portela Romero
- Primary Healthcare, Centro de Salud Concepción Arenal, Santiago de Compostela, Spain
| | - Ana López Lago
- Intensive Care Unit, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Julián Álvarez Escudero
- Anaesthesia Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
| | - Alberto San Román
- Cardiology Department, University Clinical Hospital of Valladolid, Valladolid, Spain
| | - Jose Ramón Gonzalez-Juanatey
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
| | - On behalf of CARDIOVID investigators
- Cardiology Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- CIBERCV, Spain
- Instituto de Investigación Sanitaria IDICHUS, Spain
- Primary Healthcare, Centro de Salud Concepción Arenal, Santiago de Compostela, Spain
- Intensive Care Unit, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- Anaesthesia Department, University Clinical Hospital of Santiago de Compostela, Santiago de Compostela, Spain
- Cardiology Department, University Clinical Hospital of Valladolid, Valladolid, Spain
| |
Collapse
|
9
|
Ghannam RB, Techtmann SM. Machine learning applications in microbial ecology, human microbiome studies, and environmental monitoring. Comput Struct Biotechnol J 2021; 19:1092-1107. [PMID: 33680353 PMCID: PMC7892807 DOI: 10.1016/j.csbj.2021.01.028] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 01/16/2021] [Accepted: 01/18/2021] [Indexed: 01/04/2023] Open
Abstract
Advances in nucleic acid sequencing technology have enabled expansion of our ability to profile microbial diversity. These large datasets of taxonomic and functional diversity are key to better understanding microbial ecology. Machine learning has proven to be a useful approach for analyzing microbial community data and making predictions about outcomes including human and environmental health. Machine learning applied to microbial community profiles has been used to predict disease states in human health, environmental quality and presence of contamination in the environment, and as trace evidence in forensics. Machine learning has appeal as a powerful tool that can provide deep insights into microbial communities and identify patterns in microbial community data. However, often machine learning models can be used as black boxes to predict a specific outcome, with little understanding of how the models arrived at predictions. Complex machine learning algorithms often may value higher accuracy and performance at the sacrifice of interpretability. In order to leverage machine learning into more translational research related to the microbiome and strengthen our ability to extract meaningful biological information, it is important for models to be interpretable. Here we review current trends in machine learning applications in microbial ecology as well as some of the important challenges and opportunities for more broad application of machine learning to understanding microbial communities.
Collapse
Key Words
- 16S rRNA
- ANN, Artificial Neural Networks
- ASV, Amplicon Sequence Variant
- AUC, Area Under the Curve
- Forensics
- GB, Gradient Boosting
- ML, Machine Learning
- Machine learning
- Marker genes
- Metagenomics
- PCoA, Principal Coordinate Analysis
- RF, Random Forests
- ROC, Receiver Operating Characteristic
- SML, Supervised Machine Learning
- SVM, Support Vector Machines
- USML, Unsupervised Machine Learning
- tSNE, t-distributed Stochastic Neighbor Embedding
Collapse
Affiliation(s)
- Ryan B. Ghannam
- Department of Biological Sciences, Michigan Technological University, Houghton MI, United States
| | - Stephen M. Techtmann
- Department of Biological Sciences, Michigan Technological University, Houghton MI, United States
| |
Collapse
|
10
|
Beltrán-Ortiz C, Peralta T, Ramos V, Durán M, Behrens C, Maureira D, Guzmán MA, Bastias C, Ferrer P. Standardization of a colorimetric technique for determination of enzymatic activity of diamine oxidase (DAO) and its application in patients with clinical diagnosis of histamine intolerance. World Allergy Organ J 2020; 13:100457. [PMID: 32922624 PMCID: PMC7475190 DOI: 10.1016/j.waojou.2020.100457] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 07/23/2020] [Accepted: 08/09/2020] [Indexed: 12/30/2022] Open
Abstract
Background Diamine Oxidase (DAO) has an essential role for degradation of exogenous histamine in the intestine; thus, histamine intolerance (HI) mainly has been correlated to a low concentration and/or activity of this enzyme. The objective of the study was to standardize a colorimetric technique to measure the enzymatic activity (function) of hDAO to then apply it to a series of 22 patients with a clinical diagnosis of HI. Methods For the standardization variables such as volume and type of sample, incubation time, wavelength of maximum absorption, types of substrates, and concentration of oxidized ascorbate were evaluated. Then the activity and concentration of DAO was determined in 22 patients diagnosed with HI and 22 healthy subjects. Results The mean of serum DAO concentration in the 22 patients was of 9.268 ± 1.124 U/mL. The mean of serum DAO concentration in the 22 controls was of 20.710 ± 2.509 U/mL, being significantly higher (P value 0.0002) the mean of the samples. The mean of serum DAO activity of the patients was of 1.143 ± 0.085 U/L and the controls was 1.533 ± 0.119 U/L, significantly greater than the patients (P value 0.011). In addition, the sensitivity of both techniques was 0.63. In the measuring of DAO concentration the specificity was 0.9, constituting a good diagnostic test, especially to rule out the true negatives. The determination of DAO activity had a specificity of 0.68. Conclusions Although we used a small number of patients and controls and the absorbance values were lower than expected, statistically significant differences were found in the levels of concentration and DAO activity between the patients with histamine intolerance and the controls. Therefore, the measuring of DAO concentration and DAO activity is a good diagnostic strategy for study suspect cases of HI. The simultaneous use of both assays allows to reduce positive and negative false results, for example, patients with normal DAO levels that could present a dysfunction in the activity of this enzyme.
Collapse
Affiliation(s)
- Camila Beltrán-Ortiz
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Teresa Peralta
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Verónica Ramos
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Magdalena Durán
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Carolina Behrens
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Daniella Maureira
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Maria A Guzmán
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Carla Bastias
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| | - Pablo Ferrer
- Section of Immunology, HIV and Allergy, Department of Medicine, Clinical Hospital University of Chile, Chile
| |
Collapse
|
11
|
Ramírez L, de Moura LD, Mateus NLF, de Moraes MH, do Nascimento LFM, de Jesus Melo N, Taketa LB, Catecati T, Huete SG, Penichet K, Piranda EM, de Oliveira AG, Steindel M, Barral-Netto M, do Socorro Pires e Cruz M, Barral A, Soto M. Improving the serodiagnosis of canine Leishmania infantum infection in geographical areas of Brazil with different disease prevalence. Parasite Epidemiol Control 2020; 8:e00126. [PMID: 31832561 PMCID: PMC6890974 DOI: 10.1016/j.parepi.2019.e00126] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Accepted: 11/19/2019] [Indexed: 01/31/2023] Open
Abstract
Serodiagnosis of Leishmania infantum infection in dogs relies on the detection of antibodies against leishmanial crude extracts or parasitic defined antigens. The expansion of canine leishmaniasis from geographical areas of Brazil in which the infection is endemic to regions in which the disease is emerging is occurring. This fact makes necessary the analysis of the serodiagnostic capabilities of different leishmanial preparations in distinct geographical locations. In this article sera from dogs infected with Leishmania and showing the clinical form of the disease, were collected in three distinct Brazilian States and were tested against soluble leishmanial antigens or seven parasite individual antigens produced as recombinant proteins. We show that the recognition of soluble leishmanial antigens by sera from these animals was influenced by the geographical location of the infected dogs. Efficacy of the diagnosis based on this crude parasite preparation was higher in newly endemic regions when compared with areas of high disease endemicity. We also show that the use of three of the recombinant proteins, namely parasite surface kinetoplastid membrane protein of 11 kDa (KMP-11), and two members of the P protein family (P2a and P0), can improve the degree of sensitivity without adversely affecting the specificity of the diagnostic assays for canine leishmaniasis, independently of the geographical area of residence. In addition, sera from dogs clinically healthy but infected were also assayed with some of the antigen preparations. We demonstrate that the use of these proteins can help to the serodiagnosis of Leishmania infected animals with subclinical infections. Finally, we propose a diagnostic protocol using a combination of KMP-11, P2a y P0, together with total leishmanial extracts.
Collapse
Key Words
- Antibodies
- BB, blocking buffer
- CanL, Canine visceral leishmaniasis
- Canine leishmaniasis
- EDCB, ELISA denaturant coating buffer
- ELISA, enzyme-linked immunosorbent assay
- HSP, Heat shock protein
- KMP-11, Kinetoplastid-membrane protein of 11 kDa
- LR, Likelihood ratio
- Leishmania
- MS, Mato Grosso do Sul State (Brazil)
- PBS, phosphate saline buffer
- PI, Piaui State (Brazil)
- ROC, Receiver Operating Characteristic
- RR, Relative reactivity
- RT, Room temperature
- Recombinant proteins
- SC, Santa Catarina State (Brazil)
- SLA, Soluble leishmanial antigen
- Serodiagnosis
- VL, Visceral leishmaniosis
- WB, Washing buffer
Collapse
Affiliation(s)
- Laura Ramírez
- Centro de Biología Molecular Severo Ochoa (CBMSO), Departamento de Biología Molecular, Facultad de Ciencias, CSIC-UAM, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Luana Dias de Moura
- Centro de Ciências Agrárias, Universidade Federal do Piaui (UFPI), Teresina, 64049-550 PI, Brazil
| | - Natalia Lopes Fontoura Mateus
- Laboratório de Parasitologia, Centro de Ciências Biológicas e da Saúde, Universidade Federal de Mato Grosso do Sul (UFMS), Cidade Universitária, s/n, Campo Grande 79070-900 MS, Brazil
| | - Milene Hoehr de Moraes
- Departamento de Microbiologia, Imunologia e Parasitologia, Universidade Federal de Santa Catarina (UFSC), Florianópolis 88040-900 SC, Brazil
| | | | - Nailson de Jesus Melo
- Centro de Ciências Agrárias, Universidade Federal do Piaui (UFPI), Teresina, 64049-550 PI, Brazil
| | - Lucas Bezerra Taketa
- Laboratório de Parasitologia, Centro de Ciências Biológicas e da Saúde, Universidade Federal de Mato Grosso do Sul (UFMS), Cidade Universitária, s/n, Campo Grande 79070-900 MS, Brazil
| | - Tatiana Catecati
- Departamento de Microbiologia, Imunologia e Parasitologia, Universidade Federal de Santa Catarina (UFSC), Florianópolis 88040-900 SC, Brazil
| | - Samuel G. Huete
- Centro de Biología Molecular Severo Ochoa (CBMSO), Departamento de Biología Molecular, Facultad de Ciencias, CSIC-UAM, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Karla Penichet
- Centro de Biología Molecular Severo Ochoa (CBMSO), Departamento de Biología Molecular, Facultad de Ciencias, CSIC-UAM, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Eliane Mattos Piranda
- Laboratório de Parasitologia, Centro de Ciências Biológicas e da Saúde, Universidade Federal de Mato Grosso do Sul (UFMS), Cidade Universitária, s/n, Campo Grande 79070-900 MS, Brazil
| | - Alessandra Gutierrez de Oliveira
- Laboratório de Parasitologia, Centro de Ciências Biológicas e da Saúde, Universidade Federal de Mato Grosso do Sul (UFMS), Cidade Universitária, s/n, Campo Grande 79070-900 MS, Brazil
| | - Mario Steindel
- Departamento de Microbiologia, Imunologia e Parasitologia, Universidade Federal de Santa Catarina (UFSC), Florianópolis 88040-900 SC, Brazil
| | - Manoel Barral-Netto
- Centro de Pesquisas Gonçalo Moniz (Fundação Oswaldo Cruz- FIOCRUZ). Waldemar Falcão, 121, Salvador 40296-710 BA, Brazil
| | | | - Aldina Barral
- Centro de Pesquisas Gonçalo Moniz (Fundação Oswaldo Cruz- FIOCRUZ). Waldemar Falcão, 121, Salvador 40296-710 BA, Brazil
| | - Manuel Soto
- Centro de Biología Molecular Severo Ochoa (CBMSO), Departamento de Biología Molecular, Facultad de Ciencias, CSIC-UAM, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| |
Collapse
|
12
|
Yang X, Yang S, Li Q, Wuchty S, Zhang Z. Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method. Comput Struct Biotechnol J 2019; 18:153-161. [PMID: 31969974 PMCID: PMC6961065 DOI: 10.1016/j.csbj.2019.12.005] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 11/29/2019] [Accepted: 12/10/2019] [Indexed: 12/11/2022] Open
Abstract
The identification of human-virus protein-protein interactions (PPIs) is an essential and challenging research topic, potentially providing a mechanistic understanding of viral infection. Given that the experimental determination of human-virus PPIs is time-consuming and labor-intensive, computational methods are playing an important role in providing testable hypotheses, complementing the determination of large-scale interactome between species. In this work, we applied an unsupervised sequence embedding technique (doc2vec) to represent protein sequences as rich feature vectors of low dimensionality. Training a Random Forest (RF) classifier through a training dataset that covers known PPIs between human and all viruses, we obtained excellent predictive accuracy outperforming various combinations of machine learning algorithms and commonly-used sequence encoding schemes. Rigorous comparison with three existing human-virus PPI prediction methods, our proposed computational framework further provided very competitive and promising performance, suggesting that the doc2vec encoding scheme effectively captures context information of protein sequences, pertaining to corresponding protein-protein interactions. Our approach is freely accessible through our web server as part of our host-pathogen PPI prediction platform (http://zzdlab.com/InterSPPI/). Taken together, we hope the current work not only contributes a useful predictor to accelerate the exploration of human-virus PPIs, but also provides some meaningful insights into human-virus relationships.
Collapse
Key Words
- AC, Auto Covariance
- ACC, Accuracy
- AUC, area under the ROC curve
- AUPRC, area under the PR curve
- Adaboost, Adaptive Boosting
- CT, Conjoint Triad
- Doc2vec
- Embedding
- Human-virus interaction
- LD, Local Descriptor
- MCC, Matthews correlation coefficient
- ML, machine learning
- MLP, Multiple Layer Perceptron
- MS, mass spectroscopy
- Machine learning
- PPIs, protein-protein interactions
- PR, Precision-Recall
- Prediction
- Protein-protein interaction
- RBF, radial basis function
- RF, Random Forest
- ROC, Receiver Operating Characteristic
- SGD, stochastic gradient descent
- SVM, Support Vector Machine
- Y2H, yeast two-hybrid
Collapse
Affiliation(s)
- Xiaodi Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Qinmengge Li
- National Demonstration Center for Experimental Biological Sciences Education, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Stefan Wuchty
- Dept. of Computer Science, University of Miami, Miami, FL 33146, USA
- Dept. of Biology, University of Miami, Miami, FL 33146, USA
- Center of Computational Science, University of Miami, Miami, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
13
|
Santana-Garcia W, Rocha-Acevedo M, Ramirez-Navarro L, Mbouamboua Y, Thieffry D, Thomas-Chollier M, Contreras-Moreira B, van Helden J, Medina-Rivera A. RSAT variation-tools: An accessible and flexible framework to predict the impact of regulatory variants on transcription factor binding. Comput Struct Biotechnol J 2019; 17:1415-1428. [PMID: 31871587 PMCID: PMC6906655 DOI: 10.1016/j.csbj.2019.09.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2019] [Revised: 09/22/2019] [Accepted: 09/25/2019] [Indexed: 02/06/2023] Open
Abstract
Gene regulatory regions contain short and degenerated DNA binding sites recognized by transcription factors (TFBS). When TFBS harbor SNPs, the DNA binding site may be affected, thereby altering the transcriptional regulation of the target genes. Such regulatory SNPs have been implicated as causal variants in Genome-Wide Association Study (GWAS) studies. In this study, we describe improved versions of the programs Variation-tools designed to predict regulatory variants, and present four case studies to illustrate their usage and applications. In brief, Variation-tools facilitate i) obtaining variation information, ii) interconversion of variation file formats, iii) retrieval of sequences surrounding variants, and iv) calculating the change on predicted transcription factor affinity scores between alleles, using motif scanning approaches. Notably, the tools support the analysis of haplotypes. The tools are included within the well-maintained suite Regulatory Sequence Analysis Tools (RSAT, http://rsat.eu), and accessible through a web interface that currently enables analysis of five metazoa and ten plant genomes. Variation-tools can also be used in command-line with any locally-installed Ensembl genome. Users can input personal collections of variants and motifs, providing flexibility in the analysis.
Collapse
Key Words
- Binding motifs
- CEU, Northern Europeans from Utah
- CRM, Cis-Regulatory Module
- GWAS, Genome Wide Association Studies
- LD, Linkage Disequilibrium
- MPRA, Massively Parallel Reporter Assays: MPRA
- PSSM, Position Specific Scoring Matrix
- Position specific scoring matrix
- ROC, Receiver Operating Characteristic
- RSAT, Regulatory Sequence Analysis Tools
- Regulatory variants
- SNP, Single Nucleotide Polymorphism
- SNPs
- SOIs, SNPs of Interest
- TF, Transcription Factor
- TFBS, Transcription Factor Binding Site
- Transcription factors
- eQTL, Expression Quantitative Trait Loci
- rsID, Reference SNP Identifier
Collapse
Affiliation(s)
- Walter Santana-Garcia
- Institut de Biologie de l’ENS (IBENS), Département de biologie, École normale supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Blvd Juriquilla 3001, Santiago de Querétaro 76230, Mexico
| | - Maria Rocha-Acevedo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Blvd Juriquilla 3001, Santiago de Querétaro 76230, Mexico
| | - Lucia Ramirez-Navarro
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Blvd Juriquilla 3001, Santiago de Querétaro 76230, Mexico
| | - Yvon Mbouamboua
- Fondation Congolaise pour la Recherche Médicale, Brazzaville, People’s Republic of Congo
- Aix-Marseille Univ, INSERM UMR S 1090, Theory and Approaches of Genome Complexity (TAGC), F-13288 Marseille, France
| | - Denis Thieffry
- Institut de Biologie de l’ENS (IBENS), Département de biologie, École normale supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | - Morgane Thomas-Chollier
- Institut de Biologie de l’ENS (IBENS), Département de biologie, École normale supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | | | - Jacques van Helden
- Aix-Marseille Univ, INSERM UMR S 1090, Theory and Approaches of Genome Complexity (TAGC), F-13288 Marseille, France
- CNRS, Institut Français de Bioinformatique, IFB-core, UMS 3601, Evry, France
- Corresponding authors at: Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Blvd Juriquilla 3001, Santiago de Querétaro 76230, México (Medina-Rivera). Aix-Marseille Univ, INSERM UMR S 1090, Theory and Approaches of Genome Complexity (TAGC), F-13288 Marseille, France (J. van Heldenf).
| | - Alejandra Medina-Rivera
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Blvd Juriquilla 3001, Santiago de Querétaro 76230, Mexico
- Corresponding authors at: Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Blvd Juriquilla 3001, Santiago de Querétaro 76230, México (Medina-Rivera). Aix-Marseille Univ, INSERM UMR S 1090, Theory and Approaches of Genome Complexity (TAGC), F-13288 Marseille, France (J. van Heldenf).
| |
Collapse
|
14
|
Meares C, Badran A, Dewar D. Prediction of survival after surgical management of femoral metastatic bone disease - A comparison of prognostic models. J Bone Oncol 2019; 15:100225. [PMID: 30847272 PMCID: PMC6389683 DOI: 10.1016/j.jbo.2019.100225] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2018] [Revised: 02/12/2019] [Accepted: 02/12/2019] [Indexed: 12/23/2022] Open
Abstract
Background Operative fixation for femoral metastatic bone disease is based on the principles of reducing pain and restoring function. Recent literature has proposed a number of prognostic models for appendicular metastatic bone disease. The aim of this study was to compare the accuracy of proposed soring systems in the setting of femoral metastatic bone disease in order to provide surgeons with information to determine the most appropriate scoring system in this setting. Methods A retrospective cohort analysis of patients who underwent surgical management of femoral metastatic bone disease at a single institution were included. A pre-operative predicted survival for all 114 patients was retrospectively calculated utilising the revised Katagiri model, PathFx model, SSG score, Janssen nomogram, OPTModel and SPRING 13 nomogram. Univariate and multivariate Cox regression proportional hazard models were constructed to assess the role of prognostic variables in the patient group. Area under the receiver characteristics and Brier scores were calculated for each prognostic model from comparison of predicted survival and actual survival of patients to quantify the accuracy of each model. Results For the femoral metastatic bone disease patients treated with surgical fixation, multivariate analysis demonstrated a number of pre-operative factors associated with survival in femoral metastatic bone disease, consistent with established literature. The OPTIModel demonstrated the highest accuracy at predicting 12-month (Area Under the Curve [AUC] = 0.79) and 24-month (AUC = 0.77) survival after surgical management. PathFx model was the most accurate at predicting 3-month survival (AUC = 0.70) and 6-month (AUC = 0.70) survival. The PathFx model was successfully externally validated in the femoral patient dataset for all time periods. Conclusions Among six prognostic models assessed in the setting of femoral metastatic bone disease, the present study observed the most accurate model for 3-month, 6-month, 12-month and 24-month survival. The results of this study may be utilised by the treating surgical team to determine the most accurate model for the required time period and therefore improve decision-making in the care of patients with femoral metastatic bone disease.
Collapse
Affiliation(s)
- Charles Meares
- The Bone and Joint Institute, Royal Newcastle Centre and John Hunter Hospital, Newcastle, Australia
| | | | - David Dewar
- The Bone and Joint Institute, Royal Newcastle Centre and John Hunter Hospital, Newcastle, Australia.,School of Medicine and Public Health, University of Newcastle, Newcastle, Australia
| |
Collapse
|
15
|
Yang MQ, Li D, Yang W, Zhang Y, Liu J, Tong W. A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer. Comput Struct Biotechnol J 2017; 15:463-470. [PMID: 29158875 PMCID: PMC5683705 DOI: 10.1016/j.csbj.2017.09.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Revised: 09/16/2017] [Accepted: 09/24/2017] [Indexed: 12/17/2022] Open
Abstract
Clear cell renal cell carcinoma (ccRCC) is the most common and most aggressive form of renal cell cancer (RCC). The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1, as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways.
Collapse
Key Words
- AUC, Area Under Curve
- Causative mutation
- DEG, Differentially expressed gene
- DGM, Differential gene module
- Gene module
- KEGG, Kyoto Encyclopedia of Genes and Genomes
- Pathways
- Protein-protein interaction
- RCC, Renal cell cancer
- ROC, Receiver Operating Characteristic
- SVM, Support vector machine
- TCGA, The Cancer Genome Atlas
- ccRCC
- ccRCC, Clear cell renal cell carcinoma
- eQTL
- eQTL, Expression quantitative trait loci
Collapse
Affiliation(s)
- Mary Qu Yang
- Joint Bioinformatics Graduate Program, Department of Information Science, George W. Donaghey College of Engineering and Information Technology, University of Arkansas at Little Rock, USA
- University of Arkansas for Medical Sciences, 2801 S. University Ave, Little Rock, AR 72204, USA
| | - Dan Li
- Joint Bioinformatics Graduate Program, Department of Information Science, George W. Donaghey College of Engineering and Information Technology, University of Arkansas at Little Rock, USA
- University of Arkansas for Medical Sciences, 2801 S. University Ave, Little Rock, AR 72204, USA
| | - William Yang
- School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA
| | - Yifan Zhang
- Joint Bioinformatics Graduate Program, Department of Information Science, George W. Donaghey College of Engineering and Information Technology, University of Arkansas at Little Rock, USA
- University of Arkansas for Medical Sciences, 2801 S. University Ave, Little Rock, AR 72204, USA
| | - Jun Liu
- Department of Statistics, Harvard University, Cambridge, MA 02138, USA
| | - Weida Tong
- Divisions of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA
| |
Collapse
|
16
|
Abstract
Cancer has been characterized as a heterogeneous disease consisting of many different subtypes. The early diagnosis and prognosis of a cancer type have become a necessity in cancer research, as it can facilitate the subsequent clinical management of patients. The importance of classifying cancer patients into high or low risk groups has led many research teams, from the biomedical and the bioinformatics field, to study the application of machine learning (ML) methods. Therefore, these techniques have been utilized as an aim to model the progression and treatment of cancerous conditions. In addition, the ability of ML tools to detect key features from complex datasets reveals their importance. A variety of these techniques, including Artificial Neural Networks (ANNs), Bayesian Networks (BNs), Support Vector Machines (SVMs) and Decision Trees (DTs) have been widely applied in cancer research for the development of predictive models, resulting in effective and accurate decision making. Even though it is evident that the use of ML methods can improve our understanding of cancer progression, an appropriate level of validation is needed in order for these methods to be considered in the everyday clinical practice. In this work, we present a review of recent ML approaches employed in the modeling of cancer progression. The predictive models discussed here are based on various supervised ML techniques as well as on different input features and data samples. Given the growing trend on the application of ML methods in cancer research, we present here the most recent publications that employ these techniques as an aim to model cancer risk or patient outcomes.
Collapse
Key Words
- ANN, Artificial Neural Network
- AUC, Area Under Curve
- BCRSVM, Breast Cancer Support Vector Machine
- BN, Bayesian Network
- CFS, Correlation based Feature Selection
- Cancer recurrence
- Cancer survival
- Cancer susceptibility
- DT, Decision Tree
- ES, Early Stopping algorithm
- GEO, Gene Expression Omnibus
- HTT, High-throughput Technologies
- LCS, Learning Classifying Systems
- ML, Machine Learning
- Machine learning
- NCI caArray, National Cancer Institute Array Data Management System
- NSCLC, Non-small Cell Lung Cancer
- OSCC, Oral Squamous Cell Carcinoma
- PPI, Protein–Protein Interaction
- Predictive models
- ROC, Receiver Operating Characteristic
- SEER, Surveillance, Epidemiology and End results Database
- SSL, Semi-supervised Learning
- SVM, Support Vector Machine
- TCGA, The Cancer Genome Atlas Research Network
Collapse
Affiliation(s)
- Konstantina Kourou
- Unit of Medical Technology and Intelligent Information Systems, Dept. of Materials Science and Engineering, University of Ioannina, Ioannina, Greece
| | - Themis P Exarchos
- Unit of Medical Technology and Intelligent Information Systems, Dept. of Materials Science and Engineering, University of Ioannina, Ioannina, Greece ; IMBB - FORTH, Dept. of Biomedical Research, Ioannina, Greece
| | - Konstantinos P Exarchos
- Unit of Medical Technology and Intelligent Information Systems, Dept. of Materials Science and Engineering, University of Ioannina, Ioannina, Greece
| | - Michalis V Karamouzis
- Molecular Oncology Unit, Department of Biological Chemistry, Medical School, University of Athens, Athens, Greece
| | - Dimitrios I Fotiadis
- Unit of Medical Technology and Intelligent Information Systems, Dept. of Materials Science and Engineering, University of Ioannina, Ioannina, Greece ; IMBB - FORTH, Dept. of Biomedical Research, Ioannina, Greece
| |
Collapse
|