26
|
Chen X, Sun B, Chu J, Han Z, Wang Y, Du Y, Han X, Xu P. Oxygen Vacancy-Induced Construction of CoO/h-TiO 2 Z-Scheme Heterostructures for Enhanced Photocatalytic Hydrogen Evolution. ACS APPLIED MATERIALS & INTERFACES 2022; 14:28945-28955. [PMID: 35723439 DOI: 10.1021/acsami.2c06622] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Environmentally friendly catalysts with excellent performance and low cost are critical for photocatalysis. Herein, using hydrogenated TiO2 (h-TiO2) nanosheets with enriched oxygen vacancies as the support, two-dimensional CoO/h-TiO2 Z-scheme heterostructures are fabricated for hydrogen production through photocatalytic water splitting. It is revealed that the oxygen vacancies in h-TiO2 can inhibit the oxidation of Co2+ into high-valence Co3+ during the hydrothermal reaction and thermal treatment processes. A CoO/h-TiO2 Z-scheme heterostructure possesses a space charge region and a built-in electric field at the interface, and oxygen vacancies in h-TiO2 can provide more reactive sites, which synergistically improve the separation and transportation of photogenerated carriers. As a result, the photocatalytic hydrogen evolution rate achieves 129.75 μmol·h-1 (with 50 mg of photocatalysts) on the optimized CoO/h-TiO2 heterostructures. This work provides a new design idea for the preparation of excellent TiO2-based photocatalysts.
Collapse
|
27
|
Zhang R, Cheng Z, Ding F, Hua L, Fang Y, Han Z, Shi J, Zou X, Xiao J. Improvements in chitosan-based slurry ice production and its application in precooling and storage of Pampus argenteus. Food Chem 2022; 393:133266. [PMID: 35653987 DOI: 10.1016/j.foodchem.2022.133266] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 05/10/2022] [Accepted: 05/18/2022] [Indexed: 11/30/2022]
Abstract
The effects of microbubbles in chitosan-based slurry ice production were investigated, and the efficiency of chitosan-based slurry ice was evaluated for silver pomfret (Pampus argenteus) precooling and storage at 0 °C. Microbubbles generated though agitation accelerated slurry ice production by promoting ice nucleation and eliminating supercooling. Higher bubble counts improved freezing, but overly large bubbles reduced the performance. The rheological properties of chitosan solutions were also investigaed, and solutions with higher viscosity formed more bubbles. Experiments investigating precooling rates, microbial concentrations, pH, thiobarbituric-acid-reactive substances, and total volatile basic nitrogen all confirmed that chitosan-based slurry ice had higher performance than flake ice or conventional slurry ice. Chitosan-based slurry ice can be used for precooling in the fish industry to reduce energy consumption, accelerate precooling, reduce microbial growth, and improve shelf life.
Collapse
|
28
|
Sharpnack MF, Johnson TS, Chalkley R, Han Z, Carbone D, Huang K, He K. TSAFinder: exhaustive tumor-specific antigen detection with RNAseq. Bioinformatics 2022; 38:2422-2427. [PMID: 35191489 PMCID: PMC11020248 DOI: 10.1093/bioinformatics/btac116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 01/10/2022] [Accepted: 02/19/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Tumor-specific antigen (TSA) identification in human cancer predicts response to immunotherapy and provides targets for cancer vaccine and adoptive T-cell therapies with curative potential, and TSAs that are highly expressed at the RNA level are more likely to be presented on major histocompatibility complex (MHC)-I. Direct measurements of the RNA expression of peptides would allow for generalized prediction of TSAs. Human leukocyte antigen (HLA)-I genotypes were predicted with seq2HLA. RNA sequencing (RNAseq) fastq files were translated into all possible peptides of length 8-11, and peptides with high and low expressions in the tumor and control samples, respectively, were tested for their MHC-I binding potential with netMHCpan-4.0. RESULTS A novel pipeline for TSA prediction from RNAseq was used to predict all possible unique peptides size 8-11 on previously published murine and human lung and lymphoma tumors and validated on matched tumor and control lung adenocarcinoma (LUAD) samples. We show that neoantigens predicted by exomeSeq are typically poorly expressed at the RNA level, and a fraction is expressed in matched normal samples. TSAs presented in the proteomics data have higher RNA abundance and lower MHC-I binding percentile, and these attributes are used to discover high confidence TSAs within the validation cohort. Finally, a subset of these high confidence TSAs is expressed in a majority of LUAD tumors and represents attractive vaccine targets. AVAILABILITY AND IMPLEMENTATION The datasets were derived from sources in the public domain as follows: TSAFinder is open-source software written in python and R. It is licensed under CC-BY-NC-SA and can be downloaded at https://github.com/RNAseqTSA. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
29
|
Jia H, Chen X, Han Z, Liu B, Wen T, Tang Y. Nonconvex Nonlocal Tucker Decomposition for 3D Medical Image Super-Resolution. Front Neuroinform 2022; 16:880301. [PMID: 35547860 PMCID: PMC9083114 DOI: 10.3389/fninf.2022.880301] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Accepted: 03/14/2022] [Indexed: 11/13/2022] Open
Abstract
Limited by hardware conditions, imaging devices, transmission efficiency, and other factors, high-resolution (HR) images cannot be obtained directly in clinical settings. It is expected to obtain HR images from low-resolution (LR) images for more detailed information. In this article, we propose a novel super-resolution model for single 3D medical images. In our model, nonlocal low-rank tensor Tucker decomposition is applied to exploit the nonlocal self-similarity prior knowledge of data. Different from the existing methods that use a convex optimization for tensor Tucker decomposition, we use a tensor folded-concave penalty to approximate a nonlocal low-rank tensor. Weighted 3D total variation (TV) is used to maintain the local smoothness across different dimensions. Extensive experiments show that our method outperforms some state-of-the-art (SOTA) methods on different kinds of medical images, including MRI data of the brain and prostate and CT data of the abdominal and dental.
Collapse
|
30
|
Shao W, Luo X, Zhang Z, Han Z, Chandrasekaran V, Turzhitsky V, Bali V, Roberts AR, Metzger M, Baker J, La Rosa C, Weaver J, Dexter P, Huang K. Application of unsupervised deep learning algorithms for identification of specific clusters of chronic cough patients from EMR data. BMC Bioinformatics 2022; 23:140. [PMID: 35439945 PMCID: PMC9019947 DOI: 10.1186/s12859-022-04680-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Chronic cough affects approximately 10% of adults. The lack of ICD codes for chronic cough makes it challenging to apply supervised learning methods to predict the characteristics of chronic cough patients, thereby requiring the identification of chronic cough patients by other mechanisms. We developed a deep clustering algorithm with auto-encoder embedding (DCAE) to identify clusters of chronic cough patients based on data from a large cohort of 264,146 patients from the Electronic Medical Records (EMR) system. We constructed features using the diagnosis within the EMR, then built a clustering-oriented loss function directly on embedded features of the deep autoencoder to jointly perform feature refinement and cluster assignment. Lastly, we performed statistical analysis on the identified clusters to characterize the chronic cough patients compared to the non-chronic cough patients. RESULTS The experimental results show that the DCAE model generated three chronic cough clusters and one non-chronic cough patient cluster. We found various diagnoses, medications, and lab tests highly associated with chronic cough patients by comparing the chronic cough cluster with the non-chronic cough cluster. Comparison of chronic cough clusters demonstrated that certain combinations of medications and diagnoses characterize some chronic cough clusters. CONCLUSIONS To the best of our knowledge, this study is the first to test the potential of unsupervised deep learning methods for chronic cough investigation, which also shows a great advantage over existing algorithms for patient data clustering.
Collapse
|
31
|
Zhu L, Huang Q, Li X, Jin B, Ding Y, Chou CJ, Su KJ, Zhang Y, Chen X, Hwa KY, Thyparambil S, Liao W, Han Z, Mortensen R, Jin Y, Li Z, Schilling J, Li Z, Sylvester KG, Sun X, Ling XB. Serological Phenotyping Analysis Uncovers a Unique Metabolomic Pattern Associated With Early Onset of Type 2 Diabetes Mellitus. Front Mol Biosci 2022; 9:841209. [PMID: 35463946 PMCID: PMC9024215 DOI: 10.3389/fmolb.2022.841209] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Accepted: 03/14/2022] [Indexed: 12/12/2022] Open
Abstract
Background: Type 2 diabetes mellitus (T2DM) is a multifaceted disorder affecting epidemic proportion at global scope. Defective insulin secretion by pancreatic β-cells and the inability of insulin-sensitive tissues to respond effectively to insulin are the underlying biology of T2DM. However, circulating biomarkers indicative of early diabetic onset at the asymptomatic stage have not been well described. We hypothesized that global and targeted mass spectrometry (MS) based metabolomic discovery can identify novel serological metabolic biomarkers specifically associated with T2DM. We further hypothesized that these markers can have a unique pattern associated with latent or early asymptomatic stage, promising an effective liquid biopsy approach for population T2DM risk stratification and screening. Methods: Four independent cohorts were assembled for the study. The T2DM cohort included sera from 25 patients with T2DM and 25 healthy individuals for the biomarker discovery and sera from 15 patients with T2DM and 15 healthy controls for the testing. The Pre-T2DM cohort included sera from 76 with prediabetes and 62 healthy controls for the model training and sera from 35 patients with prediabetes and 27 healthy controls for the model testing. Both global and targeted (amino acid, acylcarnitine, and fatty acid) approaches were used to deep phenotype the serological metabolome by high performance liquid chromatography-high resolution mass spectrometry. Different machine learning approaches (Random Forest, XGBoost, and ElasticNet) were applied to model the unique T2DM/Pre-T2DM metabolic patterns and contrasted with their effectiness to differentiate T2DM/Pre-T2DM from controls. Results: The univariate analysis identified unique panel of metabolites (n = 22) significantly associated with T2DM. Global metabolomics and subsequent structure determination led to the identification of 8 T2DM biomarkers while targeted LCMS profiling discovered 14 T2DM biomarkers. Our panel can effectively differentiate T2DM (ROC AUC = 1.00) or Pre-T2DM (ROC AUC = 0.84) from the controls in the respective testing cohort. Conclusion: Our serological metabolite panel can be utilized to identifiy asymptomatic population at risk of T2DM, which may provide utility in identifying population at risk at an early stage of diabetic development to allow for clinical intervention. This early detection would guide ehanced levels of care and accelerate development of clinical strategies to prevent T2DM.
Collapse
|
32
|
Han Z, Yu S, Lin SB, Zhou DX. Depth Selection for Deep ReLU Nets in Feature Extraction and Generalization. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:1853-1868. [PMID: 33079656 DOI: 10.1109/tpami.2020.3032422] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Deep learning is recognized to be capable of discovering deep features for representation learning and pattern recognition without requiring elegant feature engineering techniques by taking advantages of human ingenuity and prior knowledge. Thus it has triggered enormous research activities in machine learning and pattern recognition. One of the most important challenges of deep learning is to figure out relations between a feature and the depth of deep neural networks (deep nets for short) to reflect the necessity of depth. Our purpose is to quantify this feature-depth correspondence in feature extraction and generalization. We present the adaptivity of features to depths and vice-verse via showing a depth-parameter trade-off in extracting both single feature and composite features. Based on these results, we prove that implementing the classical empirical risk minimization on deep nets can achieve the optimal generalization performance for numerous learning tasks. Our theoretical results are verified by a series of numerical experiments including toy simulations and a real application of earthquake seismic intensity prediction.
Collapse
|
33
|
Lu S, Fang J, Li X, Cao L, Zhou J, Guo Q, Liang Z, Cheng Y, Jiang L, Yang N, Han Z, Shi J, Chen Y, Xu H, Zhang H, Chen G, Ma R, Sun S, Fan Y, Weiguo S. 2MO Final OS results and subgroup analysis of savolitinib in patients with MET exon 14 skipping mutations (METex14+) NSCLC. Ann Oncol 2022. [DOI: 10.1016/j.annonc.2022.02.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
34
|
Han Z, Gangwar L, Magnuson E, Etheridge ML, Bischof JC, Choi J, Pringle CO. Supplemented phase diagrams for vitrification CPA cocktails: DP6, VS55 and M22. Cryobiology 2022; 106:113-121. [PMID: 35276219 DOI: 10.1016/j.cryobiol.2022.02.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Revised: 02/24/2022] [Accepted: 02/25/2022] [Indexed: 11/03/2022]
Abstract
DP6, VS55 and M22 are the most commonly used cryoprotective agent (CPA) cocktails for vitrification experiments in tissues and organs. However, complete phase diagrams for the three CPAs are often unavailable or incomplete (only available for full strength CPAs) thereby hampering optimization of vitrification and rewarming procedures. In this paper, we used differential scanning calorimetry (DSC) to measure the transition temperatures including heterogeneous nucleation temperatures (Thet), glass transition temperatures (Tg), rewarming phase crystallization (devitrification and/or recrystallization) temperatures (Td) and melting temperatures (Tm) while cooling or warming the CPA sample at 5 °C/min and plotted the obtained transition temperatures for different concentrations of CPAs into the phase diagrams. We also used cryomicroscopy cooling or warming the sample at the same rate to record the ice crystallization during the whole process, and we presented the cryomicroscopic images at the transition temperatures, which agreed with the DSC presented phenomena.
Collapse
|
35
|
Thyparambil SP, Zhu X, Zhang Y, Sun H, Peng J, Cai S, Li Y, Fu C, Bao P, Hao S, Li Z, Ding Y, Yao X, Liao WL, Heaton R, Han Z, Tian L, Schilling J, Sylvester KG, Ling X. Deviation from the precisely timed age-associated patterns revealed by blood metabolomics to find CRC patients at risk of relapse at the CRC diagnosis. J Clin Oncol 2022. [DOI: 10.1200/jco.2022.40.4_suppl.206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
206 Background: Human serum metabolome profiles have been analyzed to explore the molecular changes that occur with aging. We hypothesized that deep metabolic profiling of sera with different ages would allow the identification of distinct metabolic chronologic patterns as a normal biological baseline to study personal aging. We further hypothesized that metabolic assessment of this chronologic deviation, resulting from advanced precancerous lesion (APL) and stage I/II/III CRC, from the normal reference baseline, would be instrumental for prognosis of relapse revealing underlying pathophysiology. Methods: A cohort of normal (n=3,616, training; n=1,170, testing), 631 advanced adenoma, 1,019 stage I, 404 stage II and 417 stage III serum samples were assembled. Innovative global LCMS metabolomic production were applied to deep profile these subjects. Identification of the age-associated molecular patterns in normal subjects, modeled with an elastic net algorithm, established the reference baseline to mirror a metabolic clock. CRC associated deviation from the precise chronologically paced metabolic patterns was quantified to associate the clinical endpoints of relapse, OSF and PFS, and to identify the tightly associated metabolic pathways. Results: We observed that for those CRCs, the predicted metabolic age can differ from the chronological age with consistent variations, resulting “older” or “younger” metabolic age subgroup in reference to the chronological age. Significant disruptions from the normal baseline were observed in CRCs patients, and consistent stage specific patterns were observed. Outlier, “Older” or “younger” metabolic age subgroup, CRC patients were found with significant future relapse enrichment. Predictive models were derived to case find the patients at risk of future relapse at the CRC diagnosis timepoint. Conclusions: Deviations from the meticulously timed metabolic aging patterns may provide utility to allow prognosis of future clinical endpoints of relapse and overall survival. Close examination of the underlying metabolic pathways, associated with CRC stage specific metabolic patterns, disrupting the baseline ageotypes, not only may improve the sensitivity and specificity of prognostic tests of CRC relapse, but also shed new insights into CRC therapeutics.
Collapse
|
36
|
Han Z, Zhang B, Li T. Differential Gene Expression Analysis of Carboplatin Treatment for Oral Squamous Cell Carcinoma. Indian J Pharm Sci 2022. [DOI: 10.36468/pharmaceutical-sciences.spl.495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
37
|
Chen X, Sun B, Han Z, Wang Y, Han X, Xu P. Ultrathin tungsten-doped hydrogenated titanium dioxide nanosheets for solar-driven hydrogen evolution. Inorg Chem Front 2022. [DOI: 10.1039/d2qi00978a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Ultrathin tungsten-doped hydrogenated TiO2 (W-h-TiO2) nanosheets are highly efficient for photocatalytic hydrogen production by water splitting without a noble metal cocatalyst.
Collapse
|
38
|
Shao W, Wang T, Huang Z, Han Z, Zhang J, Huang K. Weakly Supervised Deep Ordinal Cox Model for Survival Prediction From Whole-Slide Pathological Images. IEEE TRANSACTIONS ON MEDICAL IMAGING 2021; 40:3739-3747. [PMID: 34264823 DOI: 10.1109/tmi.2021.3097319] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Whole-Slide Histopathology Image (WSI) is generally considered the gold standard for cancer diagnosis and prognosis. Given the large inter-operator variation among pathologists, there is an imperative need to develop machine learning models based on WSIs for consistently predicting patient prognosis. The existing WSI-based prediction methods do not utilize the ordinal ranking loss to train the prognosis model, and thus cannot model the strong ordinal information among different patients in an efficient way. Another challenge is that a WSI is of large size (e.g., 100,000-by-100,000 pixels) with heterogeneous patterns but often only annotated with a single WSI-level label, which further complicates the training process. To address these challenges, we consider the ordinal characteristic of the survival process by adding a ranking-based regularization term on the Cox model and propose a weakly supervised deep ordinal Cox model (BDOCOX) for survival prediction from WSIs. Here, we generate amounts of bags from WSIs, and each bag is comprised of the image patches representing the heterogeneous patterns of WSIs, which is assumed to match the WSI-level labels for training the proposed model. The effectiveness of the proposed method is well validated by theoretical analysis as well as the prognosis and patient stratification results on three cancer datasets from The Cancer Genome Atlas (TCGA).
Collapse
|
39
|
Koenig J, Cagney D, Huynh E, Boyle S, Lee H, Williams C, Han Z, Leeman J, Mak R, Mancias J, Singer L. Target Coverage, Organ at Risk Metrics, and Tumor Control for Metastases to the Pancreas Treated With Adaptive MR-Guided Stereotactic Body Radiation Therapy. Int J Radiat Oncol Biol Phys 2021. [DOI: 10.1016/j.ijrobp.2021.07.1327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
40
|
Roberts H, Shin K, Catalano P, Huynh E, Williams C, Han Z, Vastola M, Ampofo N, Leeman J, Mamon H, Mancias J, Lam M, Martin N, Huynh M, Mak R, Singer L, Cagney D. A Prospective Clinical Trial Evaluating Stereotactic Magnetic Resonance Guided Adaptive Radiation Therapy (SMART) for Pancreatic Cancer. Int J Radiat Oncol Biol Phys 2021. [DOI: 10.1016/j.ijrobp.2021.07.429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
41
|
Roberts H, Huynh E, Williams C, Han Z, Vastola M, Ampofo N, Leeman J, Mamon H, Mancias J, Lam M, Martin N, Huynh M, Mak R, Singer L, Cagney D. Impact of Stereotactic MR-Guided Adaptive Radiation Therapy on Early Clinical and Dosimetric Outcomes in Patients With Pancreatic Cancer. Int J Radiat Oncol Biol Phys 2021. [DOI: 10.1016/j.ijrobp.2021.07.430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
42
|
Yang D, Brennan V, Huynh E, Williams C, Han Z, Ampofo N, Vastola M, Leeman J, Mak R, Singer L, Cagney D, Huynh M. Stereotactic Magnetic Resonance Guided Adaptive Radiation Therapy for Abdominopelvic Metastases. Int J Radiat Oncol Biol Phys 2021. [DOI: 10.1016/j.ijrobp.2021.07.1339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
43
|
Luo X, Gandhi P, Storey S, Zhang Z, Han Z, Huang K. A Computational Framework to Analyze the Associations Between Symptoms and Cancer Patient Attributes Post Chemotherapy Using EHR Data. IEEE J Biomed Health Inform 2021; 25:4098-4109. [PMID: 34613922 DOI: 10.1109/jbhi.2021.3117238] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Patients with cancer, such as breast and colorectal cancer, often experience different symptoms post-chemotherapy. The symptoms could be fatigue, gastrointestinal (nausea, vomiting, lack of appetite), psychoneurological symptoms (depressive symptoms, anxiety), or other types. Previous research focused on understanding the symptoms using survey data. In this research, we propose to utilize the data within the Electronic Health Record (EHR). A computational framework is developed to use a natural language processing (NLP) pipeline to extract the clinician-documented symptoms from clinical notes. Then, a patient clustering method is based on the symptom severity levels to group the patient in clusters. The association rule mining is used to analyze the associations between symptoms and patient attributes (smoking history, number of comorbidities, diabetes status, age at diagnosis) in the patient clusters. The results show that the various symptom types and severity levels have different associations between breast and colorectal cancers and different timeframes post-chemotherapy. The results also show that patients with breast or colorectal cancers, who smoke and have severe fatigue, likely have severe gastrointestinal symptoms six months after the chemotherapy. Our framework can be generalized to analyze symptoms or symptom clusters of other chronic diseases where symptom management is critical.
Collapse
|
44
|
Wang J, Wang Z, Wu L, Li B, Cheng Y, Li X, Wang X, Han L, Wu X, Fan Y, Yu Y, Lv D, Shi J, Huang J, Zhou S, Han B, Sun G, Guo Q, Ji Y, Zhu X, Hu S, Zhang W, Wang Q, Jia Y, Wang Z, Song Y, Wu J, Shi M, Li X, Han Z, Liu Y, Yu Z, Liu A, Wang X, Zhou C, Zhong D, Miao L, Zhang Z, Zhao H, Yang J, Wang D, Wang Y, Li Q, Zhang X, Ji M, Yang Z, Cui J, Gao B, Wang B, Liu H, Nie L, He M, Jin S, Gu W, Shu Y, Zhou T, Feng J, Yang X, Huang C, Zhu B, Yao Y, Wang Y, Kang X, Yao S, Keegan P. MA13.08 CHOICE-01: A Phase 3 Study of Toripalimab Versus Placebo in Combination With First-Line Chemotherapy for Advanced NSCLC. J Thorac Oncol 2021. [DOI: 10.1016/j.jtho.2021.08.181] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
45
|
Luo X, Gandhi P, Zhang Z, Shao W, Han Z, Chandrasekaran V, Turzhitsky V, Bali V, Roberts AR, Metzger M, Baker J, La Rosa C, Weaver J, Dexter P, Huang K. Applying interpretable deep learning models to identify chronic cough patients using EHR data. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 210:106395. [PMID: 34525412 DOI: 10.1016/j.cmpb.2021.106395] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 08/30/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Chronic cough (CC) affects approximately 10% of adults. Many disease states are associated with chronic cough, such as asthma, upper airway cough syndrome, bronchitis, and gastroesophageal reflux disease. The lack of an ICD code specific for chronic cough makes it challenging to identify such patients from electronic health records (EHRs). For clinical and research purposes, computational methods using EHR data are urgently needed to identify chronic cough cases. This research aims to investigate the data representations and deep learning algorithms for chronic cough prediction. METHODS Utilizing real-world EHR data from a large academic healthcare system from October 2005 to September 2015, we investigated Natural Language Representation of the EHR data and systematically evaluated deep learning and traditional machine learning models to predict chronic cough patients. We built these machine learning models using structured data (medication and diagnosis) and unstructured data (clinical notes). RESULTS The sensitivity and specificity of a transformer-based deep learning algorithm, specifically BERT with attention model, was 0.856 and 0.866, respectively, using structured data (medication and diagnosis). Sensitivity and specificity improved to 0.952 and 0.930 when we combined structured data with symptoms extracted from clinical notes. We further found that the attention mechanism of deep learning models can be used to extract important features that drive the prediction decisions. Compared with our previously published rule-based algorithm, the deep learning algorithm can identify more chronic cough patients with structured data. CONCLUSIONS By applying deep learning models, chronic cough patients can be reliably identified for prospective or retrospective research through medication and diagnosis data, widely available in EHR and electronic claims data, thus improving the generalizability of the patient identification algorithm. Deep learning models can identify chronic cough patients with even higher sensitivity and specificity when structured and unstructured EHR data are utilized. We anticipate language-based data representation and deep learning models developed in this research could also be productively used for other disease prediction and case identification.
Collapse
|
46
|
Lu Z, Zhan X, Wu Y, Cheng J, Shao W, Ni D, Han Z, Zhang J, Feng Q, Huang K. BrcaSeg: A Deep Learning Approach for Tissue Quantification and Genomic Correlations of Histopathological Images. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:1032-1042. [PMID: 34280546 PMCID: PMC9403022 DOI: 10.1016/j.gpb.2020.06.026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Revised: 12/09/2019] [Accepted: 08/09/2020] [Indexed: 11/25/2022]
Abstract
Epithelial and stromal tissues are components of the tumor microenvironment and play a major role in tumor initiation and progression. Distinguishing stroma from epithelial tissues is critically important for spatial characterization of the tumor microenvironment. Here, we propose BrcaSeg, an image analysis pipeline based on a convolutional neural network (CNN) model to classify epithelial and stromal regions in whole-slide hematoxylin and eosin (H&E) stained histopathological images. The CNN model is trained using well-annotated breast cancer tissue microarrays and validated with images from The Cancer Genome Atlas (TCGA) Program. BrcaSeg achieves a classification accuracy of 91.02%, which outperforms other state-of-the-art methods. Using this model, we generate pixel-level epithelial/stromal tissue maps for 1000 TCGA breast cancer slide images that are paired with gene expression data. We subsequently estimate the epithelial and stromal ratios and perform correlation analysis to model the relationship between gene expression and tissue ratios. Gene Ontology (GO) enrichment analyses of genes that are highly correlated with tissue ratios suggest that the same tissue is associated with similar biological processes in different breast cancer subtypes, whereas each subtype also has its own idiosyncratic biological processes governing the development of these tissues. Taken all together, our approach can lead to new insights in exploring relationships between image-based phenotypes and their underlying genomic events and biological processes for all types of solid tumors. BrcaSeg can be accessed at https://github.com/Serian1992/ImgBio.
Collapse
|
47
|
Liu M, Xie J, Tan C, Ruan X, Wang Z, Luo X, Lin J, Xiang L, Li A, Han Z, Liu S. [Japan narrow-band imaging Expert Team type 2B colorectal cancer: consistency between endoscopic prediction and pathological diagnosis]. NAN FANG YI KE DA XUE XUE BAO = JOURNAL OF SOUTHERN MEDICAL UNIVERSITY 2021; 41:942-946. [PMID: 34238749 DOI: 10.12122/j.issn.1673-4254.2021.06.19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
OBJECTIVE To explore the potential factors that affect the accuracy of endoscopic diagnosis for Japan narrow-band imaging (NBI) Expert Team (JNET) type 2B colorectal lesions. OBJECTIVE The clinical data were collected from 261 patients with JNET type 2B colorectal lesions diagnosed in Nanfang Hospital between July, 2018 and July, 2021. We analyzed the macroscopic type, size, location or pit pattern classification of the lesions for their potential influence of the diagnostic accuracy of JNET type 2B lesions. OBJECTIVE The 261 lesions included 91 low-grade intramucosal neoplasia lesions (34.9%), 132 high-grade intramucosal neoplasia lesions (50.6%), 13 submucosal invasive cancer lesions (5.0%), and 25 deep submucosal invasive cancer lesions (9.6%). The coincidence rate between endoscopic prediction and pathological diagnosis of these lesions was 55.6% (145/ 261). The macroscopic type and size of the lesions were significantly associated with the diagnostic accuracy of JNET type 2B lesions (P < 0.001). There was a significant difference in the diagnostic accuracy among the lesions with different pit pattern types (P < 0.001). OBJECTIVE Both the macroscopic type and size affect the accuracy of endoscopic diagnosis of JNET type 2B colorectal lesions. JNET classification combined with pit pattern types can have better accuracy in predicting the pathological diagnosis of these lesions.
Collapse
|
48
|
Sharpnack M, Johnson T, Chalkley R, Han Z, Carbone D, Huang K, He K. Abstract 238: Exhaustive tumor specific antigen detection with RNAseq. Cancer Res 2021. [DOI: 10.1158/1538-7445.am2021-238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Tumor specific antigen (TSA) identification in human cancer predicts response to immunotherapy and provides vaccine targets for precision medicine. In addition to neoantigens from somatic coding mutations, numerous non-mutated TSAs can elicit T-cell responses but are often overlooked by current methods. We present a method that accurately and comprehensively predict TSAs from RNAseq data regardless of mutation status.
Methods: HLA-I genotypes were predicted with seq2HLA. RNAseq fastq files were translated into all possible peptides of length 8-11, and peptides with high expression in the tumor and comparatively low expression in normal were tested for their MHC-I binding potential with netMHCpan-4.0. We defined our predicted TSA by i) high expression in tumor samples, ii) low expression in normal samples, and iii) high predicted patient-specific MHC-I binding affinity.
Results: We developed a novel pipeline for TSA prediction from RNAseq that is not limited to mutation-derived TSAs. This pipeline was used to predict all possible unique peptides size 8-11 on previously published murine and human lung and lymphoma tumors then validated on matched tumor and control lung adenocarcinoma (LUAD) samples. This pipeline is able to predict TSAs in MHC-I ligand-purified proteomics data with favorable performance to existing methods. Furthermore, neoantigens predicted by exomeSeq are typically poorly expressed at the RNA level, (28% of predicted neoantigens with >0 expression, mean of 15.6 reads/sample) and a fraction of them (47/6,928, 0.68%) are expressed in matched normal samples. Finally, a set of 6 TSAs are expressed in 22/39 (56%) of LUAD tumors and represent attractive vaccine targets.
Conclusion: Direct quantification of RNAseq evidence of the potential peptidome in matched tumor and control RNAseq samples, via our novel pipeline, allows for exhaustive detection of TSAs.
Citation Format: Michael Sharpnack, Travis Johnson, Robert Chalkley, Zhi Han, David Carbone, Kun Huang, Kai He. Exhaustive tumor specific antigen detection with RNAseq [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 238.
Collapse
|
49
|
Han Z, Ding J, Cheng X, Hsieh YL, Wang CJ, Wang JY, Yang JM, Cong N, Chi FL. SGN nerve filaments develop synapses with IHCs earlier than with OHCs in C57BL/6 mouse inner ear. EUROPEAN REVIEW FOR MEDICAL AND PHARMACOLOGICAL SCIENCES 2021; 24:11496-11508. [PMID: 33275216 DOI: 10.26355/eurrev_202011_23791] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
OBJECTIVE To explore the connections between hair cells and spiral ganglion neurons (SGNs) during the development of the C57BL/6 mouse inner ear. MATERIALS AND METHODS The specimens of C57BL/6 mouse inner ear, from E15 (embryo day 15) to adult mouse, were collected; immunohistochemistry was employed to explore the frozen sections of specimens. RESULTS The development of cochlea starts sequentially from the basal turn to the apex turn. Morphological development of SGNs occurs mainly from E16 to P12 (postnatal day 12). Hair cells appear from E18 to P12, and inner hair cells (IHCs) develop earlier than outer hair cells (OHCs). The connections between hair cells and SGNs begin to develop during E18-P1, morphologically resemble mature synapses during P8-P12, and completely mature in adult mice. CONCLUSIONS The genesis of auditory ribbon synapse occurs from E18 to P1. Synchronized with the development of SGNs and hair cells, the functional filaments remain connected to hair cells, while the spare ones get disconnected from the surface of hair cells. Connections between SGN nerve filaments and IHCs occur earlier than those between SGN nerve filaments and OHCs.
Collapse
|
50
|
Hu YN, Zhan JT, Zhu QF, Hu T, An N, Zhou Z, Liang Y, Wang W, Han Z, Wang J, Xu FQ, Feng YQ. A mathematical method for calibrating the signal drift in liquid chromatography - mass spectrometry analysis. Talanta 2021; 233:122511. [PMID: 34215126 DOI: 10.1016/j.talanta.2021.122511] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 05/06/2021] [Accepted: 05/08/2021] [Indexed: 01/06/2023]
Abstract
Liquid chromatography coupled with high-resolution mass spectrometry (LC-HRMS) has become the most versatile analytical tool for profiling small-molecule compounds and increasingly been applied in many fields. Nevertheless, LC-MS based quantification still face some challenges, such as signal drift in LC-MS, which may affect the validity of the obtained data and lead to misinterpretation of biological results. Here, we established a calibration method known as "RIM" to compensate the signal drift of LC-MS. To this end, a mixture of d4-2-dimethylaminoethylamine (d4-DMED)-coded normal fatty acids (C5-C23) was used as calibrants to construct RIM calibration. With the addition of calibrants, not only the MS signal drift, but also the mass accuracy and LC retention time can be calibrated, thereby improving the reliability of quantitative data. The effectiveness of RIM was carefully validated using a human serum extract spiked with 34 standards and then RIM was applied for rat brain untargeted metabolome research. In addition, to expand the functionality and flexibility of RIM for data handling, we generated a MATLAB-based RIM program, which implements the above concepts and allows automatic data process. Taken together, the proposed RIM method has potential application in large-scale quantitative study of complex samples.
Collapse
|