1
|
Agrawal S, Ransom RF, Saraswathi S, Garcia-Gonzalo E, Webb A, Fernandez-Martinez JL, Popovic M, Guess AJ, Kloczkowski A, Benndorf R, Sadee W, Smoyer WE. Sulfatase 2 Is Associated with Steroid Resistance in Childhood Nephrotic Syndrome. J Clin Med 2021; 10:523. [PMID: 33540508 PMCID: PMC7867139 DOI: 10.3390/jcm10030523] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/20/2021] [Accepted: 01/23/2021] [Indexed: 01/17/2023] Open
Abstract
Glucocorticoid (GC) resistance complicates the treatment of ~10-20% of children with nephrotic syndrome (NS), yet the molecular basis for resistance remains unclear. We used RNAseq analysis and in silico algorithm-based approaches on peripheral blood leukocytes from 12 children both at initial NS presentation and after ~7 weeks of GC therapy to identify a 12-gene panel able to differentiate steroid resistant NS (SRNS) from steroid-sensitive NS (SSNS). Among this panel, subsequent validation and analyses of one biologically relevant candidate, sulfatase 2 (SULF2), in up to a total of 66 children, revealed that both SULF2 leukocyte expression and plasma arylsulfatase activity Post/Pre therapy ratios were greater in SSNS vs. SRNS. However, neither plasma SULF2 endosulfatase activity (measured by VEGF binding activity) nor plasma VEGF levels, distinguished SSNS from SRNS, despite VEGF's reported role as a downstream mediator of SULF2's effects in glomeruli. Experimental studies of NS-related injury in both rat glomeruli and cultured podocytes also revealed decreased SULF2 expression, which were partially reversible by GC treatment of podocytes. These findings together suggest that SULF2 levels and activity are associated with GC resistance in NS, and that SULF2 may play a protective role in NS via the modulation of downstream mediators distinct from VEGF.
Collapse
Affiliation(s)
- Shipra Agrawal
- Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
| | - Richard F. Ransom
- Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
| | - Saras Saraswathi
- Battelle Center for Mathematical Medicine at Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA;
| | | | - Amy Webb
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
| | | | - Milan Popovic
- Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
| | - Adam J. Guess
- Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
| | - Andrzej Kloczkowski
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
- Battelle Center for Mathematical Medicine at Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA;
| | - Rainer Benndorf
- Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
| | - Wolfgang Sadee
- Department of Cancer Biology and Genetics, Center for Pharmacogenomics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
| | - William E. Smoyer
- Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
| | | |
Collapse
|
2
|
Roisman A, Castellano G, Navarro A, Gonzalez-Farre B, Pérez-Galan P, Esteve-Codina A, Dabad M, Heath S, Gut M, Bosio M, Bellot P, Salembier P, Oliveras A, Slavutsky I, Magnano L, Horn H, Rosenwald A, Ott G, Aymerich M, López-Guillermo A, Jares P, Martín-Subero JI, Campo E, Hernández L. Differential expression of long non-coding RNAs are related to proliferation and histological diversity in follicular lymphomas. Br J Haematol 2018; 184:373-383. [DOI: 10.1111/bjh.15656] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 09/11/2018] [Indexed: 01/03/2023]
Affiliation(s)
- Alejandro Roisman
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Laboratorio de Genética de Neoplasias Linfoides; Instituto de Medicina Experimental; CONICET-Academia Nacional de Medicina; Buenos Aires Argentina
| | | | - Alba Navarro
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Centro de Investigación Biomédica en Red de Cáncer (CIBERONC); Barcelona Spain
| | - Blanca Gonzalez-Farre
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Department of Pathology; Hospital Clínic; University of Barcelona; Barcelona Spain
| | - Patricia Pérez-Galan
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
| | - Anna Esteve-Codina
- CNAG-CRG; Centre for Genomic Regulation (CRG); Barcelona Institute of Science and Technology (BIST); Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
| | - Marc Dabad
- CNAG-CRG; Centre for Genomic Regulation (CRG); Barcelona Institute of Science and Technology (BIST); Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
| | - Simon Heath
- CNAG-CRG; Centre for Genomic Regulation (CRG); Barcelona Institute of Science and Technology (BIST); Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
| | - Marta Gut
- CNAG-CRG; Centre for Genomic Regulation (CRG); Barcelona Institute of Science and Technology (BIST); Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
| | - Mattia Bosio
- Barcelona Supercomputing Center; Barcelona Spain
| | - Pau Bellot
- Department of Signal Theory and Communications; Technical University of Catalonia UPC; Barcelona Spain
| | - Philippe Salembier
- Department of Signal Theory and Communications; Technical University of Catalonia UPC; Barcelona Spain
| | - Albert Oliveras
- Department of Signal Theory and Communications; Technical University of Catalonia UPC; Barcelona Spain
| | - Irma Slavutsky
- Laboratorio de Genética de Neoplasias Linfoides; Instituto de Medicina Experimental; CONICET-Academia Nacional de Medicina; Buenos Aires Argentina
| | - Laura Magnano
- Department of Haematology; Hospital Clínic of Barcelona; Barcelona Spain
| | - Heike Horn
- Dr. M. Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart and University of Tübingen; Stuttgart Germany
| | | | - German Ott
- Department of Clinical Pathology; Robert-Bosch Krankenhaus; Stuttgart Germany
| | - Marta Aymerich
- Haematopathology Unit; Department of Pathology; Hospital Clínic; IDIBAPS; Barcelona Spain
| | | | - Pedro Jares
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Department of Pathology; Hospital Clínic; University of Barcelona; Barcelona Spain
| | - José I. Martín-Subero
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Centro de Investigación Biomédica en Red de Cáncer (CIBERONC); Barcelona Spain
| | - Elías Campo
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Centro de Investigación Biomédica en Red de Cáncer (CIBERONC); Barcelona Spain
- Haematopathology Unit; Department of Pathology; Hospital Clínic; IDIBAPS; Barcelona Spain
| | - Luis Hernández
- Lymphoid Neoplasm Programme; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS); Barcelona Spain
- Centro de Investigación Biomédica en Red de Cáncer (CIBERONC); Barcelona Spain
| |
Collapse
|
3
|
Liu KH, Zeng ZH, Ng VTY. A Hierarchical Ensemble of ECOC for cancer classification based on multi-class microarray data. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.02.028] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
4
|
Tapia E, Spetale F, Krsticevic F, Angelone L, Bulacio P. DNA Barcoding through Quaternary LDPC Codes. PLoS One 2015; 10:e0140459. [PMID: 26492348 PMCID: PMC4619643 DOI: 10.1371/journal.pone.0140459] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 09/23/2015] [Indexed: 12/04/2022] Open
Abstract
For many parallel applications of Next-Generation Sequencing (NGS) technologies short barcodes able to accurately multiplex a large number of samples are demanded. To address these competitive requirements, the use of error-correcting codes is advised. Current barcoding systems are mostly built from short random error-correcting codes, a feature that strongly limits their multiplexing accuracy and experimental scalability. To overcome these problems on sequencing systems impaired by mismatch errors, the alternative use of binary BCH and pseudo-quaternary Hamming codes has been proposed. However, these codes either fail to provide a fine-scale with regard to size of barcodes (BCH) or have intrinsic poor error correcting abilities (Hamming). Here, the design of barcodes from shortened binary BCH codes and quaternary Low Density Parity Check (LDPC) codes is introduced. Simulation results show that although accurate barcoding systems of high multiplexing capacity can be obtained with any of these codes, using quaternary LDPC codes may be particularly advantageous due to the lower rates of read losses and undetected sample misidentification errors. Even at mismatch error rates of 10−2 per base, 24-nt LDPC barcodes can be used to multiplex roughly 2000 samples with a sample misidentification error rate in the order of 10−9 at the expense of a rate of read losses just in the order of 10−6.
Collapse
Affiliation(s)
- Elizabeth Tapia
- CIFASIS-Conicet Institute, Rosario, Argentina
- Fac. de Cs. Exactas e Ingeniería, Universidad Nac. de Rosario, Rosario, Argentina
- * E-mail:
| | - Flavio Spetale
- CIFASIS-Conicet Institute, Rosario, Argentina
- Fac. de Cs. Exactas e Ingeniería, Universidad Nac. de Rosario, Rosario, Argentina
| | | | - Laura Angelone
- CIFASIS-Conicet Institute, Rosario, Argentina
- Fac. de Cs. Exactas e Ingeniería, Universidad Nac. de Rosario, Rosario, Argentina
| | - Pilar Bulacio
- CIFASIS-Conicet Institute, Rosario, Argentina
- Fac. de Cs. Exactas e Ingeniería, Universidad Nac. de Rosario, Rosario, Argentina
| |
Collapse
|
5
|
Han H. Diagnostic biases in translational bioinformatics. BMC Med Genomics 2015; 8:46. [PMID: 26232237 PMCID: PMC4522082 DOI: 10.1186/s12920-015-0116-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Accepted: 07/07/2015] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND With the surge of translational medicine and computational omics research, complex disease diagnosis is more and more relying on massive omics data-driven molecular signature detection. However, how to detect and prevent possible diagnostic biases in translational bioinformatics remains an unsolved problem despite its importance in the coming era of personalized medicine. METHODS In this study, we comprehensively investigate the diagnostic bias problem by analyzing benchmark gene array, protein array, RNA-Seq and miRNA-Seq data under the framework of support vector machines for different model selection methods. We further categorize the diagnostic biases into different types by conducting rigorous kernel matrix analysis and provide effective machine learning methods to conquer the diagnostic biases. RESULTS In this study, we comprehensively investigate the diagnostic bias problem by analyzing benchmark gene array, protein array, RNA-Seq and miRNA-Seq data under the framework of support vector machines. We have found that the diagnostic biases happen for data with different distributions and SVM with different kernels. Moreover, we identify total three types of diagnostic biases: overfitting bias, label skewness bias, and underfitting bias in SVM diagnostics, and present corresponding reasons through rigorous analysis. Compared with the overfitting and underfitting biases, the label skewness bias is more challenging to detect and conquer because it can be easily confused as a normal diagnostic case from its deceptive accuracy. To tackle this problem, we propose a derivative component analysis based support vector machines to conquer the label skewness bias by achieving the rivaling clinical diagnostic results. CONCLUSIONS Our studies demonstrate that the diagnostic biases are mainly caused by the three major factors, i.e. kernel selection, signal amplification mechanism in high-throughput profiling, and training data label distribution. Moreover, the proposed DCA-SVM diagnosis provides a generic solution for the label skewness bias overcome due to the powerful feature extraction capability from derivative component analysis. Our work identifies and solves an important but less addressed problem in translational research. It also has a positive impact on machine learning for adding new results to kernel-based learning for omics data.
Collapse
Affiliation(s)
- Henry Han
- Department of Computer and Information Science, Fordham University, New York, 10023, NY, USA. .,Quantitative Proteomics Center, Columbia University, New York, NY, USA.
| |
Collapse
|
6
|
Sachnev V, Saraswathi S, Niaz R, Kloczkowski A, Suresh S. Multi-class BCGA-ELM based classifier that identifies biomarkers associated with hallmarks of cancer. BMC Bioinformatics 2015; 16:166. [PMID: 25986937 PMCID: PMC4448565 DOI: 10.1186/s12859-015-0565-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 03/31/2015] [Indexed: 12/05/2022] Open
Abstract
Background Traditional cancer treatments have centered on cytotoxic drugs and general purpose chemotherapy that may not be tailored to treat specific cancers. Identification of molecular markers that are related to different types of cancers might lead to discovery of drugs that are patient and disease specific. This study aims to use microarray gene expression cancer data to identify biomarkers that are indicative of different types of cancers. Our aim is to provide a multi-class cancer classifier that can simultaneously differentiate between cancers and identify type-specific biomarkers, through the application of the Binary Coded Genetic Algorithm (BCGA) and a neural network based Extreme Learning Machine (ELM) algorithm. Results BCGA and ELM are combined and used to select a subset of genes that are present in the Global Cancer Mapping (GCM) data set. This set of candidate genes contains over 52 biomarkers that are related to multiple cancers, according to the literature. They include APOA1, VEGFC, YWHAZ, B2M, EIF2S1, CCR9 and many other genes that have been associated with the hallmarks of cancer. BCGA-ELM is tested on several cancer data sets and the results are compared to other classification methods. BCGA-ELM compares or exceeds other algorithms in terms of accuracy. We were also able to show that over 50% of genes selected by BCGA-ELM on GCM data are cancer related biomarkers. Conclusions We were able to simultaneously differentiate between 14 different types of cancers, using only 92 genes, to achieve a multi-class classification accuracy of 95.4% which is between 21.6% and 38% higher than other results in the literature for multi-class cancer classification. Our findings suggest that computational algorithms such as BCGA-ELM can facilitate biomarker-driven integrated cancer research that can lead to a detailed understanding of the complexities of cancer. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0565-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Vasily Sachnev
- Department of Information, Communication and Electronics Engineering, Catholic University of Korea, Bucheon, Republic of Korea.
| | - Saras Saraswathi
- Battelle Center for Mathematical Medicine at The Research Institute at Nationwide Children's Hospital; currently at Sidra, Medical and Research Center, Doha, Qatar.
| | - Rashid Niaz
- Department of Medical Informatics, Sidra Medical and Research Center, Doha, Qatar.
| | - Andrzej Kloczkowski
- Battelle Center for Mathematical Medicine at The Research Institute at Nationwide Children's Hospital; Department of Pediatrics, College of Medicine, The Ohio State University, Columbus, USA.
| | - Sundaram Suresh
- School of Computer Science, Nanyang Technological University, Nanyang, Singapore.
| |
Collapse
|
7
|
Irsoy O, Yildiz OT, Alpaydin E. Design and analysis of classifier learning experiments in bioinformatics: survey and case studies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1663-1675. [PMID: 22908127 DOI: 10.1109/tcbb.2012.117] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
In many bioinformatics applications, it is important to assess and compare the performances of algorithms trained from data, to be able to draw conclusions unaffected by chance and are therefore significant. Both the design of such experiments and the analysis of the resulting data using statistical tests should be done carefully for the results to carry significance. In this paper, we first review the performance measures used in classification, the basics of experiment design and statistical tests. We then give the results of our survey over 1,500 papers published in the last two years in three bioinformatics journals (including this one). Although the basics of experiment design are well understood, such as resampling instead of using a single training set and the use of different performance metrics instead of error, only 21 percent of the papers use any statistical test for comparison. In the third part, we analyze four different scenarios which we encounter frequently in the bioinformatics literature, discussing the proper statistical methodology as well as showing an example case study for each. With the supplementary software, we hope that the guidelines we discuss will play an important role in future studies.
Collapse
Affiliation(s)
- Ozan Irsoy
- Department of Computer Engineering, Boğaziçi University, Bebek 34342, Istanbul, Turkey.
| | | | | |
Collapse
|