Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pang H, Lin A, Holford M, Enerson BE, Lu B, Lawton MP, Floyd E, Zhao H. Pathway analysis using random forests classification and regression. Bioinformatics 2006;22:2028-36. [PMID: 16809386 DOI: 10.1093/bioinformatics/btl344] [Citation(s) in RCA: 126] [Impact Index Per Article: 7.0] [Indexed: 01/12/2023] Open

Cited by Other Article(s)

Büyükakın F, Özyılmaz A, Işık E, Bayraktar Y, Olgun MF, Toprak M. Pandemics, Income Inequality, and Refugees: The Case of COVID-19. SOCIAL WORK IN PUBLIC HEALTH 2024;39:78-92. [PMID: 38372287 DOI: 10.1080/19371918.2024.2318372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/20/2024]

Nizeyimana P, Lee KE, Kim I. Bayesian pathway selection. J Korean Stat Soc 2023. [DOI: 10.1007/s42952-022-00201-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/26/2023]

Random Forests in Count Data Modelling: An Analysis of the Influence of Data Features and Overdispersion on Regression Performance. JOURNAL OF PROBABILITY AND STATISTICS 2022. [DOI: 10.1155/2022/2833537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/04/2022] Open

Abstract

Machine learning algorithms, especially random forests (RFs), have become an integrated part of the modern scientific methodology and represent an efficient alternative to conventional parametric algorithms. This study aimed to assess the influence of data features and overdispersion on RF regression performance. We assessed the effect of types of predictors (100, 75, 50, and 20% continuous, and 100% categorical), the number of predictors (p = 8, 16, and 24), and the sample size (N = 50, 250, and 1250) on RF parameter settings. We also compared RF performance to that of classical generalized linear models (Poisson, negative binomial, and zero-inflated Poisson) and the linear model applied to log-transformed data. Two real datasets were analysed to demonstrate the usefulness of RF for overdispersed data modelling. Goodness-of-fit statistics such as root mean square error (RMSE) and biases were used to determine RF accuracy and validity. Results revealed that the number of variables to be randomly selected for each split, the proportion of samples to train the model, the minimal number of samples within each terminal node, and RF regression performance are not influenced by the sample size, number, and type of predictors. However, the ratio of observations to the number of predictors affects the stability of the best RF parameters. RF performs well for all types of covariates and different levels of dispersion. The magnitude of dispersion does not significantly influence RF predictive validity. In contrast, its predictive accuracy is significantly influenced by the magnitude of dispersion in the response variable, conditional on the explanatory variables. RF has performed almost as well as the models of the classical Poisson family in the presence of overdispersion. Given RF’s advantages, it is an appropriate statistical alternative for count data.
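The abstract above names RMSE and bias as its goodness-of-fit statistics and treats overdispersion as the key data feature. As a minimal sketch of those quantities, assuming a simple marginal variance-to-mean ratio as the dispersion measure (the study itself refers to dispersion conditional on the explanatory variables, which this does not model), the calculation might look like the following. The function names and the toy data are hypothetical and are not taken from the cited article.

```python
import math
from statistics import mean, variance


def rmse(observed, predicted):
    """Root mean square error between observed and predicted counts."""
    return math.sqrt(mean((o - p) ** 2 for o, p in zip(observed, predicted)))


def bias(observed, predicted):
    """Mean signed error, computed here as predicted minus observed."""
    return mean(p - o for o, p in zip(observed, predicted))


def dispersion_index(counts):
    """Sample variance divided by the mean.

    Values well above 1 suggest overdispersion relative to a Poisson
    model. This is a marginal index (an assumption of this sketch),
    not the conditional dispersion discussed in the abstract.
    """
    return variance(counts) / mean(counts)


# Hypothetical toy data, for illustration only.
observed = [0, 1, 0, 7, 2, 0, 12, 1]
predicted = [1.0, 1.5, 0.5, 5.0, 2.0, 1.0, 9.0, 1.0]

print("RMSE:", rmse(observed, predicted))
print("Bias:", bias(observed, predicted))
print("Dispersion index:", dispersion_index(observed))
```

The same three functions could be applied to predictions from any model (random forest or a Poisson-family GLM), which is how a like-for-like comparison of the kind described would typically be set up.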

Zhao J, Jiang H, Zou G, Lin Q, Wang Q, Liu J, Ma L. CNNArginineMe: A CNN structure for training models for predicting arginine methylation sites based on the One-Hot encoding of peptide sequence. Front Genet 2022;13:1036862. [PMID: 36324513 PMCID: PMC9618650 DOI: 10.3389/fgene.2022.1036862] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 09/06/2022] [Accepted: 10/04/2022] [Indexed: 11/30/2022] Open

The advanced design of bioleaching process for metal recovery: A machine learning approach. Sep Purif Technol 2022. [DOI: 10.1016/j.seppur.2022.120919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 12/12/2022]

Zhang X, Xuan J, Yao C, Gao Q, Wang L, Jin X, Li S. A deep learning approach for orphan gene identification in moso bamboo (Phyllostachys edulis) based on the CNN + Transformer model. BMC Bioinformatics 2022;23:162. [PMID: 35513802 PMCID: PMC9069780 DOI: 10.1186/s12859-022-04702-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/14/2021] [Accepted: 04/28/2022] [Indexed: 12/02/2022] Open

Canella Vieira C, Zhou J, Usovsky M, Vuong T, Howland AD, Lee D, Li Z, Zhou J, Shannon G, Nguyen HT, Chen P. Exploring Machine Learning Algorithms to Unveil Genomic Regions Associated With Resistance to Southern Root-Knot Nematode in Soybeans. FRONTIERS IN PLANT SCIENCE 2022;13:883280. [PMID: 35592556 PMCID: PMC9111516 DOI: 10.3389/fpls.2022.883280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2022] [Accepted: 04/08/2022] [Indexed: 06/15/2023]

Abstract

Southern root-knot nematode [SRKN, Meloidogyne incognita (Kofoid & White) Chitwood] is a plant-parasitic nematode challenging to control due to its short life cycle, a wide range of hosts, and limited management options, of which genetic resistance is the main option to efficiently control the damage caused by SRKN. To date, a major quantitative trait locus (QTL) mapped on chromosome (Chr.) 10 plays an essential role in resistance to SRKN in soybean varieties. The confidence of discovered trait-loci associations by traditional methods is often limited by the assumptions of individual single nucleotide polymorphisms (SNPs) always acting independently as well as the phenotype following a Gaussian distribution. Therefore, the objective of this study was to conduct machine learning (ML)-based genome-wide association studies (GWAS) utilizing Random Forest (RF) and Support Vector Machine (SVM) algorithms to unveil novel regions of the soybean genome associated with resistance to SRKN. A total of 717 breeding lines derived from 330 unique bi-parental populations were genotyped with the Illumina Infinium BARCSoySNP6K BeadChip and phenotyped for SRKN resistance in a greenhouse. A GWAS pipeline involving a supervised feature dimension reduction based on Variable Importance in Projection (VIP) and SNP detection based on classification accuracy was proposed. Minor-effect SNPs were detected by the proposed ML-GWAS methodology but not identified using Bayesian-information and linkage-disequilibrium Iteratively Nested Keyway (BLINK), Fixed and Random Model Circulating Probability Unification (FarmCPU), and Enriched Compressed Mixed Linear Model (ECMLM) models. Besides the genomic region on Chr. 10 that can explain most of SRKN resistance variance, additional minor-effect SNPs were also identified on Chrs. 10 and 11. The findings in this study demonstrated that overfitting in GWAS may lead to lower prediction accuracy, and the detection of significant SNPs based on classification accuracy limited false-positive associations. The expansion of the basis of the genetic resistance to SRKN can potentially reduce the selection pressure over the major QTL on Chr. 10 and achieve higher levels of resistance.

Shen J, Jin G, Zhang Z, Zhang J, Sun Y, Xie X, Ma T, Zhu Y, Du Y, Niu Y, Shi X. A multiple-dimension model for microbiota of patients with colorectal cancer from normal participants and other intestinal disorders. Appl Microbiol Biotechnol 2022;106:2161-2173. [PMID: 35218389 DOI: 10.1007/s00253-022-11846-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/10/2021] [Revised: 02/12/2022] [Accepted: 02/19/2022] [Indexed: 11/02/2022]

Affiliation(s)

Jian Shen: Department of Medical Administration, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China; Laboratory Medicine Center, Department of Transfusion Medicine, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Gulei Jin: Hangzhou GUHE Information and Technology Company, Hangzhou, Zhejiang, China; Department of Clinical Laboratory, The Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
Zhengliang Zhang: Department of Clinical Laboratory, The Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
Jun Zhang: Department of Medical Administration, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China; Cancer Center, Department of Gastroenterology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Yan Sun: Cancer Center, Department of Gastroenterology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Xiaoxiao Xie: Hangzhou GUHE Information and Technology Company, Hangzhou, Zhejiang, China
Tingting Ma: Hangzhou GUHE Information and Technology Company, Hangzhou, Zhejiang, China
Yongze Zhu: Laboratory Medicine Center, Department of Clinical Laboratory, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Yaoqiang Du: Laboratory Medicine Center, Department of Transfusion Medicine, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Yaofang Niu: Hangzhou GUHE Information and Technology Company, Hangzhou, Zhejiang, China
Xinwei Shi: Department of Nursing, The Eye Hospital of Wenzhou Medical University (Zhejiang Eye Hospital), Hangzhou, Zhejiang, China

Jung SY, Sobel EM, Pellegrini M, Yu H, Papp JC. Synergistic Effects of Genetic Variants of Glucose Homeostasis and Lifelong Exposures to Cigarette Smoking, Female Hormones, and Dietary Fat Intake on Primary Colorectal Cancer Development in African and Hispanic/Latino American Women. Front Oncol 2021;11:760243. [PMID: 34692549 PMCID: PMC8529283 DOI: 10.3389/fonc.2021.760243] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/17/2021] [Accepted: 09/22/2021] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

Disparities in cancer genomic science exist among racial/ethnic minorities. Particularly, African American (AA) and Hispanic/Latino American (HA) women, the 2 largest minorities, are underrepresented in genetic/genome-wide studies for cancers and their risk factors. We conducted on AA and HA postmenopausal women a genomic study for insulin resistance (IR), the main biologic mechanism underlying colorectal cancer (CRC) carcinogenesis owing to obesity.

METHODS

With 780 genome-wide IR-specific single-nucleotide polymorphisms (SNPs) among 4,692 AA and 1,986 HA women, we constructed a CRC-risk prediction model. Along with these SNPs, we incorporated CRC-associated lifestyles in the model of each group and detected the topmost influential genetic and lifestyle factors. Further, we estimated the attributable risk of the topmost risk factors shared by the groups to explore potential factors that differentiate CRC risk between these groups.

RESULTS

In both groups, we detected IR-SNPs in PCSK1 (in AA) and IFT172, GCKR, and NRBP1 (in HA) and risk lifestyles, including long lifetime exposures to cigarette smoking and endogenous female hormones and daily intake of polyunsaturated fatty acids (PFA), as the topmost predictive variables for CRC risk. Combinations of those top genetic- and lifestyle-markers synergistically increased CRC risk. Of those risk factors, dietary PFA intake and long lifetime exposure to female hormones may play a key role in mediating racial disparity of CRC incidence between AA and HA women.

CONCLUSIONS

Our results may improve CRC risk prediction performance in those medically/scientifically underrepresented groups and lead to the development of genetically informed interventions for cancer prevention and therapeutic effort, thus contributing to reduced cancer disparities in those minority subpopulations.

Jung SY. Genetic Signatures of Glucose Homeostasis: Synergistic Interplay With Long-Term Exposure to Cigarette Smoking in Development of Primary Colorectal Cancer Among African American Women. Clin Transl Gastroenterol 2021;12:e00412. [PMID: 34608882 PMCID: PMC8500576 DOI: 10.14309/ctg.0000000000000412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/25/2021] [Accepted: 08/22/2021] [Indexed: 11/17/2022] Open

Montesinos-López OA, Montesinos-López A, Mosqueda-Gonzalez BA, Montesinos-López JC, Crossa J, Ramirez NL, Singh P, Valladares-Anguiano FA. A zero altered Poisson random forest model for genomic-enabled prediction. G3-GENES GENOMES GENETICS 2021;11:6042695. [PMID: 33693599 PMCID: PMC8022945 DOI: 10.1093/g3journal/jkaa057] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 11/03/2020] [Accepted: 12/10/2020] [Indexed: 12/23/2022]

Smith PF, Zheng Y. Applications of Multivariate Statistical and Data Mining Analyses to the Search for Biomarkers of Sensorineural Hearing Loss, Tinnitus, and Vestibular Dysfunction. Front Neurol 2021;12:627294. [PMID: 33746881 PMCID: PMC7966509 DOI: 10.3389/fneur.2021.627294] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/09/2020] [Accepted: 02/01/2021] [Indexed: 11/24/2022] Open

He S, Guo F, Zou Q, Ding H. MRMD2.0: A Python Tool for Machine Learning with Feature Ranking and Reduction. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200503030350] [Citation(s) in RCA: 101] [Impact Index Per Article: 33.7] [Indexed: 11/22/2022]

Seifert S, Gundlach S, Junge O, Szymczak S. Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study. Bioinformatics 2021;36:4301-4308. [PMID: 32399562 PMCID: PMC7520048 DOI: 10.1093/bioinformatics/btaa483] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 10/09/2019] [Revised: 03/13/2020] [Accepted: 05/05/2020] [Indexed: 12/12/2022] Open

Gut microbiome analysis as a predictive marker for the gastric cancer patients. Appl Microbiol Biotechnol 2021;105:803-814. [PMID: 33404833 DOI: 10.1007/s00253-020-11043-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 06/05/2020] [Revised: 11/24/2020] [Accepted: 12/01/2020] [Indexed: 02/06/2023]

Gao Q, Jin X, Xia E, Wu X, Gu L, Yan H, Xia Y, Li S. Identification of Orphan Genes in Unbalanced Datasets Based on Ensemble Learning. Front Genet 2020;11:820. [PMID: 33133122 PMCID: PMC7567012 DOI: 10.3389/fgene.2020.00820] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 06/09/2020] [Accepted: 07/08/2020] [Indexed: 11/13/2022] Open

Yan KK, Wang X, Lam WWT, Vardhanabhuti V, Lee AWM, Pang HH. Radiomics analysis using stability selection supervised component analysis for right-censored survival data. Comput Biol Med 2020;124:103959. [PMID: 32905923 PMCID: PMC7501167 DOI: 10.1016/j.compbiomed.2020.103959] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 02/19/2020] [Revised: 08/02/2020] [Accepted: 08/03/2020] [Indexed: 02/03/2023]

Gu X, Chen Z, Wang D. Prediction of G Protein-Coupled Receptors With CTDC Extraction and MRMD2.0 Dimension-Reduction Methods. Front Bioeng Biotechnol 2020;8:635. [PMID: 32671038 PMCID: PMC7329982 DOI: 10.3389/fbioe.2020.00635] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 04/21/2020] [Accepted: 05/26/2020] [Indexed: 11/13/2022] Open

Effect of the Abnormal Expression of BMP-4 in the Blood of Diabetic Patients on the Osteogenic Differentiation Potential of Alveolar BMSCs and the Rescue Effect of Metformin: A Bioinformatics-Based Study. BIOMED RESEARCH INTERNATIONAL 2020;2020:7626215. [PMID: 32596370 PMCID: PMC7298258 DOI: 10.1155/2020/7626215] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 01/30/2020] [Accepted: 04/28/2020] [Indexed: 02/08/2023]

Wang H, Sham P, Tong T, Pang H. Pathway-Based Single-Cell RNA-Seq Classification, Clustering, and Construction of Gene-Gene Interactions Networks Using Random Forests. IEEE J Biomed Health Inform 2020;24:1814-1822. [DOI: 10.1109/jbhi.2019.2944865] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Indexed: 11/09/2022]

Fernández-Martínez JL, Álvarez-Machancoses Ó, deAndrés-Galiana EJ, Bea G, Kloczkowski A. Robust Sampling of Defective Pathways in Alzheimer's Disease. Implications in Drug Repositioning. Int J Mol Sci 2020;21:3594. [PMID: 32438758 PMCID: PMC7279419 DOI: 10.3390/ijms21103594] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 04/27/2020] [Revised: 05/09/2020] [Accepted: 05/13/2020] [Indexed: 12/21/2022] Open

Abstract

We present the analysis of the defective genetic pathways of the Late-Onset Alzheimer’s Disease (LOAD) compared to the Mild Cognitive Impairment (MCI) and Healthy Controls (HC) using different sampling methodologies. These algorithms sample the uncertainty space that is intrinsic to any kind of highly underdetermined phenotype prediction problem, by looking for the minimum-scale signatures (header genes) corresponding to different random holdouts. The biological pathways can be identified performing posterior analysis of these signatures established via cross-validation holdouts and plugging the set of most frequently sampled genes into different ontological platforms. That way, the effect of helper genes, whose presence might be due to the high degree of under determinacy of these experiments and data noise, is reduced. Our results suggest that common pathways for Alzheimer’s disease and MCI are mainly related to viral mRNA translation, influenza viral RNA transcription and replication, gene expression, mitochondrial translation, and metabolism, with these results being highly consistent regardless of the comparative methods. The cross-validated predictive accuracies achieved for the LOAD and MCI discriminations were 84% and 81.5%, respectively. The difference between LOAD and MCI could not be clearly established (74% accuracy). The most discriminatory genes of the LOAD-MCI discrimination are associated with proteasome mediated degradation and G-protein signaling. Based on these findings we have also performed drug repositioning using Dr. Insight package, proposing the following different typologies of drugs: isoquinoline alkaloids, antitumor antibiotics, phosphoinositide 3-kinase PI3K, autophagy inhibitors, antagonists of the muscarinic acetylcholine receptor and histone deacetylase inhibitors. We believe that the potential clinical relevance of these findings should be further investigated and confirmed with other independent studies.

Seifert S. Application of random forest based approaches to surface-enhanced Raman scattering data. Sci Rep 2020;10:5436. [PMID: 32214194 PMCID: PMC7096517 DOI: 10.1038/s41598-020-62338-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Received: 11/12/2019] [Accepted: 02/26/2020] [Indexed: 01/08/2023] Open

Holland CH, Tanevski J, Perales-Patón J, Gleixner J, Kumar MP, Mereu E, Joughin BA, Stegle O, Lauffenburger DA, Heyn H, Szalai B, Saez-Rodriguez J. Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data. Genome Biol 2020;21:36. [PMID: 32051003 PMCID: PMC7017576 DOI: 10.1186/s13059-020-1949-z] [Citation(s) in RCA: 173] [Impact Index Per Article: 43.3] [Received: 09/03/2019] [Accepted: 01/29/2020] [Indexed: 12/31/2022] Open

Affiliation(s)

Christian H Holland: Institute for Computational Biomedicine, Bioquant, Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Heidelberg, Germany; Joint Research Centre for Computational Biomedicine (JRC-COMBINE), RWTH Aachen University, Faculty of Medicine, Aachen, Germany
Jovan Tanevski: Institute for Computational Biomedicine, Bioquant, Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Heidelberg, Germany; Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia
Javier Perales-Patón: Institute for Computational Biomedicine, Bioquant, Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Heidelberg, Germany
Jan Gleixner: German Cancer Research Center (DKFZ), Heidelberg, Germany; European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
Manu P Kumar: Department of Biological Engineering, MIT, Cambridge, MA, USA
Elisabetta Mereu: CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
Brian A Joughin: Department of Biological Engineering, MIT, Cambridge, MA, USA; Koch Institute for Integrative Cancer Biology, MIT, Cambridge, MA, USA
Oliver Stegle: German Cancer Research Center (DKFZ), Heidelberg, Germany; European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, UK
Douglas A Lauffenburger: Department of Biological Engineering, MIT, Cambridge, MA, USA
Holger Heyn: CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
Bence Szalai: Faculty of Medicine, Department of Physiology, Semmelweis University, Budapest, Hungary
Julio Saez-Rodriguez: Institute for Computational Biomedicine, Bioquant, Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Heidelberg, Germany; Joint Research Centre for Computational Biomedicine (JRC-COMBINE), RWTH Aachen University, Faculty of Medicine, Aachen, Germany

Martey ONK, Greish K, Smith PF, Rosengren RJ. A multivariate statistical analysis of the effects of styrene maleic acid encapsulated RL71 in a xenograft model of triple negative breast cancer. J Biol Methods 2019;6:e121. [PMID: 31976348 PMCID: PMC6974696 DOI: 10.14440/jbm.2019.306] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 06/03/2019] [Revised: 09/08/2016] [Accepted: 10/07/2019] [Indexed: 12/29/2022] Open

Network-based Biased Tree Ensembles (NetBiTE) for Drug Sensitivity Prediction and Drug Sensitivity Biomarker Identification in Cancer. Sci Rep 2019;9:15918. [PMID: 31685861 PMCID: PMC6828742 DOI: 10.1038/s41598-019-52093-w] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 05/14/2019] [Accepted: 10/07/2019] [Indexed: 12/15/2022] Open

Rahimi A, Gönen M. Discriminating early- and late-stage cancers using multiple kernel learning on gene sets. Bioinformatics 2019;34:i412-i421. [PMID: 29949993 PMCID: PMC6022595 DOI: 10.1093/bioinformatics/bty239] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Indexed: 12/20/2022] Open

Abstract

Motivation

Identifying molecular mechanisms that drive cancers from early to late stages is highly important to develop new preventive and therapeutic strategies. Standard machine learning algorithms could be used to discriminate early- and late-stage cancers from each other using their genomic characterizations. Even though these algorithms would get satisfactory predictive performance, their knowledge extraction capability would be quite restricted due to highly correlated nature of genomic data. That is why we need algorithms that can also extract relevant information about these biological mechanisms using our prior knowledge about pathways/gene sets.

Results

In this study, we addressed the problem of separating early- and late-stage cancers from each other using their gene expression profiles. We proposed to use a multiple kernel learning (MKL) formulation that makes use of pathways/gene sets (i) to obtain satisfactory/improved predictive performance and (ii) to identify biological mechanisms that might have an effect in cancer progression. We extensively compared our proposed MKL on gene sets algorithm against two standard machine learning algorithms, namely, random forests and support vector machines, on 20 diseases from the Cancer Genome Atlas cohorts for two different sets of experiments. Our method obtained statistically significantly better or comparable predictive performance on most of the datasets using significantly fewer gene expression features. We also showed that our algorithm was able to extract meaningful and disease-specific information that gives clues about the progression mechanism.

Availability and implementation

Our implementations of support vector machine and multiple kernel learning algorithms in R are available at https://github.com/mehmetgonen/gsbc together with the scripts that replicate the reported experiments.

Modelling the Spatial Distribution of Asbestos–Cement Products in Poland with the Use of the Random Forest Algorithm. SUSTAINABILITY 2019. [DOI: 10.3390/su11164355] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Indexed: 11/17/2022]

Abstract

The unique set of physical and chemical properties of asbestos has led to its many industrial applications worldwide, of which roofing and facades constitute approximately 80% of currently used asbestos-containing products. Since asbestos-containing products are harmful to human health, their use and production have been banned in many countries. To date, no research has been undertaken to estimate the total amount of asbestos–cement products used at the country level in relation to regions or other administrative units. The objective of this paper is to present a possible new solution for developing the spatial distribution of asbestos–cement products used across the country by applying the supervised machine learning algorithm, i.e., Random Forest. Based on the results of a physical inventory taken on asbestos–cement products with the use of aerial imagery, and the application of selected features, considering the socio-economic situation of Poland, i.e., population, buildings, public finance, housing economy and municipal infrastructure, wages, salaries and social security benefits, agricultural census, entities of the national economy, labor market, environment protection, area of built-up surfaces, historical belonging to annexations, and data on asbestos manufacturing plants, best Random Forest models were computed. The selection of important variables was made in the R v.3.1.0 program and supported by the Boruta algorithm. The prediction of the amount of asbestos–cement products used in communes was executed in the randomForest package. An algorithm explaining 75.85% of the variance was subsequently used to prepare the prediction map of the spatial distribution of the amount of asbestos–cement products used in Poland. The total amount was estimated at 710,278,645 m² (7.8 million tons). Since the best model used data on built-up surfaces which are available for the whole of Europe, it is worth considering the use of the developed method in other European countries, as well as to assess the environmental risk of asbestos exposure to humans.

Jung SY, Zhang ZF. The effects of genetic variants related to insulin metabolism pathways and the interactions with lifestyles on colorectal cancer risk. Menopause 2019;26:771-780. [PMID: 30649085 PMCID: PMC7035960 DOI: 10.1097/gme.0000000000001301] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 02/07/2023]

Xu Y, Kim I, Carroll RJ. A hybrid omnibus test for generalized semiparametric single-index models with high-dimensional covariate sets. Biometrics 2019;75:757-767. [PMID: 30859553 DOI: 10.1111/biom.13054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 10/14/2017] [Accepted: 02/26/2019] [Indexed: 11/27/2022]

Paldino MJ, Golriz F, Zhang W, Chu ZD. Normalization enhances brain network features that predict individual intelligence in children with epilepsy. PLoS One 2019;14:e0212901. [PMID: 30835738 PMCID: PMC6400436 DOI: 10.1371/journal.pone.0212901] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 06/05/2018] [Accepted: 02/12/2019] [Indexed: 12/18/2022] Open

Abstract

BACKGROUND AND PURPOSE

Architecture of the cerebral network has been shown to associate with IQ in children with epilepsy. However, subject-level prediction on this basis, a crucial step toward harnessing network analyses for the benefit of children with epilepsy, has yet to be achieved. We compared two network normalization strategies in terms of their ability to optimize subject-level inferences on the relationship between brain network architecture and brain function.

MATERIALS AND METHODS

Patients with epilepsy and resting state fMRI were retrospectively identified. Brain network nodes were defined by anatomic parcellation, first in patient space (nodes defined for each patient) and again in template space (same nodes for all patients). Whole-brain weighted graphs were constructed according to pair-wise correlation of BOLD-signal time courses between nodes. The following metrics were then calculated: clustering coefficient, transitivity, modularity, path length, and global efficiency. Metrics computed on graphs in patient space were normalized to the same metric computed on a random network of identical size. A machine learning algorithm was used to predict patient IQ given access to only the network metrics.

RESULTS

Twenty-seven patients (8-18 years) comprised the final study group. All brain networks demonstrated expected small world properties. Accounting for intrinsic population heterogeneity had a significant effect on prediction accuracy. Specifically, transformation of all patients into a common standard space as well as normalization of metrics to those computed on a random network both substantially outperformed the use of non-normalized metrics.

CONCLUSION

Normalization contributed significantly to accurate subject-level prediction of cognitive function in children with epilepsy. These findings support the potential for quantitative network approaches to contribute clinically meaningful information in children with neurological disorders.
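
The second normalization strategy in the abstract above (dividing a metric by the same metric computed on a random network of identical size) can be sketched as follows. This is only an illustration under stated assumptions: the study used weighted whole-brain graphs, whereas this sketch uses small unweighted graphs, and the helper names (`clustering_coefficient`, `ring_lattice`, `random_graph`, `normalized_clustering`), the graph sizes, and the number of random trials are all hypothetical choices, not taken from the paper.

```python
import random
from itertools import combinations


def clustering_coefficient(adj):
    """Mean local clustering coefficient of an unweighted, undirected graph."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue  # nodes with fewer than two neighbours contribute 0
        links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)


def ring_lattice(n, k):
    """Each node is linked to its k nearest neighbours on a ring (toy test graph)."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for step in range(1, k // 2 + 1):
            j = (i + step) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def random_graph(n, m, rng):
    """Random graph with n nodes and exactly m edges chosen uniformly."""
    adj = {i: set() for i in range(n)}
    for a, b in rng.sample(list(combinations(range(n), 2)), m):
        adj[a].add(b)
        adj[b].add(a)
    return adj


def normalized_clustering(adj, trials=20, seed=0):
    """Metric divided by its mean over random graphs of identical size."""
    rng = random.Random(seed)
    n = len(adj)
    m = sum(len(v) for v in adj.values()) // 2
    reference = sum(
        clustering_coefficient(random_graph(n, m, rng)) for _ in range(trials)
    ) / trials
    return clustering_coefficient(adj) / reference


lattice = ring_lattice(40, 4)
raw = clustering_coefficient(lattice)
ratio = normalized_clustering(lattice)
print(f"raw clustering: {raw:.3f}, normalized: {ratio:.2f}")
```

A ratio well above 1 indicates more clustering than expected by chance for a graph of that size, which is what makes the normalized value comparable across subjects whose networks differ in size.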

Lim S, Lee S, Jung I, Rhee S, Kim S. Comprehensive and critical evaluation of individualized pathway activity measurement tools on pan-cancer data. Brief Bioinform 2018;21:36-46. [PMID: 30462155 DOI: 10.1093/bib/bby097] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 08/20/2018] [Accepted: 09/09/2018] [Indexed: 12/11/2022] Open

Mutual Information Better Quantifies Brain Network Architecture in Children with Epilepsy. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018;2018:6142898. [PMID: 30425750 PMCID: PMC6217888 DOI: 10.1155/2018/6142898] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 08/06/2018] [Accepted: 09/18/2018] [Indexed: 01/01/2023]

Abstract

Purpose

Metrics of the brain network architecture derived from resting-state fMRI have been shown to provide physiologically meaningful markers of IQ in children with epilepsy. However, traditional measures of functional connectivity (FC), specifically the Pearson correlation, assume a dominant linear relationship between BOLD time courses; this assumption may not be valid. Mutual information is an alternative measure of FC which has shown promise in the study of complex networks due to its ability to flexibly capture association of diverse forms. We aimed to compare network metrics derived from mutual information-defined FC to those derived from traditional correlation in terms of their capacity to predict patient-level IQ.

Materials and Methods

Patients were retrospectively identified with the following: (1) focal epilepsy; (2) resting-state fMRI; and (3) full-scale IQ by a neuropsychologist. Brain network nodes were defined by anatomic parcellation. Parcellation was performed at the size threshold of 350 mm², resulting in networks containing 780 nodes. Whole-brain, weighted graphs were then constructed according to the pairwise connectivity between nodes. In the traditional condition, edges (connections) between each pair of nodes were defined as the absolute value of the Pearson correlation coefficient between their BOLD time courses. In the mutual information condition, edges were defined as the mutual information between time courses. The following metrics were then calculated for each weighted graph: clustering coefficient, modularity, characteristic path length, and global efficiency. A machine learning algorithm was used to predict the IQ of each individual based on their network metrics. Prediction accuracy was assessed as the fractional variation explained for each condition.

Results

Twenty-four patients met the inclusion criteria (age: 8-18 years). All brain networks demonstrated expected small-world properties. Network metrics derived from mutual information-defined FC significantly outperformed the use of the Pearson correlation. Specifically, fractional variation explained was 49% (95% CI: 46%, 51%) for the mutual information method; the Pearson correlation demonstrated a variation of 17% (95% CI: 13%, 19%).

Conclusion

Mutual information-defined functional connectivity captures physiologically relevant features of the brain network better than correlation.

Clinical Relevance

Optimizing the capacity to predict cognitive phenotypes at the patient level is a necessary step toward the clinical utility of network-based biomarkers.
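
The contrast between the two edge definitions in the abstract above can be illustrated with a toy pair of signals. This is a minimal sketch, not the authors' pipeline: the abstract does not state how mutual information was estimated, so the equal-width histogram estimator, the bin count of 8, and the function names `pearson_abs` and `mutual_information` are assumptions made for illustration only.

```python
import math
from collections import Counter


def pearson_abs(x, y):
    """Absolute value of the Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return abs(cov / math.sqrt(vx * vy))


def _bin_index(v, lo, hi, bins):
    """Equal-width bin index; the maximum value falls in the last bin."""
    if hi == lo:
        return 0
    return min(int((v - lo) / (hi - lo) * bins), bins - 1)


def mutual_information(x, y, bins=8):
    """Histogram estimate of mutual information, in nats."""
    n = len(x)
    bx = [_bin_index(v, min(x), max(x), bins) for v in x]
    by = [_bin_index(v, min(y), max(y), bins) for v in y]
    joint, px, py = Counter(zip(bx, by)), Counter(bx), Counter(by)
    mi = 0.0
    for (i, j), count in joint.items():
        p = count / n
        mi += p * math.log(p / ((px[i] / n) * (py[j] / n)))
    return mi


# Toy signals with a purely nonlinear dependence: y = x^2 on a symmetric range.
x = [i / 50 - 1 for i in range(101)]
y = [v * v for v in x]

r = pearson_abs(x, y)
mi = mutual_information(x, y)
print(f"|Pearson r| = {r:.3g}, mutual information = {mi:.3f} nats")
```

Here the linear correlation is essentially zero even though one signal is fully determined by the other, while the mutual information estimate is clearly positive. Histogram estimates depend on the bin count and sample size, so any real analysis would need to choose and validate an estimator.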

Zhang L, Kim I. Semiparametric Bayesian kernel survival model for evaluating pathway effects. Stat Methods Med Res 2018;28:3301-3317. [PMID: 30289021 DOI: 10.1177/0962280218797360] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Li B, Zhang N, Wang YG, George AW, Reverter A, Li Y. Genomic Prediction of Breeding Values Using a Subset of SNPs Identified by Three Machine Learning Methods. Front Genet 2018;9:237. [PMID: 30023001 PMCID: PMC6039760 DOI: 10.3389/fgene.2018.00237] [Citation(s) in RCA: 79] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2018] [Accepted: 06/14/2018] [Indexed: 12/22/2022] Open

Abstract

The analysis of large genomic data is hampered by issues such as a small number of observations and a large number of predictive variables (commonly known as “large P small N”), high dimensionality or highly correlated data structures. Machine learning methods are renowned for dealing with these problems. To date machine learning methods have been applied in Genome-Wide Association Studies for identification of candidate genes, epistasis detection, gene network pathway analyses and genomic prediction of phenotypic values. However, the utility of two machine learning methods, Gradient Boosting Machine (GBM) and Extreme Gradient Boosting Method (XgBoost), in identifying a subset of SNP markers for genomic prediction of breeding values has never been explored before. In this study, using 38,082 SNP markers and body weight phenotypes from 2,093 Brahman cattle (1,097 bulls as a discovery population and 996 cows as a validation population), we examined the efficiency of three machine learning methods, namely Random Forests (RF), GBM and XgBoost, in (a) the identification of top 400, 1,000, and 3,000 ranked SNPs; and (b) using the subsets of SNPs to construct genomic relationship matrices (GRMs) for the estimation of genomic breeding values (GEBVs). For comparison purposes, we also calculated the GEBVs from (1) 400, 1,000, and 3,000 SNPs that were randomly selected and evenly spaced across the genome, and (2) from all the SNPs. We found that RF and especially GBM are efficient methods in identifying a subset of SNPs with direct links to candidate genes affecting the growth trait. In comparison to the estimate of prediction accuracy of GEBVs from using all SNPs (0.43), the 3,000 top SNPs identified by RF (0.42) and GBM (0.46) had similar values to those of the whole SNP panel. The performance of the subsets of SNPs from RF and GBM was substantially better than that of evenly spaced subsets across the genome (0.18–0.29). Of the three methods, RF and GBM consistently outperformed the XgBoost in genomic prediction accuracy.

Wang J, Jain S, Chen D, Song W, Hu CT, Su YH. Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening. Sci Rep 2018;8:3799. [PMID: 29491388 PMCID: PMC5830457 DOI: 10.1038/s41598-018-21922-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 02/13/2018] [Indexed: 02/07/2023] Open

Jung SY, Papp JC, Sobel EM, Zhang ZF. Genetic Variants in Metabolic Signaling Pathways and Their Interaction with Lifestyle Factors on Breast Cancer Risk: A Random Survival Forest Analysis. Cancer Prev Res (Phila) 2018;11:44-51. [PMID: 29074537 PMCID: PMC5754228 DOI: 10.1158/1940-6207.capr-17-0143] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Revised: 09/06/2017] [Accepted: 10/18/2017] [Indexed: 12/18/2022]

Cheng L, Shan L, Kim I. Multilevel Gaussian graphical model for multilevel networks. J Stat Plan Inference 2017. [DOI: 10.1016/j.jspi.2017.05.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Chiappini F, Coilly A, Kadar H, Gual P, Tran A, Desterke C, Samuel D, Duclos-Vallée JC, Touboul D, Bertrand-Michel J, Brunelle A, Guettier C, Le Naour F. Metabolism dysregulation induces a specific lipid signature of nonalcoholic steatohepatitis in patients. Sci Rep 2017;7:46658. [PMID: 28436449 PMCID: PMC5402394 DOI: 10.1038/srep46658] [Citation(s) in RCA: 155] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 03/28/2017] [Indexed: 02/07/2023] Open

Affiliation(s)

Franck Chiappini: Inserm, Unité 1193, Villejuif, F-94800, France; Univ Paris-Sud, UMR-S1193, Villejuif, F-94800, France; DHU Hepatinov, Villejuif, F-94800, France
Audrey Coilly: Inserm, Unité 1193, Villejuif, F-94800, France; Univ Paris-Sud, UMR-S1193, Villejuif, F-94800, France; DHU Hepatinov, Villejuif, F-94800, France; AP-HP, Hôpital Paul-Brousse, Centre Hépato-Biliaire, Villejuif, F-94800, France
Hanane Kadar: Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Univ. Paris-Sud, Université Paris-Saclay, F-91198 Gif-Sur-Yvette, France
Philippe Gual: Inserm, Unité 1065, Nice, F-06204, France; University of Nice-Sophia-Antipolis, Nice, F-06204, France; Centre Hospitalier Universitaire de Nice, Hôpital L'Archet, Nice Cedex 3, F-06202, France
Albert Tran: Inserm, Unité 1065, Nice, F-06204, France; University of Nice-Sophia-Antipolis, Nice, F-06204, France; Centre Hospitalier Universitaire de Nice, Hôpital L'Archet, Nice Cedex 3, F-06202, France
Christophe Desterke: Inserm, US33, Villejuif, F-94800, France; Univ Paris-Sud, US33, Villejuif, F-94800, France
Didier Samuel: Inserm, Unité 1193, Villejuif, F-94800, France; Univ Paris-Sud, UMR-S1193, Villejuif, F-94800, France; DHU Hepatinov, Villejuif, F-94800, France; AP-HP, Hôpital Paul-Brousse, Centre Hépato-Biliaire, Villejuif, F-94800, France
Jean-Charles Duclos-Vallée: Inserm, Unité 1193, Villejuif, F-94800, France; Univ Paris-Sud, UMR-S1193, Villejuif, F-94800, France; DHU Hepatinov, Villejuif, F-94800, France; AP-HP, Hôpital Paul-Brousse, Centre Hépato-Biliaire, Villejuif, F-94800, France
David Touboul: Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Univ. Paris-Sud, Université Paris-Saclay, F-91198 Gif-Sur-Yvette, France
Justine Bertrand-Michel: MetaToul-Lipidomic Facility, MetaboHUB, Inserm UMR1048, Toulouse, F-31432, France
Alain Brunelle: Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Univ. Paris-Sud, Université Paris-Saclay, F-91198 Gif-Sur-Yvette, France
Catherine Guettier: Inserm, Unité 1193, Villejuif, F-94800, France; Univ Paris-Sud, UMR-S1193, Villejuif, F-94800, France; DHU Hepatinov, Villejuif, F-94800, France; AP-HP, Hôpital du Kremlin-Bicêtre, Service d'Anatomopathologie, Le Kremlin-Bicêtre, F-94275, France
François Le Naour: Inserm, Unité 1193, Villejuif, F-94800, France; Univ Paris-Sud, UMR-S1193, Villejuif, F-94800, France; DHU Hepatinov, Villejuif, F-94800, France; Inserm, US33, Villejuif, F-94800, France; Univ Paris-Sud, US33, Villejuif, F-94800, France

Pang H, Wang X. Statistical aspect of translational and correlative studies in clinical trials. Chin Clin Oncol 2017;5:11. [PMID: 26932435 DOI: 10.3978/j.issn.2304-3865.2014.07.04] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2014] [Accepted: 06/18/2014] [Indexed: 01/07/2023]

Fabres PJ, Collins C, Cavagnaro TR, Rodríguez López CM. A Concise Review on Multi-Omics Data Integration for Terroir Analysis in Vitis vinifera. FRONTIERS IN PLANT SCIENCE 2017;8:1065. [PMID: 28676813 PMCID: PMC5477006 DOI: 10.3389/fpls.2017.01065] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 06/02/2017] [Indexed: 05/19/2023]

Lim S, Park Y, Hur B, Kim M, Han W, Kim S. Protein interaction network (PIN)-based breast cancer subsystem identification and activation measurement for prognostic modeling. Methods 2016;110:81-89. [DOI: 10.1016/j.ymeth.2016.06.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Revised: 05/31/2016] [Accepted: 06/17/2016] [Indexed: 12/20/2022] Open

Estrada-Carmona N, Harper EB, DeClerck F, Fremier AK. Quantifying model uncertainty to improve watershed-level ecosystem service quantification: a global sensitivity analysis of the RUSLE. INTERNATIONAL JOURNAL OF BIODIVERSITY SCIENCE, ECOSYSTEM SERVICES & MANAGEMENT 2016. [DOI: 10.1080/21513732.2016.1237383] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open

Zheng B, Liu J, Gu J, Du J, Wang L, Gu S, Cheng J, Yang J, Lu H. Classification of Benign and Malignant Thyroid Nodules Using a Combined Clinical Information and Gene Expression Signatures. PLoS One 2016;11:e0164570. [PMID: 27776138 PMCID: PMC5077123 DOI: 10.1371/journal.pone.0164570] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2016] [Accepted: 09/27/2016] [Indexed: 01/08/2023] Open

Affiliation(s)

Bing Zheng: Shanghai Institute of Medical Genetics, Shanghai Children’s Hospital, Shanghai Jiao Tong University, Shanghai, China; Department of Laboratory Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
Jun Liu: Department of Otolaryngology, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China; Department of Otolaryngology-Head and Neck Surgery, Xinhua Hospital, School of Medicine, Shanghai Jiaotong University, Shanghai, China; Ear Institute, Shanghai Jiaotong University, Shanghai, China
Jianlei Gu: Shanghai Institute of Medical Genetics, Shanghai Children’s Hospital, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of Molecular Embryology, Ministry of Health and Shanghai Key Laboratory of Embryo and Reproduction Engineering, Shanghai, China
Jing Du: Department of Ultrasonography, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
Lin Wang: Department of Ultrasonography, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
Shengli Gu: Department of Ultrasonography, Xinhua Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
Juan Cheng: Department of Ultrasonography, Xinhua Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
Jun Yang: Department of Otolaryngology-Head and Neck Surgery, Xinhua Hospital, School of Medicine, Shanghai Jiaotong University, Shanghai, China; Ear Institute, Shanghai Jiaotong University, Shanghai, China
Hui Lu: Shanghai Institute of Medical Genetics, Shanghai Children’s Hospital, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of Molecular Embryology, Ministry of Health and Shanghai Key Laboratory of Embryo and Reproduction Engineering, Shanghai, China; Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America

Metric forests based on Gaussian mixture model for visual image classification. Soft comput 2016. [DOI: 10.1007/s00500-016-2350-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Bayesian Semiparametric Model for Pathway-Based Analysis with Zero-Inflated Clinical Outcomes. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2016. [DOI: 10.1007/s13253-016-0264-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Li A, Zang Q, Sun D, Wang M. A text feature-based approach for literature mining of lncRNA–protein interactions. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2015.11.110] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]

Chiappini F, Desterke C, Bertrand-Michel J, Guettier C, Le Naour F. Hepatic and serum lipid signatures specific to nonalcoholic steatohepatitis in murine models. Sci Rep 2016;6:31587. [PMID: 27510159 PMCID: PMC4980672 DOI: 10.1038/srep31587] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2016] [Accepted: 07/19/2016] [Indexed: 01/01/2023] Open

Chan WH, Mohamad MS, Deris S, Zaki N, Kasim S, Omatu S, Corchado JM, Al Ashwal H. Identification of informative genes and pathways using an improved penalized support vector machine with a weighting scheme. Comput Biol Med 2016;77:102-15. [PMID: 27522238 DOI: 10.1016/j.compbiomed.2016.08.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Revised: 08/03/2016] [Accepted: 08/03/2016] [Indexed: 01/03/2023]

Cabrera-Barona P, Blaschke T, Kienberger S. Explaining Accessibility and Satisfaction Related to Healthcare: A Mixed-Methods Approach. SOCIAL INDICATORS RESEARCH 2016;133:719-739. [PMID: 28890596 PMCID: PMC5569143 DOI: 10.1007/s11205-016-1371-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 05/23/2016] [Indexed: 05/09/2023]

Abstract

Accessibility and satisfaction related to healthcare services are conceived as multidimensional concepts. These concepts can be studied using objective and subjective measures. In this study, we created two indices: a composite healthcare accessibility index (CHCA) and a composite healthcare satisfaction index (CHCS). To calculate the CHCA index we used three indicators based on three components of multidimensional healthcare accessibility: availability, acceptability and accessibility. In the indicator based on the component of accessibility, we included an innovative perceived time-decay parameter. The three indicators of the CHCA index were weighted through the application of a principal components analysis. To calculate the CHCS index, we used three indicators: the waiting time after the patient arrives at the healthcare service, the quality of the healthcare, and the healthcare service supply. These three indicators making up the CHCS index were weighted by applying an analytical hierarchy process. Three kinds of regressions were subsequently applied in order to explain the CHCA and CHCS indices: namely the Linear Least Squares, Ordinal Logistic, and Random Forests regressions. In these regressions, we used different independent social and health-related variables. These variables represented the predisposing, enabling, and need factors of people's behaviors related to healthcare. All the calculations were applied to a study area: the city of Quito, Ecuador. Results showed that there are health-related inequalities in regard to healthcare accessibility and healthcare satisfaction in our study area. We also identified specific social factors that explained the indices developed. The present work is a mixed-methods approach to evaluate multidimensional healthcare accessibility and healthcare satisfaction, incorporating a pluralistic perspective, as well as a multidisciplinary framework. The results obtained can also be considered as tools for healthcare and urban planners, for more integrative social analyses that can improve the quality of life of urban residents.

Hua L, An L, Li L, Zhang Y, Wang C. A bioinformatics strategy for detecting the complexity of Chronic Obstructive Pulmonary Disease in Northern Chinese Han Population. Genes Genet Syst 2016;87:197-209. [PMID: 22976395 DOI: 10.1266/ggs.87.197] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open