Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kaderali L, Zander T, Faigle U, Wolf J, Schultze JL, Schrader R. CASPAR: a hierarchical bayesian approach to predict survival times in cancer from gene expression data. Bioinformatics 2006;22:1495-502. [PMID: 16554338 DOI: 10.1093/bioinformatics/btl103] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

For:	Kaderali L, Zander T, Faigle U, Wolf J, Schultze JL, Schrader R. CASPAR: a hierarchical bayesian approach to predict survival times in cancer from gene expression data. Bioinformatics 2006;22:1495-502. [PMID: 16554338 DOI: 10.1093/bioinformatics/btl103] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Number

Cited by Other Article(s)

Yao A, Wang L, Qi F, Li J, Meng J, Jiang T, He Y, Lai W. Risk factors and early detection of joint damage in patients with psoriasis: a case-control study. Int J Dermatol 2024. [PMID: 38682296 DOI: 10.1111/ijd.17212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 04/09/2024] [Accepted: 04/10/2024] [Indexed: 05/01/2024]

Abstract

BACKGROUND

Our aim was to target the unsatisfied need for early detection of the at-risk population and determine the subgroup of patients whose psoriasis (PsO) could transform into psoriatic arthritis (PsA).

METHODS

A retrospective and longitudinal case-control study was conducted at Beijing Chao-yang Hospital. It included 75 patients who were clinically diagnosed with PsA in the case group and 345 who solely suffered from PsO without PsA in the control group. A variety of baseline covariates were gathered from every patient with PsO. Univariate and multivariate analyses and receiver operating characteristic (ROC) curves were used to identify underlying risk factors and determine whether it was necessary to examine the imaging of PsO patients.

RESULTS

In multivariate logistic regression analysis, age ≥40 (odds ratio (OR): 1.04, 95% confidence interval (CI): 1.02-1.06, P < 0.01), nail involvement (OR: 1.17, 95% CI: 1.09-1.32, P < 0.01), erythrocyte sedimentation rate (ESR) (OR: 1.03, 95% CI: 1.01-1.06, P < 0.05) and elevated high-sensitivity C-reactive protein (hs-CRP) (OR: 1.31, 95% CI: 1.13-1.53, P < 0.01) were perceived to be risk factors for the transformation from PsO into clinical PsA. By combining magnetic resonance imaging (MRI)-detected enthesitis with tenosynovitis, combined predictors demonstrated better diagnostic efficacy, with an improvement in specificity (94.3% vs. 69%) and similarities in sensitivity (89% vs. 84.6%). The areas under the ROC curve (AUCs) amounted to 0.925 (95% CI: 0.882-0.967, P < 0.01) and 0.858 (95% CI: 0.814-0.903, P < 0.01).

CONCLUSIONS

It was identified that age ≥40, nail involvement, as well as an elevated ESR, and hs-CRP served as independent risk factors for PsO transforming into PsA. Additionally, MRI provides additional value for the early recognition of PsA.

Collapse

Chu J, Sun N, Hu W, Chen X, Yi N, Shen Y. Bayesian hierarchical lasso Cox model: A 9-gene prognostic signature for overall survival in gastric cancer in an Asian population. PLoS One 2022;17:e0266805. [PMID: 35421138 PMCID: PMC9009599 DOI: 10.1371/journal.pone.0266805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 03/29/2022] [Indexed: 12/24/2022] Open

Abstract Objective Gastric cancer (GC) is one of the most common tumour diseases worldwide and has poor survival, especially in the Asian population. Exploration based on biomarkers would be efficient for better diagnosis, prediction, and targeted therapy. Methods Expression profiles were downloaded from the Gene Expression Omnibus (GEO) database. Survival-related genes were identified by gene set enrichment analysis (GSEA) and univariate Cox. Then, we applied a Bayesian hierarchical lasso Cox model for prognostic signature screening. Protein-protein interaction and Spearman analysis were performed. Kaplan–Meier and receiver operating characteristic (ROC) curve analysis were applied to evaluate the prediction performance. Multivariate Cox regression was used to identify prognostic factors, and a prognostic nomogram was constructed for clinical application. Results With the Bayesian lasso Cox model, a 9-gene signature included TNFRSF11A, NMNAT1, EIF5A, NOTCH3, TOR2A, E2F8, PSMA5, TPMT, and KIF11 was established to predict overall survival in GC. Protein-protein interaction analysis indicated that E2F8 was likely related to KIF11. Kaplan-Meier analysis showed a significant difference between the high-risk and low-risk groups (P<0.001). Multivariate analysis demonstrated that the 9-gene signature was an independent predictor (HR = 2.609, 95% CI 2.017–3.370), and the C-index of the integrative model reached 0.75. Function enrichment analysis for different risk groups revealed the most significant enrichment pathway/term, including pyrimidine metabolism and respiratory electron transport chain. Conclusion Our findings suggested that a novel prognostic model based on a 9-gene signature was developed to predict GC patients in high-risk and improve prediction performance. We hope our model could provide a reference for risk classification and clinical decision-making. Collapse

Chen CK. Inference of genetic regulatory networks with regulatory hubs using vector autoregressions and automatic relevance determination with model selections. Stat Appl Genet Mol Biol 2021;20:121-143. [PMID: 34963205 DOI: 10.1515/sagmb-2020-0054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 11/15/2021] [Indexed: 12/11/2022]

Jia M, Li Z, Pan M, Tao M, Lu X, Liu Y. Evaluation of immune infiltrating of thyroid cancer based on the intrinsic correlation between pair-wise immune genes. Life Sci 2020;259:118248. [PMID: 32791153 DOI: 10.1016/j.lfs.2020.118248] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 07/09/2020] [Accepted: 08/07/2020] [Indexed: 10/23/2022]

Chen CK. Inference of gene networks from gene expression time series using recurrent neural networks and sparse MAP estimation. J Bioinform Comput Biol 2018;16:1850009. [PMID: 30051742 DOI: 10.1142/s0219720018500099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

BACKGROUND

The inference of genetic regulatory networks (GRNs) provides insight into the cellular responses to signals. A class of recurrent neural networks (RNNs) capturing the dynamics of GRN has been used as a basis for inferring small-scale GRNs from gene expression time series. The Bayesian framework facilitates incorporating the hypothesis of GRN into the model estimation to improve the accuracy of GRN inference.

RESULTS

We present new methods for inferring small-scale GRNs based on RNNs. The weights of wires of RNN represent the strengths of gene-to-gene regulatory interactions. We use a class of automatic relevance determination (ARD) priors to enforce the sparsity in the maximum a posteriori (MAP) estimates of wire weights of RNN. A particle swarm optimization (PSO) is integrated as an optimization engine into the MAP estimation process. Likely networks of genes generated based on estimated wire weights are combined using the majority rule to determine a final estimated GRN. As an alternative, a class of <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -norm ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>q</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math> ) priors is used for attaining the sparse MAP estimates of wire weights of RNN. We also infer the GRN using the maximum likelihood (ML) estimates of wire weights of RNN. The RNN-based GRN inference algorithms, ARD-RNN, <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -RNN, and ML-RNN are tested on simulated and experimental E. coli and yeast time series containing 6-11 genes and 7-19 data points. Published GRN inference algorithms based on regressions and mutual information networks are performed on the benchmark datasets to compare performances.

CONCLUSION

ARD and <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -norm priors are used for the estimation of wire weights of RNN. Results of GRN inference experiments show that ARD-RNN, <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -RNN have similar best accuracies on the simulated time series. The ARD-RNN is more accurate than <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -RNN, ML-RNN, and mostly more accurate than the reference algorithms on the experimental time series. The effectiveness of ARD-RNN for inferring small-scale GRNs using gene expression time series of limited length is empirically verified.

Collapse

Ow GS, Tang Z, Kuznetsov VA. Big data and computational biology strategy for personalized prognosis. Oncotarget 2018;7:40200-40220. [PMID: 27229533 PMCID: PMC5130003 DOI: 10.18632/oncotarget.9571] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 05/01/2016] [Indexed: 01/05/2023] Open

Cui Y, Li B, Li R. Decentralized Learning Framework of Meta-Survival Analysis for Developing Robust Prognostic Signatures. JCO Clin Cancer Inform 2017;1:1-13. [PMID: 30657395 PMCID: PMC6873986 DOI: 10.1200/cci.17.00077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

PURPOSE

A significant hurdle in developing reliable gene expression-based prognostic models has been the limited sample size, which can cause overfitting and false discovery. Combining data from multiple studies can enhance statistical power and reduce spurious findings, but how to address the biologic heterogeneity across different datasets remains a major challenge. Better meta-survival analysis approaches are needed.

MATERIAL AND METHODS

We presented a decentralized learning framework for meta-survival analysis without the need for data aggregation. Our method consisted of a series of proposals that together alleviated the influence of data heterogeneity and improved the performance of survival prediction. First, we transformed the gene expression profile of every sample into normalized percentile ranks to obtain platform-agnostic features. Second, we used Stouffer's meta-z approach in combination with Harrell's concordance index to prioritize and select genes to be included in the model. Third, we used survival discordance as a scale-independent model loss function. Instead of generating a merged dataset and training the model therein, we avoided comparing patients across datasets and individually evaluated the loss function on each dataset. Finally, we optimized the model by minimizing the joint loss function.

RESULTS

Through comprehensive evaluation on 31 public microarray datasets containing 6,724 samples of several cancer types, we demonstrated that the proposed method has outperformed (1) single prognostic genes identified using conventional meta-analysis, (2) multigene signatures trained on single datasets, (3) multigene signatures trained on merged datasets as well as by other existing meta-analysis methods, and (4) clinically applicable, established multigene signatures.

CONCLUSION

The decentralized learning approach can be used to effectively perform meta-analysis of gene expression data and to develop robust multigene prognostic signatures.

Collapse

Reconstructing Genetic Regulatory Networks Using Two-Step Algorithms with the Differential Equation Models of Neural Networks. Interdiscip Sci 2017;10:823-835. [PMID: 28748400 DOI: 10.1007/s12539-017-0254-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2016] [Revised: 07/01/2017] [Accepted: 07/14/2017] [Indexed: 10/19/2022]

Abstract

BACKGROUND

The identification of genetic regulatory networks (GRNs) provides insights into complex cellular processes. A class of recurrent neural networks (RNNs) captures the dynamics of GRN. Algorithms combining the RNN and machine learning schemes were proposed to reconstruct small-scale GRNs using gene expression time series.

RESULTS

We present new GRN reconstruction methods with neural networks. The RNN is extended to a class of recurrent multilayer perceptrons (RMLPs) with latent nodes. Our methods contain two steps: the edge rank assignment step and the network construction step. The former assigns ranks to all possible edges by a recursive procedure based on the estimated weights of wires of RNN/RMLP (RE_RNN/RE_RMLP), and the latter constructs a network consisting of top-ranked edges under which the optimized RNN simulates the gene expression time series. The particle swarm optimization (PSO) is applied to optimize the parameters of RNNs and RMLPs in a two-step algorithm. The proposed RE_RNN-RNN and RE_RMLP-RNN algorithms are tested on synthetic and experimental gene expression time series of small GRNs of about 10 genes. The experimental time series are from the studies of yeast cell cycle regulated genes and E. coli DNA repair genes.

CONCLUSION

The unstable estimation of RNN using experimental time series having limited data points can lead to fairly arbitrary predicted GRNs. Our methods incorporate RNN and RMLP into a two-step structure learning procedure. Results show that the RE_RMLP using the RMLP with a suitable number of latent nodes to reduce the parameter dimension often result in more accurate edge ranks than the RE_RNN using the regularized RNN on short simulated time series. Combining by a weighted majority voting rule the networks derived by the RE_RMLP-RNN using different numbers of latent nodes in step one to infer the GRN, the method performs consistently and outperforms published algorithms for GRN reconstruction on most benchmark time series. The framework of two-step algorithms can potentially incorporate with different nonlinear differential equation models to reconstruct the GRN.

Collapse

Meder L, König K, Ozretić L, Schultheis AM, Ueckeroth F, Ade CP, Albus K, Boehm D, Rommerscheidt-Fuss U, Florin A, Buhl T, Hartmann W, Wolf J, Merkelbach-Bruse S, Eilers M, Perner S, Heukamp LC, Buettner R. NOTCH, ASCL1, p53 and RB alterations define an alternative pathway driving neuroendocrine and small cell lung carcinomas. Int J Cancer 2015;138:927-38. [PMID: 26340530 PMCID: PMC4832386 DOI: 10.1002/ijc.29835] [Citation(s) in RCA: 125] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2015] [Accepted: 08/19/2015] [Indexed: 12/17/2022]

Abstract

Small cell lung cancers (SCLCs) and extrapulmonary small cell cancers (SCCs) are very aggressive tumors arising de novo as primary small cell cancer with characteristic genetic lesions in RB1 and TP53. Based on murine models, neuroendocrine stem cells of the terminal bronchioli have been postulated as the cellular origin of primary SCLC. However, both in lung and many other organs, combined small cell/non‐small cell tumors and secondary transitions from non‐small cell carcinomas upon cancer therapy to neuroendocrine and small cell tumors occur. We define features of “small cell‐ness” based on neuroendocrine markers, characteristic RB1 and TP53 mutations and small cell morphology. Furthermore, here we identify a pathway driving the pathogenesis of secondary SCLC involving inactivating NOTCH mutations, activation of the NOTCH target ASCL1 and canonical WNT‐signaling in the context of mutual bi‐allelic RB1 and TP53 lesions. Additionaly, we explored ASCL1 dependent RB inactivation by phosphorylation, which is reversible by CDK5 inhibition. We experimentally verify the NOTCH‐ASCL1‐RB‐p53 signaling axis in vitro and validate its activation by genetic alterations in vivo. We analyzed clinical tumor samples including SCLC, SCC and pulmonary large cell neuroendocrine carcinomas and adenocarcinomas using amplicon‐based Next Generation Sequencing, immunohistochemistry and fluorescence in situ hybridization. In conclusion, we identified a novel pathway underlying rare secondary SCLC which may drive small cell carcinomas in organs other than lung, as well.

What's new?

Using next generation sequencing and establishing features of ‘small cell‐ness’, we identified a NOTCH‐ASCL1‐RB1‐TP53 signaling axis driving small cell cancers. In contrast to the previously described bi‐allelic RB1/TP53 loss in neuroendocrine stem cells as origin of primary small cell neuroendocrine cancers, the NOTCH‐ASCL1 mediated signaling defines an alternative pathway driving secondary small cell neuroendocrine cancers arising from non‐small cell cancers. Moreover, we show a preclinical rational for therapeutically testing WNT‐inhibitors in small cell cancers.

Collapse

Affiliation(s)

Lydia Meder Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Katharina König Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Luka Ozretić Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Anne M Schultheis Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Frank Ueckeroth Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Carsten P Ade Biocenter, University of Würzburg, Am Hubland, Würzburg, 97074, Germany
Kerstin Albus Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Diana Boehm Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Department of Prostate Cancer Research, Institute of Pathology, University Hospital Bonn, Sigmund-Freud Straße 25, Bonn, 53105, Germany
Ursula Rommerscheidt-Fuss Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Alexandra Florin Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Theresa Buhl Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Wolfgang Hartmann Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Jürgen Wolf Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Clinic for Internal Medicine I, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Sabine Merkelbach-Bruse Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Martin Eilers Biocenter, University of Würzburg, Am Hubland, Würzburg, 97074, Germany
Sven Perner Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Biocenter, University of Würzburg, Am Hubland, Würzburg, 97074, Germany
Lukas C Heukamp Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany
Reinhard Buettner Institute of Pathology, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany.,Center for Integrated Oncology Bonn, University Hospital Bonn, Sigmund-Freud Straße 25, 53105, Bonn, Germany.,Lung Cancer Group Cologne, University Hospital Cologne, Kerpener Straße 62, Cologne, 50937, Germany

Collapse

Attallah O, Ma X. Bayesian neural network approach for determining the risk of re-intervention after endovascular aortic aneurysm repair. Proc Inst Mech Eng H 2014;228:857-66. [PMID: 25212212 DOI: 10.1177/0954411914549980] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Kiani NA, Kaderali L. Dynamic probabilistic threshold networks to infer signaling pathways from time-course perturbation data. BMC Bioinformatics 2014;15:250. [PMID: 25047753 PMCID: PMC4133630 DOI: 10.1186/1471-2105-15-250] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 07/15/2014] [Indexed: 11/10/2022] Open

Boosting the concordance index for survival data--a unified framework to derive and evaluate biomarker combinations. PLoS One 2014;9:e84483. [PMID: 24400093 PMCID: PMC3882229 DOI: 10.1371/journal.pone.0084483] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 11/14/2013] [Indexed: 11/30/2022] Open

Schmid M, Kestler HA, Potapov S. On the validity of time-dependent AUC estimators. Brief Bioinform 2013;16:153-68. [PMID: 24036698 DOI: 10.1093/bib/bbt059] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Kim J, Sohn I, Son DS, Kim DH, Ahn T, Jung SH. Prediction of a time-to-event trait using genome wide SNP data. BMC Bioinformatics 2013;14:58. [PMID: 23418752 PMCID: PMC3651372 DOI: 10.1186/1471-2105-14-58] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Accepted: 02/12/2013] [Indexed: 02/07/2023] Open

Schulte JH, Schowe B, Mestdagh P, Kaderali L, Kalaghatgi P, Schlierf S, Vermeulen J, Brockmeyer B, Pajtler K, Thor T, de Preter K, Speleman F, Morik K, Eggert A, Vandesompele J, Schramm A. Accurate prediction of neuroblastoma outcome based on miRNA expression profiles. Int J Cancer 2010;127:2374-85. [PMID: 20473924 DOI: 10.1002/ijc.25436] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Abstract

For neuroblastoma, the most common extracranial tumour of childhood, identification of new biomarkers and potential therapeutic targets is mandatory to improve risk stratification and survival rates. MicroRNAs are deregulated in most cancers, including neuroblastoma. In this study, we analysed 430 miRNAs in 69 neuroblastomas by stem-loop RT-qPCR. Prediction of event-free survival (EFS) with support vector machines (SVM) and actual survival times with Cox regression-based models (CASPAR) were highly accurate and were independently validated. SVM-accuracy for prediction of EFS was 88.7% (95% CI: 88.5-88.8%). For CASPAR-based predictions, 5y-EFS probability was 0.19% (95% CI: 0-38%) in the CASPAR-predicted short survival group compared with 0.78% (95%CI: 64-93%) in the CASPAR-predicted long survival group. Both classifiers were validated on an independent test set yielding accuracies of 94.74% (SVM) and 5y-EFS probabilities as 0.25 (95% CI: 0.0-0.55) for short versus 1 ± 0.0 for long survival (CASPAR), respectively. Amplification of the MYCN oncogene was highly correlated with deregulation of miRNA expression. In addition, 37 miRNAs correlated with TrkA expression, a marker of excellent outcome, and 6 miRNAs further analysed in vitro were regulated upon TrkA transfection, suggesting a functional relationship. Expression of the most significant TrkA-correlated miRNA, miR-542-5p, also discriminated between local and metastatic disease and was inversely correlated with MYCN amplification and event-free survival. We conclude that neuroblastoma patient outcome prediction using miRNA expression is feasible and effective. Studies testing miRNA-based predictors in comparison to and in combination with mRNA and aCGH information should be initiated. Specific miRNAs (e.g., miR-542-5p) might be important in neuroblastoma tumour biology, and qualify as potential therapeutic targets.

Collapse

Mazur J, Ritter D, Reinelt G, Kaderali L. Reconstructing nonlinear dynamic models of gene regulation using stochastic sampling. BMC Bioinformatics 2009;10:448. [PMID: 20038296 PMCID: PMC2811124 DOI: 10.1186/1471-2105-10-448] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2009] [Accepted: 12/28/2009] [Indexed: 12/01/2022] Open

Abstract

Background

The reconstruction of gene regulatory networks from time series gene expression data is one of the most difficult problems in systems biology. This is due to several reasons, among them the combinatorial explosion of possible network topologies, limited information content of the experimental data with high levels of noise, and the complexity of gene regulation at the transcriptional, translational and post-translational levels. At the same time, quantitative, dynamic models, ideally with probability distributions over model topologies and parameters, are highly desirable.

Results

We present a novel approach to infer such models from data, based on nonlinear differential equations, which we embed into a stochastic Bayesian framework. We thus address both the stochasticity of experimental data and the need for quantitative dynamic models. Furthermore, the Bayesian framework allows it to easily integrate prior knowledge into the inference process. Using stochastic sampling from the Bayes' posterior distribution, our approach can infer different likely network topologies and model parameters along with their respective probabilities from given data. We evaluate our approach on simulated data and the challenge #3 data from the DREAM 2 initiative. On the simulated data, we study effects of different levels of noise and dataset sizes. Results on real data show that the dynamics and main regulatory interactions are correctly reconstructed.

Conclusions

Our approach combines dynamic modeling using differential equations with a stochastic learning framework, thus bridging the gap between biophysical modeling and stochastic inference approaches. Results show that the method can reap the advantages of both worlds, and allows the reconstruction of biophysically accurate dynamic models from noisy data. In addition, the stochastic learning framework used permits the computation of probability distributions over models and model parameters, which holds interesting prospects for experimental design purposes.

Collapse

Pang H, Datta D, Zhao H. Pathway analysis using random forests with bivariate node-split for survival outcomes. ACTA ACUST UNITED AC 2009;26:250-8. [PMID: 19933158 DOI: 10.1093/bioinformatics/btp640] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

Abstract

MOTIVATION

There is great interest in pathway-based methods for genomics data analysis in the research community. Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, no study has assessed the abilities of these methods in incorporating pathway information for analyzing microarray data. In general, genes that are identified without incorporating biological knowledge are more difficult to interpret. Correlating pathway-based gene expression with survival outcomes may lead to biologically more meaningful prognosis biomarkers. Thus, a comprehensive study on how these methods perform in a pathway-based setting is warranted.

RESULTS

In this article, we describe a pathway-based method using random forests to correlate gene expression data with survival outcomes and introduce a novel bivariate node-splitting random survival forests. The proposed method allows researchers to identify important pathways for predicting patient prognosis and time to disease progression, and discover important genes within those pathways. We compared different implementations of random forests with different split criteria and found that bivariate node-splitting random survival forests with log-rank test is among the best. We also performed simulation studies that showed random forests outperforms several other machine learning algorithms and has comparable results with a newly developed component-wise Cox boosting model. Thus, pathway-based survival analysis using machine learning tools represents a promising approach in dissecting pathways and for generating new biological hypothesis from microarray studies.

AVAILABILITY

R package Pwayrfsurvival is available from URL: http://www.duke.edu/~hp44/pwayrfsurvival.htm.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Oberthuer A, Theissen J, Westermann F, Hero B, Fischer M. Molecular characterization and classification of neuroblastoma. Future Oncol 2009;5:625-39. [DOI: 10.2217/fon.09.41] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Sohn I, Kim J, Jung SH, Park C. Gradient lasso for Cox proportional hazards model. Bioinformatics 2009;25:1775-81. [PMID: 19447787 DOI: 10.1093/bioinformatics/btp322] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Reanalysis of neuroblastoma expression profiling data using improved methodology and extended follow-up increases validity of outcome prediction. Cancer Lett 2009;282:55-62. [PMID: 19349112 DOI: 10.1016/j.canlet.2009.02.052] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2008] [Revised: 02/25/2009] [Accepted: 02/26/2009] [Indexed: 11/20/2022]

van Wieringen WN, Kun D, Hampel R, Boulesteix AL. Survival prediction using gene expression data: A review and comparison. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.05.021] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Annest A, Bumgarner RE, Raftery AE, Yeung KY. Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data. BMC Bioinformatics 2009;10:72. [PMID: 19245714 PMCID: PMC2657791 DOI: 10.1186/1471-2105-10-72] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2008] [Accepted: 02/26/2009] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

Microarray technology is increasingly used to identify potential biomarkers for cancer prognostics and diagnostics. Previously, we have developed the iterative Bayesian Model Averaging (BMA) algorithm for use in classification. Here, we extend the iterative BMA algorithm for application to survival analysis on high-dimensional microarray data. The main goal in applying survival analysis to microarray data is to determine a highly predictive model of patients' time to event (such as death, relapse, or metastasis) using a small number of selected genes. Our multivariate procedure combines the effectiveness of multiple contending models by calculating the weighted average of their posterior probability distributions. Our results demonstrate that our iterative BMA algorithm for survival analysis achieves high prediction accuracy while consistently selecting a small and cost-effective number of predictor genes.

RESULTS

We applied the iterative BMA algorithm to two cancer datasets: breast cancer and diffuse large B-cell lymphoma (DLBCL) data. On the breast cancer data, the algorithm selected a total of 15 predictor genes across 84 contending models from the training data. The maximum likelihood estimates of the selected genes and the posterior probabilities of the selected models from the training data were used to divide patients in the test (or validation) dataset into high- and low-risk categories. Using the genes and models determined from the training data, we assigned patients from the test data into highly distinct risk groups (as indicated by a p-value of 7.26e-05 from the log-rank test). Moreover, we achieved comparable results using only the 5 top selected genes with 100% posterior probabilities. On the DLBCL data, our iterative BMA procedure selected a total of 25 genes across 3 contending models from the training data. Once again, we assigned the patients in the validation set to significantly distinct risk groups (p-value = 0.00139).

CONCLUSION

The strength of the iterative BMA algorithm for survival analysis lies in its ability to account for model uncertainty. The results from this study demonstrate that our procedure selects a small number of genes while eclipsing other methods in predictive performance, making it a highly accurate and cost-effective prognostic tool in the clinical setting.

Collapse

Lee ES, Son DS, Kim SH, Lee J, Jo J, Han J, Kim H, Lee HJ, Choi HY, Jung Y, Park M, Lim YS, Kim K, Shim Y, Kim BC, Lee K, Huh N, Ko C, Park K, Lee JW, Choi YS, Kim J. Prediction of recurrence-free survival in postoperative non-small cell lung cancer patients by using an integrated model of clinical information and gene expression. Clin Cancer Res 2009;14:7397-404. [PMID: 19010856 DOI: 10.1158/1078-0432.ccr-07-4937] [Citation(s) in RCA: 205] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Oberthuer A, Kaderali L, Kahlert Y, Hero B, Westermann F, Berthold F, Brors B, Eils R, Fischer M. Subclassification and individual survival time prediction from gene expression data of neuroblastoma patients by using CASPAR. Clin Cancer Res 2008;14:6590-601. [PMID: 18927300 DOI: 10.1158/1078-0432.ccr-07-4377] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract

PURPOSE

To predict individual survival times for neuroblastoma patients from gene expression data using the cancer survival prediction using automatic relevance determination (CASPAR) algorithm.

EXPERIMENTAL DESIGN

A first set of oligonucleotide microarray gene expression profiles comprising 256 neuroblastoma patients was generated. Then, CASPAR was combined with a leave-one-out cross-validation to predict individual times for both the whole cohort and subgroups of patients with unfavorable markers, including stage 4 disease (n = 67), unfavorable genetic alterations, intermediate-risk or high-risk stratification by the German neuroblastoma trial, and patients predicted as unfavorable by a recently described gene expression classifier (n = 83). Prediction accuracy of individual survival times was assessed by Kaplan-Meier analyses and time-dependent receiver operator characteristics curve analyses. Subsequently, classification results were validated in an independent cohort (n = 120).

RESULTS

CASPAR separated patients with divergent outcome in both the initial and the validation cohort [initial set, 5y-OS 0.94 +/- 0.04 (predicted long survival) versus 0.38 +/- 0.17 (predicted short survival), P < 0.0001; validation cohort, 5y-OS 0.94 +/- 0.07 (long) versus 0.40 +/- 0.13 (short), P < 0.0001]. Time-dependent receiver operator characteristics analyses showed that CASPAR-predicted individual survival times were highly accurate (initial set, mean area under the curve for first 10 years of overall survival prediction 0.92 +/- 0.04; validation set, 0.81 +/- 0.05). Furthermore, CASPAR significantly discriminated short (<5 years) from long survivors (>5 years) in subgroups of patients with unfavorable markers with the exception of MYCN-amplified patients (initial set). Confirmatory results with high significance were observed in the validation cohort [stage 4 disease (P = 0.0049), NB2004 intermediate-risk or high-risk stratification (P = 0.0017), and unfavorable gene expression prediction (P = 0.0017)].

CONCLUSIONS

CASPAR accurately forecasts individual survival times for neuroblastoma patients from gene expression data.

Collapse

Zhang YJ, Fang JY. Molecular staging of gastric cancer. J Gastroenterol Hepatol 2008;23:856-60. [PMID: 17854423 DOI: 10.1111/j.1440-1746.2007.05140.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]

Diaz-Uriarte R. SignS: a parallelized, open-source, freely available, web-based tool for gene selection and molecular signatures for survival and censored data. BMC Bioinformatics 2008;9:30. [PMID: 18208605 PMCID: PMC2265264 DOI: 10.1186/1471-2105-9-30] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2007] [Accepted: 01/21/2008] [Indexed: 11/17/2022] Open

Abstract

Background

Censored data are increasingly common in many microarray studies that attempt to relate gene expression to patient survival. Several new methods have been proposed in the last two years. Most of these methods, however, are not available to biomedical researchers, leading to many re-implementations from scratch of ad-hoc, and suboptimal, approaches with survival data.

Results

We have developed SignS (Signatures for Survival data), an open-source, freely-available, web-based tool and R package for gene selection, building molecular signatures, and prediction with survival data. SignS implements four methods which, according to existing reviews, perform well and, by being of a very different nature, offer complementary approaches. We use parallel computing via MPI, leading to large decreases in user waiting time. Cross-validation is used to asses predictive performance and stability of solutions, the latter an issue of increasing concern given that there are often several solutions with similar predictive performance. Biological interpretation of results is enhanced because genes and signatures in models can be sent to other freely-available on-line tools for examination of PubMed references, GO terms, and KEGG and Reactome pathways of selected genes.

Conclusion

SignS is the first web-based tool for survival analysis of expression data, and one of the very few with biomedical researchers as target users. SignS is also one of the few bioinformatics web-based applications to extensively use parallelization, including fault tolerance and crash recovery. Because of its combination of methods implemented, usage of parallel computing, code availability, and links to additional data bases, SignS is a unique tool, and will be of immediate relevance to biomedical researchers, biostatisticians and bioinformaticians.

Collapse

Inferring Gene Regulatory Networks from Expression Data. COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS 2008. [DOI: 10.1007/978-3-540-76803-6_2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]

Schramm A, Vandesompele J, Schulte JH, Dreesmann S, Kaderali L, Brors B, Eils R, Speleman F, Eggert A. Translating expression profiling into a clinically feasible test to predict neuroblastoma outcome. Clin Cancer Res 2007;13:1459-65. [PMID: 17332289 DOI: 10.1158/1078-0432.ccr-06-2032] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]