Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pang H, Datta D, Zhao H. Pathway analysis using random forests with bivariate node-split for survival outcomes. ACTA ACUST UNITED AC 2009;26:250-8. [PMID: 19933158 DOI: 10.1093/bioinformatics/btp640] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

For:	Pang H, Datta D, Zhao H. Pathway analysis using random forests with bivariate node-split for survival outcomes. ACTA ACUST UNITED AC 2009;26:250-8. [PMID: 19933158 DOI: 10.1093/bioinformatics/btp640] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

Number

Cited by Other Article(s)

Li S, Ni Z, Zhao Y, Hu W, Long Z, Ma H, Zhou G, Luo Y, Geng C. Susceptibility Analysis of Geohazards in the Longmen Mountain Region after the Wenchuan Earthquake. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:ijerph19063229. [PMID: 35328915 PMCID: PMC8953272 DOI: 10.3390/ijerph19063229] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Revised: 03/02/2022] [Accepted: 03/04/2022] [Indexed: 12/10/2022]

Abstract

Multitemporal geohazard susceptibility analysis can not only provide reliable results but can also help identify the differences in the mechanisms of different elements under different temporal and spatial backgrounds, so as to better accurately prevent and control geohazards. Here, we studied the 12 counties (cities) that were severely affected by the Wenchuan earthquake of 12 May 2008. Our study was divided into four time periods: 2008, 2009–2012, 2013, and 2014–2017. Common geohazards in the study area, such as landslides, collapses and debris flows, were taken into account. We constructed a geohazard susceptibility index evaluation system that included topography, geology, land cover, meteorology, hydrology, and human activities. Then we used a random forest model to study the changes in geohazard susceptibility during the Wenchuan earthquake, the following ten years, and its driving mechanisms. We had four main findings. (1) The susceptibility of geohazards from 2008 to 2017 gradually increased and their spatial distribution was significantly correlated with the main faults and rivers. (2) The Yingxiu-Beichuan Fault, the western section of the Jiangyou-Dujiangyan Fault, and the Minjiang and Fujiang rivers were highly susceptible to geohazards, and changes in geohazard susceptibility mainly occurred along the Pingwu-Qingchuan Fault, the eastern section of the Jiangyou-Dujiangyan Fault, and the riparian areas of the Mianyuan River, Zagunao River, Tongkou River, Baicao River, and other secondary rivers. (3) The relative contribution of topographic factors to geohazards in the four different periods was stable, geological factors slowly decreased, and meteorological and hydrological factors increased. In addition, the impact of land cover in 2008 was more significant than during other periods, and the impact of human activities had an upward trend from 2008 to 2017. (4) Elevation and slope had significant topographical effects, coupled with the geological environmental effects of engineering rock groups and faults, and river-derived effects, which resulted in a spatial aggregation of geohazard susceptibility. We attributed the dynamic changes in the areas that were highly susceptible to geohazards around the faults and rivers to the changes in the intensity of earthquakes and precipitation in different periods.

Collapse

Zhang L, Kim I. Finite mixtures of semiparametric Bayesian survival kernel machine regressions: Application to breast cancer gene pathway subgroup analysis. J R Stat Soc Ser C Appl Stat 2020. [DOI: 10.1111/rssc.12457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Yan KK, Wang X, Lam WWT, Vardhanabhuti V, Lee AWM, Pang HH. Radiomics analysis using stability selection supervised component analysis for right-censored survival data. Comput Biol Med 2020;124:103959. [PMID: 32905923 PMCID: PMC7501167 DOI: 10.1016/j.compbiomed.2020.103959] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 08/02/2020] [Accepted: 08/03/2020] [Indexed: 02/03/2023]

Wang Y, Sun D, Wen H, Zhang H, Zhang F. Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China). INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17124206. [PMID: 32545618 PMCID: PMC7345078 DOI: 10.3390/ijerph17124206] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 06/09/2020] [Accepted: 06/10/2020] [Indexed: 12/05/2022]

Abstract

To compare the random forest (RF) model and the frequency ratio (FR) model for landslide susceptibility mapping (LSM), this research selected Yunyang Country as the study area for its frequent natural disasters; especially landslides. A landslide inventory was built by historical records; satellite images; and extensive field surveys. Subsequently; a geospatial database was established based on 987 historical landslides in the study area. Then; all the landslides were randomly divided into two datasets: 70% of them were used as the training dataset and 30% as the test dataset. Furthermore; under five primary conditioning factors (i.e., topography factors; geological factors; environmental factors; human engineering activities; and triggering factors), 22 secondary conditioning factors were selected to form an evaluation factor library for analyzing the landslide susceptibility. On this basis; the RF model training and the FR model mathematical analysis were performed; and the established models were used for the landslide susceptibility simulation in the entire area of Yunyang County. Next; based on the analysis results; the susceptibility maps were divided into five classes: very low; low; medium; high; and very high. In addition; the importance of conditioning factors was ranked and the influence of landslides was explored by using the RF model. The area under the curve (AUC) value of receiver operating characteristic (ROC) curve; precision; accuracy; and recall ratio were used to analyze the predictive ability of the above two LSM models. The results indicated a difference in the performances between the two models. The RF model (AUC = 0.988) performed better than the FR model (AUC = 0.716). Moreover; compared with the FR model; the RF model showed a higher coincidence degree between the areas in the high and the very low susceptibility classes; on the one hand; and the geographical spatial distribution of historical landslides; on the other hand. Therefore; it was concluded that the RF model was more suitable for landslide susceptibility evaluation in Yunyang County; because of its significant model performance; reliability; and stability. The outcome also provided a theoretical basis for application of machine learning techniques (e.g., RF) in landslide prevention; mitigation; and urban planning; so as to deliver an adequate response to the increasing demand for effective and low-cost tools in landslide susceptibility assessments.

Collapse

Predictive Features of Thymic Carcinoma and High-Risk Thymomas Using Random Forest Analysis. J Comput Assist Tomogr 2020;44:857-864. [PMID: 31996651 DOI: 10.1097/rct.0000000000000953] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Hosni M, Abnane I, Idri A, Carrillo de Gea JM, Fernández Alemán JL. Reviewing ensemble classification methods in breast cancer. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2019;177:89-112. [PMID: 31319964 DOI: 10.1016/j.cmpb.2019.05.019] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 05/16/2019] [Accepted: 05/18/2019] [Indexed: 05/09/2023]

Abstract

CONTEXT

Ensemble methods consist of combining more than one single technique to solve the same task. This approach was designed to overcome the weaknesses of single techniques and consolidate their strengths. Ensemble methods are now widely used to carry out prediction tasks (e.g. classification and regression) in several fields, including that of bioinformatics. Researchers have particularly begun to employ ensemble techniques to improve research into breast cancer, as this is the most frequent type of cancer and accounts for most of the deaths among women.

OBJECTIVE AND METHOD

The goal of this study is to analyse the state of the art in ensemble classification methods when applied to breast cancer as regards 9 aspects: publication venues, medical tasks tackled, empirical and research types adopted, types of ensembles proposed, single techniques used to construct the ensembles, validation framework adopted to evaluate the proposed ensembles, tools used to build the ensembles, and optimization methods used for the single techniques. This paper was undertaken as a systematic mapping study.

RESULTS

A total of 193 papers that were published from the year 2000 onwards, were selected from four online databases: IEEE Xplore, ACM digital library, Scopus and PubMed. This study found that of the six medical tasks that exist, the diagnosis medical task was that most frequently researched, and that the experiment-based empirical type and evaluation-based research type were the most dominant approaches adopted in the selected studies. The homogeneous type was that most widely used to perform the classification task. With regard to single techniques, this mapping study found that decision trees, support vector machines and artificial neural networks were those most frequently adopted to build ensemble classifiers. In the case of the evaluation framework, the Wisconsin Breast Cancer dataset was the most frequently used by researchers to perform their experiments, while the most noticeable validation method was k-fold cross-validation. Several tools are available to perform experiments related to ensemble classification methods, such as Weka and R Software. Few researchers took into account the optimisation of the single technique of which their proposed ensemble was composed, while the grid search method was that most frequently adopted to tune the parameter settings of a single classifier.

CONCLUSION

This paper reports an in-depth study of the application of ensemble methods as regards breast cancer. Our results show that there are several gaps and issues and we, therefore, provide researchers in the field of breast cancer research with recommendations. Moreover, after analysing the papers found in this systematic mapping study, we discovered that the majority report positive results concerning the accuracy of ensemble classifiers when compared to the single classifiers. In order to aggregate the evidence reported in literature, it will, therefore, be necessary to perform a systematic literature review and meta-analysis in which an in-depth analysis could be conducted so as to confirm the superiority of ensemble classifiers over the classical techniques.

Collapse

Dereli O, Oğuz C, Gönen M. Path2Surv: Pathway/gene set-based survival analysis using multiple kernel learning. Bioinformatics 2019;35:5137-5145. [DOI: 10.1093/bioinformatics/btz446] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 05/17/2019] [Accepted: 05/25/2019] [Indexed: 12/18/2022] Open

Sun J, Herazo-Maya JD, Wang JL, Kaminski N, Zhao H. LCox: a tool for selecting genes related to survival outcomes using longitudinal gene expression data. Stat Appl Genet Mol Biol 2019;18:/j/sagmb.ahead-of-print/sagmb-2017-0060/sagmb-2017-0060.xml. [PMID: 30759070 DOI: 10.1515/sagmb-2017-0060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Wu Q, Wang H, Yan X, Liu X. MapReduce-based adaptive random forest algorithm for multi-label classification. Neural Comput Appl 2018. [DOI: 10.1007/s00521-018-3900-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Zhang L, Kim I. Semiparametric Bayesian kernel survival model for evaluating pathway effects. Stat Methods Med Res 2018;28:3301-3317. [PMID: 30289021 DOI: 10.1177/0962280218797360] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Wang W, Liu W. Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery. Sci Rep 2018;8:13202. [PMID: 30181543 PMCID: PMC6123437 DOI: 10.1038/s41598-018-31497-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 08/20/2018] [Indexed: 02/05/2023] Open

Huang Z, Huang C, Xie J, Ma J, Cao G, Huang Q, Shen B, Byers Kraus V, Pei F. Analysis of a large data set to identify predictors of blood transfusion in primary total hip and knee arthroplasty. Transfusion 2018;58:1855-1862. [PMID: 30145838 DOI: 10.1111/trf.14783] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Revised: 03/05/2018] [Accepted: 03/05/2018] [Indexed: 02/05/2023]

Wang H, Chen X, Li G. Survival Forests with R-Squared Splitting Rules. J Comput Biol 2018;25:388-395. [PMID: 29265882 PMCID: PMC5905875 DOI: 10.1089/cmb.2017.0107] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Gong X, Hu M, Zhao L. Big Data Toolsets to Pharmacometrics: Application of Machine Learning for Time-to-Event Analysis. Clin Transl Sci 2018. [PMID: 29536640 PMCID: PMC5944589 DOI: 10.1111/cts.12541] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

Ow GS, Tang Z, Kuznetsov VA. Big data and computational biology strategy for personalized prognosis. Oncotarget 2018;7:40200-40220. [PMID: 27229533 PMCID: PMC5130003 DOI: 10.18632/oncotarget.9571] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 05/01/2016] [Indexed: 01/05/2023] Open

Pang H, Wang X. Statistical aspect of translational and correlative studies in clinical trials. Chin Clin Oncol 2017;5:11. [PMID: 26932435 DOI: 10.3978/j.issn.2304-3865.2014.07.04] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2014] [Accepted: 06/18/2014] [Indexed: 01/07/2023]

Pang H, Kim I, Zhao H. Random Effects Model for Multiple Pathway Analysis with Applications to Type II Diabetes Microarray Data. STATISTICS IN BIOSCIENCES 2015;7:167-186. [PMID: 26640601 PMCID: PMC4666561 DOI: 10.1007/s12561-014-9109-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Jing GJ, Zhang Z, Wang HQ, Zheng HM. Mining gene link information for survival pathway hunting. IET Syst Biol 2015;9:147-54. [PMID: 26243831 DOI: 10.1049/iet-syb.2014.0048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Ye S, Dawson JA, Kendziorski C. Extending information retrieval methods to personalized genomic-based studies of disease. Cancer Inform 2015;13:85-95. [PMID: 25733795 PMCID: PMC4332045 DOI: 10.4137/cin.s16354] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Revised: 10/22/2014] [Accepted: 10/23/2014] [Indexed: 01/30/2023] Open

Pang H, Zhao H. Stratified pathway analysis to identify gene sets associated with oral contraceptive use and breast cancer. Cancer Inform 2014;13:73-8. [PMID: 25574128 PMCID: PMC4263464 DOI: 10.4137/cin.s13973] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2014] [Revised: 08/15/2014] [Accepted: 08/19/2014] [Indexed: 01/02/2023] Open

Dellinger AE, Nixon AB, Pang H. Integrative Pathway Analysis Using Graph-Based Learning with Applications to TCGA Colon and Ovarian Data. Cancer Inform 2014;13:1-9. [PMID: 25125969 PMCID: PMC4125381 DOI: 10.4137/cin.s13634] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 03/17/2014] [Accepted: 03/18/2014] [Indexed: 12/15/2022] Open

Pang H, Jung SH. Sample size considerations of prediction-validation methods in high-dimensional data for survival outcomes. Genet Epidemiol 2013;37:276-82. [PMID: 23471879 DOI: 10.1002/gepi.21721] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2012] [Revised: 01/21/2013] [Accepted: 02/09/2013] [Indexed: 11/09/2022]

Chen X, Ishwaran H. Pathway hunting by random survival forests. Bioinformatics 2013;29:99-105. [PMID: 23129299 PMCID: PMC3530909 DOI: 10.1093/bioinformatics/bts643] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Revised: 07/18/2012] [Accepted: 10/17/2012] [Indexed: 01/22/2023] Open

Pang H, George SL, Hui K, Tong T. Gene selection using iterative feature elimination random forests for survival outcomes. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012;9:1422-31. [PMID: 22547432 PMCID: PMC3495190 DOI: 10.1109/tcbb.2012.63] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Chen X, Ishwaran H. Random forests for genomic data analysis. Genomics 2012;99:323-9. [PMID: 22546560 PMCID: PMC3387489 DOI: 10.1016/j.ygeno.2012.04.003] [Citation(s) in RCA: 380] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2012] [Revised: 04/11/2012] [Accepted: 04/14/2012] [Indexed: 11/25/2022]

Development and validation of a quantitative real-time polymerase chain reaction classifier for lung cancer prognosis. J Thorac Oncol 2011;6:1481-7. [PMID: 21792073 DOI: 10.1097/jto.0b013e31822918bd] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Wood DJ, Buttar D, Cumming JG, Davis AM, Norinder U, Rodgers SL. Automated QSAR with a Hierarchy of Global and Local Models. Mol Inform 2011;30:960-72. [DOI: 10.1002/minf.201100107] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2011] [Accepted: 10/13/2011] [Indexed: 11/06/2022]

Porzelius C, Johannes M, Binder H, Beißbarth T. Leveraging external knowledge on molecular interactions in classification methods for risk prediction of patients. Biom J 2011;53:190-201. [DOI: 10.1002/bimj.201000155] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2010] [Revised: 10/22/2010] [Accepted: 10/29/2010] [Indexed: 12/17/2022]

Pathway-based identification of SNPs predictive of survival. Eur J Hum Genet 2011;19:704-9. [PMID: 21368918 DOI: 10.1038/ejhg.2011.3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open