Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

151

Bhatnagar SR, Yang Y, Lu T, Schurr E, Loredo-Osti JC, Forest M, Oualkacha K, Greenwood CMT. Simultaneous SNP selection and adjustment for population structure in high dimensional prediction models. PLoS Genet 2020;16:e1008766. [PMID: 32365090 PMCID: PMC7224575 DOI: 10.1371/journal.pgen.1008766] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 05/14/2020] [Accepted: 04/08/2020] [Indexed: 12/23/2022] Open

152

Detmer FJ, Cebral J, Slawski M. A note on coding and standardization of categorical variables in (sparse) group lasso regression. J Stat Plan Inference 2020. [DOI: 10.1016/j.jspi.2019.08.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

153

Evaluation of secondary ions related to plant tissue using least absolute shrinkage and selection operator. Biointerphases 2020;15:021010. [PMID: 32272844 DOI: 10.1116/6.0000010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

154

Investigating matrix effects of different combinations of lipids and peptides on TOF-SIMS data. Biointerphases 2020;15:021008. [PMID: 32241114 DOI: 10.1116/6.0000036] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

155

Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV. ALGORITHMS 2020. [DOI: 10.3390/a13030073] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

156

Gupta S, Lee REC, Faeder JR. Parallel Tempering with Lasso for model reduction in systems biology. PLoS Comput Biol 2020;16:e1007669. [PMID: 32150537 PMCID: PMC7082068 DOI: 10.1371/journal.pcbi.1007669] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 03/19/2020] [Accepted: 01/20/2020] [Indexed: 01/08/2023] Open

Abstract

Systems Biology models reveal relationships between signaling inputs and observable molecular or cellular behaviors. The complexity of these models, however, often obscures key elements that regulate emergent properties. We use a Bayesian model reduction approach that combines Parallel Tempering with Lasso regularization to identify minimal subsets of reactions in a signaling network that are sufficient to reproduce experimentally observed data. The Bayesian approach finds distinct reduced models that fit data equivalently. A variant of this approach that uses Lasso to perform selection at the level of reaction modules is applied to the NF-κB signaling network to test the necessity of feedback loops for responses to pulsatile and continuous pathway stimulation. Taken together, our results demonstrate that Bayesian parameter estimation combined with regularization can isolate and reveal core motifs sufficient to explain data from complex signaling systems.

Cells respond to diverse environmental cues using complex networks of interacting proteins and other biomolecules. Mathematical and computational models have become invaluable tools to understand these networks and make informed predictions to rationally perturb cell behavior. However, the complexity of detailed models that try to capture all known biochemical elements of signaling networks often makes it difficult to determine the key regulatory elements that are responsible for specific cell behaviors. Here, we present a Bayesian computational approach, PTLasso, to automatically extract minimal subsets of detailed models that are sufficient to explain experimental data. The method simultaneously calibrates and reduces models, and the Bayesian approach samples globally, allowing us to find alternate mechanistic explanations for the data if present. We demonstrate the method on both synthetic and real biological data and show that PTLasso is an effective method to isolate distinct parts of a larger signaling model that are sufficient for specific data.

Collapse

157

Detmer FJ, Mut F, Slawski M, Hirsch S, Bijlenga P, Cebral JR. Incorporating variability of patient inflow conditions into statistical models for aneurysm rupture assessment. Acta Neurochir (Wien) 2020;162:553-566. [PMID: 32008209 DOI: 10.1007/s00701-020-04234-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Accepted: 01/18/2020] [Indexed: 12/19/2022]

158

Xie X, Zhang H, Wang J, Chang Q, Wang J, Pal NR. Learning Optimized Structure of Neural Networks by Hidden Node Pruning With L₁ Regularization. IEEE TRANSACTIONS ON CYBERNETICS 2020;50:1333-1346. [PMID: 31765323 DOI: 10.1109/tcyb.2019.2950105] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

159

Reps JM, Cepeda MS, Ryan PB. Wisdom of the CROUD: Development and validation of a patient-level prediction model for opioid use disorder using population-level claims data. PLoS One 2020;15:e0228632. [PMID: 32053653 PMCID: PMC7017997 DOI: 10.1371/journal.pone.0228632] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 01/21/2020] [Indexed: 11/18/2022] Open

Abstract

OBJECTIVE

Some patients who are given opioids for pain could develop opioid use disorder. If it was possible to identify patients who are at a higher risk of opioid use disorder, then clinicians could spend more time educating these patients about the risks. We develop and validate a model to predict a person's future risk of opioid use disorder at the point before being dispensed their first opioid.

METHODS

A cohort study patient-level prediction using four US claims databases with target populations ranging between 343,552 and 384,424 patients. The outcome was recorded diagnosis of opioid abuse, dependency or unspecified drug abuse as a proxy for opioid use disorder from 1 day until 365 days after the first opioid is dispensed. We trained a regularized logistic regression using candidate predictors consisting of demographics and any conditions, drugs, procedures or visits prior to the first opioid. We then selected the top predictors and created a simple 8 variable score model.

RESULTS

We estimated the percentage of new users of opioids with reported opioid use disorder within a year to range between 0.04%-0.26% across US claims data. We developed an 8 variable Calculator of Risk for Opioid Use Disorder (CROUD) score, derived from the prediction models to stratify patients into higher and lower risk groups. The 8 baseline variables were age 15-29, medical history of substance abuse, mood disorder, anxiety disorder, low back pain, renal impairment, painful neuropathy and recent ER visit. 1.8% of people were in the high risk group for opioid use disorder and had a score > = 23 with the model obtaining a sensitivity of 13%, specificity of 98% and PPV of 1.14% for predicting opioid use disorder.

CONCLUSIONS

CROUD could be used by clinicians to obtain personalized risk scores. CROUD could be used to further educate those at higher risk and to personalize new opioid dispensing guidelines such as urine testing. Due to the high false positive rate, it should not be used for contraindication or to restrict utilization.

Collapse

160

Li Y, Sun C, Li P, Zhao Y, Mensah GK, Xu Y, Guo H, Chen J. Hypernetwork Construction and Feature Fusion Analysis Based on Sparse Group Lasso Method on fMRI Dataset. Front Neurosci 2020;14:60. [PMID: 32116508 PMCID: PMC7029661 DOI: 10.3389/fnins.2020.00060] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Accepted: 01/15/2020] [Indexed: 01/21/2023] Open

Abstract

Recent works have shown that the resting-state brain functional connectivity hypernetwork, where multiple nodes can be connected, are an effective technique for brain disease diagnosis and classification research. The lasso method was used to construct hypernetworks by solving sparse linear regression models in previous research. But, constructing a hypernetwork based on the lasso method simply selects a single variable, in that it lacks the ability to interpret the grouping effect. Considering the group structure problem, the previous study proposed to create a hypernetwork based on the elastic net and the group lasso methods, and the results showed that the former method had the best classification performance. However, the highly correlated variables selected by the elastic net method were not necessarily in the active set in the group. Therefore, we extended our research to address this issue. Herein, we propose a new method that introduces the sparse group lasso method to improve the construction of the hypernetwork by solving the group structure problem of the brain regions. We used the traditional lasso, group lasso method, and sparse group lasso method to construct a hypernetwork in patients with depression and normal subjects. Meanwhile, other clustering coefficients (clustering coefficients based on pairs of nodes) were also introduced to extract features with traditional clustering coefficients. Two types of features with significant differences obtained after feature selection were subjected to multi-kernel learning for feature fusion and classification using each method, respectively. The network topology results revealed differences among the three networks, where hypernetwork using the lasso method was the strictest; the group lasso, most lenient; and the sgLasso method, moderate. The network topology of the sparse group lasso method was similar to that of the group lasso method but different from the lasso method. The classification results show that the sparse group lasso method achieves the best classification accuracy by using multi-kernel learning, which indicates that better classification performance can be achieved when the group structure exists and is properly extended.

Collapse

161

Yin Z. Variable selection for sparse logistic regression. METRIKA 2020. [DOI: 10.1007/s00184-020-00764-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

162

Noorie Z, Afsari F. Sparse feature selection: Relevance, redundancy and locality structure preserving guided by pairwise constraints. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2019.105956] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

163

Prediction in Cancer Genomics Using Topological Signatures and Machine Learning. TOPOLOGICAL DATA ANALYSIS 2020. [DOI: 10.1007/978-3-030-43408-3_10] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

164

Huang S, Garshick E, Weschler LB, Hong C, Li J, Li L, Qu F, Gao D, Zhou Y, Sundell J, Zhang Y, Koutrakis P. Home environmental and lifestyle factors associated with asthma, rhinitis and wheeze in children in Beijing, China. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2020;256:113426. [PMID: 31672368 PMCID: PMC7050389 DOI: 10.1016/j.envpol.2019.113426] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 10/15/2019] [Accepted: 10/15/2019] [Indexed: 05/04/2023]

Abstract

BACKGROUND

The prevalence of asthma and allergic diseases has increased rapidly in urban China since 2000. There has been limited study of associations between home environmental and lifestyle factors with asthma and symptoms of allergic disease in China.

METHODS

In a cross-sectional analysis of 2214 children in Beijing, we applied a two-step hybrid Least Absolute Shrinkage and Selection Operator (LASSO) algorithm to identify environmental and lifestyle-related factors associated with asthma, rhinitis and wheeze from a wide range of candidates. We used group LASSO to select variables, using cross-validation as the criterion. Effect estimates were then calculated using adaptive LASSO. Model performance was assessed using Area Under the Curve (AUC) values.

RESULTS

We found a number of environmental and lifestyle-related factors significantly associated with asthma, rhinitis or wheeze, which changed the probability of asthma, rhinitis or wheeze from -5.76% (95%CI: -7.74%, -3.79%) to 27.4% (95%CI: 16.6%, 38.3%). The three factors associated with the largest change in probability of asthma were short birth length, carpeted floor and paternal allergy; for rhinitis they were maternal smoking during pregnancy, paternal allergy and living close to industrial area; and for wheeze they were carpeted floor, short birth length and maternal allergy. Other home environmental risk factors identified were living close to a highway, industrial area or river, sharing bedroom, cooking with gas, furry pets, cockroaches, incense, printer/photocopier, TV, damp, and window condensation in winter. Lifestyle-related risk factors were child caretakers other than parents, and age<3 for the day-care. Other risk factors included use of antibiotics, and mother's occupation. Major protective factors for wheeze were living in a rural/suburban region, air conditioner use, and mother's occupation in healthcare.

CONCLUSIONS

Our findings suggest that changes in lifestyle and indoor environments associated with the urbanization and industrialization of China are associated with asthma, rhinitis, and wheeze in children.

Collapse

Affiliation(s)

Shaodan Huang Department of Building Science, Tsitnghua University, Beijing, 100084, China; Beijing Key Lab of Indoor Air Quality Evaluation and Control, Beijing, 100084, China; Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, 02115, USA
Eric Garshick Pulmonary, Allergy, Sleep, and Critical Care Medicine Section, Medical Service, VA Boston Healthcare System, Boston, MA, 02132, USA; Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA
Louise B Weschler Department of Building Science, Tsitnghua University, Beijing, 100084, China; 161 Richdale Road, Colts Neck, NJ, 07722, USA
Chuan Hong Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
Jing Li Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, 02115, USA.
Linyan Li Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, 02115, USA
Fang Qu Department of Building Science, Tsitnghua University, Beijing, 100084, China; China Meteorological Administration Training Centre, China Meteorological Administration, Beijing, 100081, China
Dewen Gao Beijing Key Lab of Indoor Air Quality Evaluation and Control, Beijing, 100084, China
Yanmin Zhou School of Architecture, Tsinghua University, Beijing, 100084, China; Beijing Key Lab of Indoor Air Quality Evaluation and Control, Beijing, 100084, China
Jan Sundell School of Environmental Science and Engineering, Tianjin University, Tianjing, 300072, China
Yinping Zhang Department of Building Science, Tsitnghua University, Beijing, 100084, China; Beijing Key Lab of Indoor Air Quality Evaluation and Control, Beijing, 100084, China.
Petros Koutrakis Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, 02115, USA

Collapse

165

Zhang X, Zhang Q, Wang X, Ma S, Fang K. Structured sparse logistic regression with application to lung cancer prediction using breath volatile biomarkers. Stat Med 2019;39:955-967. [PMID: 31880351 DOI: 10.1002/sim.8454] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 09/24/2019] [Accepted: 11/21/2019] [Indexed: 11/10/2022]

166

Hesamian G, Akbari MG. Fuzzy Lasso regression model with exact explanatory variables and fuzzy responses. Int J Approx Reason 2019. [DOI: 10.1016/j.ijar.2019.10.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

167

Groll A, Hambuckers J, Kneib T, Umlauf N. LASSO-type penalization in the framework of generalized additive models for location, scale and shape. Comput Stat Data Anal 2019. [DOI: 10.1016/j.csda.2019.06.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

168

Honda T, Ing CK, Wu WY. Adaptively weighted group Lasso for semiparametric quantile regression models. BERNOULLI 2019. [DOI: 10.3150/18-bej1091] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

169

Zègre-Hemsey JK, Asafu-Adjei J, Fernandez A, Brice J. Characteristics of Prehospital Electrocardiogram Use in North Carolina Using a Novel Linkage of Emergency Medical Services and Emergency Department Data. PREHOSP EMERG CARE 2019;23:772-779. [PMID: 30885071 PMCID: PMC6751030 DOI: 10.1080/10903127.2019.1597230] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 03/14/2019] [Accepted: 03/14/2019] [Indexed: 10/27/2022]

Abstract

Objective: Prehospital electrocardiography (ECG) is recommended for patients with suspected acute coronary syndrome (ACS), yet only 20-80% of chest pain patients receive a prehospital ECG. Less is known about prehospital ECG use in patients with less common complaints (e.g., fatigue) suspicious for ACS who are transported by emergency medical services (EMS). The aims of this study were to determine: (1) the proportion of patients with chest pain and less typical complaints, and (2) patient characteristics associated with prehospital ECG use in patients transported by EMS to emergency departments across North Carolina. Methods: A novel linked database was created between prehospital and emergency department (ED) patient care data from the North Carolina Prehospital Medical Information System and the North Carolina Disease Event Tracking and Epidemiologic Collection Tool. Institutional review board approval and a data use agreement were received prior to the start of the study. Patients ≥21 transported during 2010-14 by EMS with select variables were included. We examined patients' complaints (symptoms), characteristics (e.g., race, ethnicity, final hospital diagnosis), and prehospital ECG use (yes/no). Analysis included descriptive statistics and mixed logistic regression. Results: During 2010-14, there were 1,967,542 patients with linked EMS-ED data (mean age: 56.9 [SD: 22.2], 43.2% male, 63.7% White). Of these, 643,174 (32.6%) received a prehospital ECG. Patients with prehospital ECG presented with the following complaints: 20% chest pain; 10% shortness of breath; 6% abdominal pain/problems; 6% altered level of consciousness; 5% syncope/dizziness; 4% palpitations; 12% other complaints; and 37% missing. Patients' presenting complaints were the strongest predictor of prehospital ECG use, adjusting for age, sex, race, ethnicity, urbanicity, and date and time of EMS dispatch. Conclusions: Patients with chest pain were significantly more likely to receive a prehospital ECG compared to those with less typical but suspicious complaints for ACS. Patients with less common presentations remain disadvantaged for early triage, risk stratification, and intervention prior to the hospital.

Collapse

170

Kim K, Sun H. Incorporating genetic networks into case-control association studies with high-dimensional DNA methylation data. BMC Bioinformatics 2019;20:510. [PMID: 31640538 PMCID: PMC6805595 DOI: 10.1186/s12859-019-3040-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 08/21/2019] [Indexed: 12/23/2022] Open

Abstract

Background

In human genetic association studies with high-dimensional gene expression data, it has been well known that statistical selection methods utilizing prior biological network knowledge such as genetic pathways and signaling pathways can outperform other methods that ignore genetic network structures in terms of true positive selection. In recent epigenetic research on case-control association studies, relatively many statistical methods have been proposed to identify cancer-related CpG sites and their corresponding genes from high-dimensional DNA methylation array data. However, most of existing methods are not designed to utilize genetic network information although methylation levels between linked genes in the genetic networks tend to be highly correlated with each other.

Results

We propose new approach that combines data dimension reduction techniques with network-based regularization to identify outcome-related genes for analysis of high-dimensional DNA methylation data. In simulation studies, we demonstrated that the proposed approach overwhelms other statistical methods that do not utilize genetic network information in terms of true positive selection. We also applied it to the 450K DNA methylation array data of the four breast invasive carcinoma cancer subtypes from The Cancer Genome Atlas (TCGA) project.

Conclusions

The proposed variable selection approach can utilize prior biological network information for analysis of high-dimensional DNA methylation array data. It first captures gene level signals from multiple CpG sites using data a dimension reduction technique and then performs network-based regularization based on biological network graph information. It can select potentially cancer-related genes and genetic pathways that were missed by the existing methods.

Electronic supplementary material

The online version of this article (10.1186/s12859-019-3040-x) contains supplementary material, which is available to authorized users.

Collapse

171

Zhou S, Zhou J, Zhang B. Overlapping group lasso for high-dimensional generalized linear models. COMMUN STAT-THEOR M 2019. [DOI: 10.1080/03610926.2018.1500604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

172

Mining user interaction patterns in the darkweb to predict enterprise cyber incidents. SOCIAL NETWORK ANALYSIS AND MINING 2019. [DOI: 10.1007/s13278-019-0603-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

173

Detmer FJ, Lückehe D, Mut F, Slawski M, Hirsch S, Bijlenga P, von Voigt G, Cebral JR. Comparison of statistical learning approaches for cerebral aneurysm rupture assessment. Int J Comput Assist Radiol Surg 2019;15:141-150. [PMID: 31485987 DOI: 10.1007/s11548-019-02065-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 08/29/2019] [Indexed: 11/29/2022]

174

Bai H, Zhu R, An H, Zhou G, Huang H, Ren H, Zhang Y. Influence of wastewater sludge properties on the performance of electro-osmosis dewatering. ENVIRONMENTAL TECHNOLOGY 2019;40:2853-2863. [PMID: 29557729 DOI: 10.1080/09593330.2018.1455744] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 03/14/2018] [Indexed: 06/08/2023]

175

Wang Y, Li X, Ruiz R. Weighted General Group Lasso for Gene Selection in Cancer Classification. IEEE TRANSACTIONS ON CYBERNETICS 2019;49:2860-2873. [PMID: 29993764 DOI: 10.1109/tcyb.2018.2829811] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

176

Alquier P, Cottet V, Lecué G. Estimation bounds and sharp oracle inequalities of regularized procedures with Lipschitz loss functions. Ann Stat 2019. [DOI: 10.1214/18-aos1742] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

177

Grebla R, Setyawan J, Park C, Richards KM, Nwokeji ED, Pawaskar M, Haim Erder M, Lawson KA. Examining the heterogeneity of treatment patterns in attention deficit hyperactivity disorder among children and adolescents in the Texas Medicaid population: modeling suboptimal treatment response. J Med Econ 2019;22:788-797. [PMID: 30983465 DOI: 10.1080/13696998.2019.1606814] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

178

Drumetz L, Meyer TR, Chanussot J, Bertozzi AL, Jutten C. Hyperspectral Image Unmixing With Endmember Bundles and Group Sparsity Inducing Mixed Norms. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019;28:3435-3450. [PMID: 30716036 DOI: 10.1109/tip.2019.2897254] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

179

Koster GT, Nguyen TTM, van Zwet EW, Garcia BL, Rowling HR, Bosch J, Schonewille WJ, Velthuis BK, van den Wijngaard IR, den Hertog HM, Roos YBWEM, van Walderveen MAA, Wermer MJH, Kruyt ND. Clinical prediction of thrombectomy eligibility: A systematic review and 4-item decision tree. Int J Stroke 2019;14:530-539. [PMID: 30209989 PMCID: PMC6710617 DOI: 10.1177/1747493018801225] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Accepted: 06/25/2018] [Indexed: 01/19/2023]

Abstract

BACKGROUND

A clinical large anterior vessel occlusion (LAVO)-prediction scale could reduce treatment delays by allocating intra-arterial thrombectomy (IAT)-eligible patients directly to a comprehensive stroke center.

AIM

To subtract, validate and compare existing LAVO-prediction scales, and develop a straightforward decision support tool to assess IAT-eligibility.

METHODS

We performed a systematic literature search to identify LAVO-prediction scales. Performance was compared in a prospective, multicenter validation cohort of the Dutch acute Stroke study (DUST) by calculating area under the receiver operating curves (AUROC). With group lasso regression analysis, we constructed a prediction model, incorporating patient characteristics next to National Institutes of Health Stroke Scale (NIHSS) items. Finally, we developed a decision tree algorithm based on dichotomized NIHSS items.

RESULTS

We identified seven LAVO-prediction scales. From DUST, 1316 patients (35.8% LAVO-rate) from 14 centers were available for validation. FAST-ED and RACE had the highest AUROC (both >0.81, p < 0.01 for comparison with other scales). Group lasso analysis revealed a LAVO-prediction model containing seven NIHSS items (AUROC 0.84). With the GACE (Gaze, facial Asymmetry, level of Consciousness, Extinction/inattention) decision tree, LAVO is predicted (AUROC 0.76) for 61% of patients with assessment of only two dichotomized NIHSS items, and for all patients with four items.

CONCLUSION

External validation of seven LAVO-prediction scales showed AUROCs between 0.75 and 0.83. Most scales, however, appear too complex for Emergency Medical Services use with prehospital validation generally lacking. GACE is the first LAVO-prediction scale using a simple decision tree as such increasing feasibility, while maintaining high accuracy. Prehospital prospective validation is planned.

Collapse

180

Komatsu S, Yamashita Y, Ninomiya Y. AIC for the group Lasso in generalized linear models. JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE 2019. [DOI: 10.1007/s42081-019-00052-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

181

Dang Y, Wang Q. Simultaneous variable and factor selection via sparse group lasso in factor analysis. J STAT COMPUT SIM 2019. [DOI: 10.1080/00949655.2019.1633324] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

182

He Z, Fong Y. Maximum diversity weighting for biomarkers with application in HIV-1 vaccine studies. Stat Med 2019;38:3936-3946. [PMID: 31215662 DOI: 10.1002/sim.8212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Revised: 02/15/2019] [Accepted: 05/08/2019] [Indexed: 11/07/2022]

183

Wilder-Smith A, Wei Y, de Araújo TVB, VanKerkhove M, Turchi Martelli CM, Turchi MD, Teixeira M, Tami A, Souza J, Sousa P, Soriano-Arandes A, Soria-Segarra C, Sanchez Clemente N, Rosenberger KD, Reveiz L, Prata-Barbosa A, Pomar L, Pelá Rosado LE, Perez F, Passos SD, Nogueira M, Noel TP, Moura da Silva A, Moreira ME, Morales I, Miranda Montoya MC, Miranda-Filho DDB, Maxwell L, Macpherson CNL, Low N, Lan Z, LaBeaud AD, Koopmans M, Kim C, João E, Jaenisch T, Hofer CB, Gustafson P, Gérardin P, Ganz JS, Dias ACF, Elias V, Duarte G, Debray TPA, Cafferata ML, Buekens P, Broutet N, Brickley EB, Brasil P, Brant F, Bethencourt S, Benedetti A, Avelino-Silva VL, Ximenes RADA, Alves da Cunha A, Alger J. Understanding the relation between Zika virus infection during pregnancy and adverse fetal, infant and child outcomes: a protocol for a systematic review and individual participant data meta-analysis of longitudinal studies of pregnant women and their infants and children. BMJ Open 2019;9:e026092. [PMID: 31217315 PMCID: PMC6588966 DOI: 10.1136/bmjopen-2018-026092] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 02/11/2019] [Accepted: 05/09/2019] [Indexed: 12/14/2022] Open

Abstract

INTRODUCTION

Zika virus (ZIKV) infection during pregnancy is a known cause of microcephaly and other congenital and developmental anomalies. In the absence of a ZIKV vaccine or prophylactics, principal investigators (PIs) and international leaders in ZIKV research have formed the ZIKV Individual Participant Data (IPD) Consortium to identify, collect and synthesise IPD from longitudinal studies of pregnant women that measure ZIKV infection during pregnancy and fetal, infant or child outcomes.

METHODS AND ANALYSIS

We will identify eligible studies through the ZIKV IPD Consortium membership and a systematic review and invite study PIs to participate in the IPD meta-analysis (IPD-MA). We will use the combined dataset to estimate the relative and absolute risk of congenital Zika syndrome (CZS), including microcephaly and late symptomatic congenital infections; identify and explore sources of heterogeneity in those estimates and develop and validate a risk prediction model to identify the pregnancies at the highest risk of CZS or adverse developmental outcomes. The variable accuracy of diagnostic assays and differences in exposure and outcome definitions means that included studies will have a higher level of systematic variability, a component of measurement error, than an IPD-MA of studies of an established pathogen. We will use expert testimony, existing internal and external diagnostic accuracy validation studies and laboratory external quality assessments to inform the distribution of measurement error in our models. We will apply both Bayesian and frequentist methods to directly account for these and other sources of uncertainty.

ETHICS AND DISSEMINATION

The IPD-MA was deemed exempt from ethical review. We will convene a group of patient advocates to evaluate the ethical implications and utility of the risk stratification tool. Findings from these analyses will be shared via national and international conferences and through publication in open access, peer-reviewed journals.

TRIAL REGISTRATION NUMBER

PROSPERO International prospective register of systematic reviews (CRD42017068915).

Collapse

Affiliation(s)

Annelies Wilder-Smith Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
Yinghui Wei Centre for Mathematical Sciences, University of Plymouth, Plymouth, UK
Thalia Velho Barreto de Araújo Department of Social Medicine, Universidade Federal de Pernambuco, Recife, Brazil
Maria VanKerkhove Health Emergencies Programme, Organisation mondiale de la Sante, Geneve, Switzerland
Celina Maria Turchi Martelli Department of Collective Health, Institute Aggeu Magalhães (CPqAM), Oswaldo Cruz Foundation, Recife, Brazil
Marília Dalva Turchi Institute of Tropical Pathology and Public Health, Federal University of Goias, Goiânia, Brazil
Mauro Teixeira Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
Adriana Tami Department of Medical Microbiology, University Medical Center Groningen, Groningen, The Netherlands
João Souza Department of Social Medicine, University of São Paulo, São Paulo, Brazil
Patricia Sousa Reference Center for Neurodevelopment, Assistance, and Rehabilitation of Children, State Department of Health of Maranhão, Sao Luís, Brazil
Antoni Soriano-Arandes Department of Pediatrics, University Hospital Vall d’Hebron, Barcelona, Spain
Carmen Soria-Segarra SOSECALI C. Ltda, Guayaquil, Ecuador
Nuria Sanchez Clemente Department of Epidemiology, University of São Paulo, São Paulo, Brazil
Kerstin Daniela Rosenberger Department of Infectious Diseases, Section Clinical Tropical Medicine, UniversitatsKlinikum Heidelberg, Heidelberg, Germany
Ludovic Reveiz Evidence and Intelligence for Action in Health, Pan American Health Organization, Washington, District of Columbia, USA
Arnaldo Prata-Barbosa Department of Pediatrics, D’Or Institute for Research & Education, Rio de Janeiro, Brazil
Léo Pomar Department of Obstetrics and Gynecology, Centre Hospitalier de l’Ouest Guyanais, Saint-Laurent du Maroni, French Guiana
Luiza Emylce Pelá Rosado Hospital Materno Infantil de Goiânia, Goiânia State Health Secretary, Goiás, Brazil
Freddy Perez Communicable Diseases and Environmental Determinants of Health Department, Pan American Health Organization, Washington, District of Columbia, USA
Saulo D. Passos Department of Pediatrics, FMJ, São Paulo, Brazil
Mauricio Nogueira Faculdade de Medicina de Sao Jose do Rio Preto, Department of Dermatologic Diseases, São José do Rio Preto, Brazil
Trevor P. Noel Windward Islands Research and Education Foundation, St. George’s University, True Blue Point, Grenada
Antônio Moura da Silva Department of Public Health, Universidade Federal do Maranhão – São Luís, São Luís, Brazil
Maria Elisabeth Moreira Department of Neonatology, Oswaldo Cruz Foundation (Fiocruz), Rio de Janeiro, Brazil
Ivonne Morales Department of Infectious Diseases, Section Clinical Tropical Medicine, UniversitatsKlinikum Heidelberg, Heidelberg, Germany
Maria Consuelo Miranda Montoya Facultad de Salud, Universidad Industrial de Santander, Bucaramanga, Colombia
Demócrito de Barros Miranda-Filho Faculty of Medical Sciences, University of Pernambuco, Recife, Brazil
Lauren Maxwell Reproductive Health and Research, World Health Organization, Geneva, Switzerland Hubert Department of Global Health, Emory University, Atlanta, Georgia, USA
Calum N. L. Macpherson Windward Islands Research and Education Foundation, St. George’s University, True Blue Point, Grenada
Nicola Low Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
Zhiyi Lan McGill University Health Centre, McGill University, Montréal, Canada
Angelle Desiree LaBeaud Pediatric Infectious Diseases, Stanford Hospital, Palo Alto, California, USA
Marion Koopmans Department of Virology, Erasmus Medical Center, Rotterdam, The Netherlands
Caron Kim Department of Reproductive Health and Research, World Health Organization, Geneva, Switzerland
Esaú João Department of Infectious Diseases, Hospital Federal dos Servidores do Estado, Rio de Janeiro, Brazil
Thomas Jaenisch Department of Infectious Diseases, Section Clinical Tropical Medicine, UniversitatsKlinikum Heidelberg, Heidelberg, Germany
Cristina Barroso Hofer Instituto de Puericultura e Pediatria Martagão Gesteira, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
Paul Gustafson Statistics, University of British Columbia, British Columbia, Vancouver, Canada
Patrick Gérardin INSERM CIC1410 Clinical Epidemiology, CHU La Réunion, Saint Pierre, Réunion UM 134 PIMIT (CNRS 9192, INSERM U1187, IRD 249, Université de la Réunion), Universite de la Reunion, Sainte Clotilde, Réunion
Jucelia S. Ganz Children’s Hospital Juvencio Matos, São Luís, Brazil
Ana Carolina Fialho Dias Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
Vanessa Elias Sustainable Development and Environmental Health, Pan American Health Organization, Washington, District of Columbia, USA
Geraldo Duarte Department of Gynecology and Obstetrics, University of São Paulo, São Paulo, Brazil
Thomas Paul Alfons Debray Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
María Luisa Cafferata Mother and Children Health Research Department, Instituto de Efectividad Clinica y Sanitaria, Buenos Aires, Argentina
Pierre Buekens School of Public Health and Tropical Medicine, Tulane University, New Orleans, USA
Nathalie Broutet Department of Reproductive Health and Research, World Health Organization, Geneva, Switzerland
Elizabeth B. Brickley Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK
Patrícia Brasil Instituto de pesquisa Clínica Evandro Chagas, Fundacao Oswaldo Cruz, Rio de Janeiro, Brazil
Fátima Brant Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
Sarah Bethencourt Facultad de Ciencias de la Salud, Universidad de Carabobo, Valencia, Carabobo, Bolivarian Republic of Venezuela
Andrea Benedetti Departments of Medicine and of Epidemiology, Biostatistics & Occupational Health, McGill University, Montreal, Quebec, Canada
Vivian Lida Avelino-Silva Department of Infectious and Parasitic Diseases, Faculdade de Medicina da Universidade de Sao Paulo, São Paulo, Brazil
Ricardo Arraes de Alencar Ximenes Department of Tropical Medicine, Federal University of Pernambuco, Recife, Brazil
Antonio Alves da Cunha Department of Pediatrics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
Jackeline Alger Facultad de Ciencias Médicas, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
Zika Virus Individual Participant Data Consortium Abreu de carvalhoLiège MariaBatistaRosangelaBertozziAna PaulaCarlesGabrielCotrimDeniseDamascenoLuanaDimitrakisLadyDuarte rodriguesMaría ManoelaEstofoleteCassia FFragoso da silveira gouvêaMaria IsabelFumadó-pérezVickyGazetaRosa EstelaKaydos-danielsNeelyGilboaSuzanneKrystosikAmyLambertVéroniqueLópez-hortelanoMilagros GarcíaMussi-pinhataMarisa MarciaNelsonChristinaNielsenKarinOlianiDenise MRabelloRenataRibeiroMarizeliaRockxBarryRodriguesLaura CSalgadoSilviaSilveiraKatiaSulleiroElenaTongVanValenciaDianaDe souzaWayner VieiraVillar centenoLuis AngelZinAndrea

Collapse

184

Sung CL, Wang W, Plumlee M, Haaland B. Multiresolution Functional ANOVA for Large-Scale, Many-Input Computer Experiments. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2019.1595630] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

185

Classification tree algorithm for grouped variables. Comput Stat 2019. [DOI: 10.1007/s00180-019-00894-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

186

Luu TD, Fadili J, Chesneau C. PAC-Bayesian risk bounds for group-analysis sparse regression by exponential weighting. J MULTIVARIATE ANAL 2019. [DOI: 10.1016/j.jmva.2018.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

187

Liu M, Zhang J, Adeli E, Shen D. Joint Classification and Regression via Deep Multi-Task Multi-Channel Learning for Alzheimer's Disease Diagnosis. IEEE Trans Biomed Eng 2019;66:1195-1206. [PMID: 30222548 PMCID: PMC6764421 DOI: 10.1109/tbme.2018.2869989] [Citation(s) in RCA: 105] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

188

Zhong H, Kim S, Zhi D, Cui X. Predicting gene expression using DNA methylation in three human populations. PeerJ 2019;7:e6757. [PMID: 31106051 PMCID: PMC6500370 DOI: 10.7717/peerj.6757] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 03/10/2019] [Indexed: 12/30/2022] Open

Abstract

Background

DNA methylation, an important epigenetic mark, is well known for its regulatory role in gene expression, especially the negative correlation in the promoter region. However, its correlation with gene expression across genome at human population level has not been well studied. In particular, it is unclear if genome-wide DNA methylation profile of an individual can predict her/his gene expression profile. Previous studies were mostly limited to association analyses between single CpG site methylation and gene expression. It is not known whether DNA methylation of a gene has enough prediction power to serve as a surrogate for gene expression in existing human study cohorts with DNA samples other than RNA samples.

Results

We examined DNA methylation in the gene region for predicting gene expression across individuals in non-cancer tissues of three human population datasets, adipose tissue of the Multiple Tissue Human Expression Resource Projects (MuTHER), peripheral blood mononuclear cell (PBMC) from Asthma and normal control study participates, and lymphoblastoid cell lines (LCL) from healthy individuals. Three prediction models were investigated, single linear regression, multiple linear regression, and least absolute shrinkage and selection operator (LASSO) penalized regression. Our results showed that LASSO regression has superior performance among these methods. However, the prediction power is generally low and varies across datasets. Only 30 and 42 genes were found to have cross-validation R² greater than 0.3 in the PBMC and Adipose datasets, respectively. A substantially larger number of genes (258) were identified in the LCL dataset, which was generated from a more homogeneous cell line sample source. We also demonstrated that it gives better prediction power not to exclude any CpG probe due to cross hybridization or SNP effect.

Conclusion

In our three population analyses DNA methylation of CpG sites at gene region have limited prediction power for gene expression across individuals with linear regression models. The prediction power potentially varies depending on tissue, cell type, and data sources. In our analyses, the combination of LASSO regression and all probes not excluding any probe on the methylation array provides the best prediction for gene expression.

Collapse

189

Improved Reconstruction of MR Scanned Images by Using a Dictionary Learning Scheme. SENSORS 2019;19:s19081918. [PMID: 31018597 PMCID: PMC6514997 DOI: 10.3390/s19081918] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 04/16/2019] [Accepted: 04/21/2019] [Indexed: 11/17/2022]

190

Ijaz M, Asghar Z, Gul A. Ensemble of penalized logistic models for classification of high-dimensional data. COMMUN STAT-SIMUL C 2019. [DOI: 10.1080/03610918.2019.1595647] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

191

Song H, Raskutti G. PUlasso: High-Dimensional Variable Selection With Presence-Only Data. J Am Stat Assoc 2019;115:334-347. [PMID: 32255883 PMCID: PMC7133715 DOI: 10.1080/01621459.2018.1546587] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 10/13/2018] [Accepted: 10/29/2018] [Indexed: 10/27/2022]

192

Qi Z, Liu D, Fu H, Liu Y. Multi-Armed Angle-Based Direct Learning for Estimating Optimal Individualized Treatment Rules With Various Outcomes. J Am Stat Assoc 2019;115:678-691. [PMID: 34219848 DOI: 10.1080/01621459.2018.1529597] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

193

Jackknife Model Averaging Prediction Methods for Complex Phenotypes with Gene Expression Levels by Integrating External Pathway Information. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019;2019:2807470. [PMID: 31089389 PMCID: PMC6476151 DOI: 10.1155/2019/2807470] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Accepted: 03/20/2019] [Indexed: 01/03/2023]

Abstract

Motivation

In the past few years many prediction approaches have been proposed and widely employed in high dimensional genetic data for disease risk evaluation. However, those approaches typically ignore in model fitting the important group structures that naturally exists in genetic data.

Methods

In the present study, we applied a novel model-averaging approach, called jackknife model averaging prediction (JMAP), for high dimensional genetic risk prediction while incorporating pathway information into the model specification. JMAP selects the optimal weights across candidate models by minimizing a cross validation criterion in a jackknife way. Compared with previous approaches, one of the primary features of JMAP is to allow model weights to vary from 0 to 1 but without the limitation that the summation of weights is equal to one. We evaluated the performance of JMAP using extensive simulation studies and compared it with existing methods. We finally applied JMAP to four real cancer datasets that are publicly available from TCGA.

Results

The simulations showed that compared with other existing approaches (e.g., gsslasso), JMAP performed best or is among the best methods across a range of scenarios. For example, among 14 out of 16 simulation settings with PVE = 0.3, JMAP has an average of 0.075 higher prediction accuracy compared with gsslasso. We further found that in the simulation, the model weights for the true candidate models have much smaller chances to be zero compared with those for the null candidate models and are substantially greater in magnitude. In the real data application, JMAP also behaves comparably or better compared with the other methods for continuous phenotypes. For example, for the COAD, CRC, and PAAD datasets, the average gains of predictive accuracy of JMAP are 0.019, 0.064, and 0.052 compared with gsslasso.

Conclusion

The proposed method JMAP is a novel model-averaging approach for high dimensional genetic risk prediction while incorporating external useful group structures into the model specification.

Collapse

194

Qian W, Li W, Sogawa Y, Fujimaki R, Yang X, Liu J. An Interactive Greedy Approach to Group Sparsity in High Dimensions. Technometrics 2019. [DOI: 10.1080/00401706.2018.1537897] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

195

Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features. Genes (Basel) 2019;10:genes10030240. [PMID: 30901858 PMCID: PMC6471789 DOI: 10.3390/genes10030240] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 03/12/2019] [Accepted: 03/18/2019] [Indexed: 12/17/2022] Open

196

Greb F, Steffens J, Schlotz W. Modeling Music-Selection Behavior in Everyday Life: A Multilevel Statistical Learning Approach and Mediation Analysis of Experience Sampling Data. Front Psychol 2019;10:390. [PMID: 30941066 PMCID: PMC6433931 DOI: 10.3389/fpsyg.2019.00390] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 02/07/2019] [Indexed: 12/05/2022] Open

197

van de Wiel MA, Te Beest DE, Münch MM. Learning from a lot: Empirical Bayes for high-dimensional model-based prediction. Scand Stat Theory Appl 2019;46:2-25. [PMID: 31007342 PMCID: PMC6472625 DOI: 10.1111/sjos.12335] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 01/24/2018] [Accepted: 03/22/2018] [Indexed: 12/21/2022]

198

Zhang J, Zhao Z, Zhang K, Wei Z. A Feature Sampling Strategy for Analysis of High Dimensional Genomic Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019;16:434-441. [PMID: 29990199 DOI: 10.1109/tcbb.2017.2779492] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

199

Li Y, Wu FX, Ngom A. A review on machine learning principles for multi-view biological data integration. Brief Bioinform 2019;19:325-340. [PMID: 28011753 DOI: 10.1093/bib/bbw113] [Citation(s) in RCA: 126] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Indexed: 01/08/2023] Open

200

Tang Z, Lei S, Zhang X, Yi Z, Guo B, Chen JY, Shen Y, Yi N. Gsslasso Cox: a Bayesian hierarchical model for predicting survival and detecting associated genes by incorporating pathway information. BMC Bioinformatics 2019;20:94. [PMID: 30813883 PMCID: PMC6391807 DOI: 10.1186/s12859-019-2656-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 01/28/2019] [Indexed: 12/13/2022] Open

Abstract

BACKGROUND

Group structures among genes encoded in functional relationships or biological pathways are valuable and unique features in large-scale molecular data for survival analysis. However, most of previous approaches for molecular data analysis ignore such group structures. It is desirable to develop powerful analytic methods for incorporating valuable pathway information for predicting disease survival outcomes and detecting associated genes.

RESULTS

We here propose a Bayesian hierarchical Cox survival model, called the group spike-and-slab lasso Cox (gsslasso Cox), for predicting disease survival outcomes and detecting associated genes by incorporating group structures of biological pathways. Our hierarchical model employs a novel prior on the coefficients of genes, i.e., the group spike-and-slab double-exponential distribution, to integrate group structures and to adaptively shrink the effects of genes. We have developed a fast and stable deterministic algorithm to fit the proposed models. We performed extensive simulation studies to assess the model fitting properties and the prognostic performance of the proposed method, and also applied our method to analyze three cancer data sets.

CONCLUSIONS

Both the theoretical and empirical studies show that the proposed method can induce weaker shrinkage on predictors in an active pathway, thereby incorporating the biological similarity of genes within a same pathway into the hierarchical modeling. Compared with several existing methods, the proposed method can more accurately estimate gene effects and can better predict survival outcomes. For the three cancer data sets, the results show that the proposed method generates more powerful models for survival prediction and detecting associated genes. The method has been implemented in a freely available R package BhGLM at https://github.com/nyiuab/BhGLM .

Collapse