1
|
Hoggart CJ, Choi SW, García-González J, Souaiaia T, Preuss M, O'Reilly PF. BridgePRS leverages shared genetic effects across ancestries to increase polygenic risk score portability. Nat Genet 2024; 56:180-186. [PMID: 38123642 PMCID: PMC10786716 DOI: 10.1038/s41588-023-01583-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 10/20/2023] [Indexed: 12/23/2023]
Abstract
Here we present BridgePRS, a novel Bayesian polygenic risk score (PRS) method that leverages shared genetic effects across ancestries to increase PRS portability. We evaluate BridgePRS via simulations and real UK Biobank data across 19 traits in individuals of African, South Asian and East Asian ancestry, using both UK Biobank and Biobank Japan genome-wide association study summary statistics; out-of-cohort validation is performed in the Mount Sinai (New York) BioMe biobank. BridgePRS is compared with the leading alternative, PRS-CSx, and two other PRS methods. Simulations suggest that the performance of BridgePRS relative to PRS-CSx increases as uncertainty increases: with lower trait heritability, higher polygenicity and greater between-population genetic diversity; and when causal variants are not present in the data. In real data, BridgePRS has a 61% larger average R2 than PRS-CSx in out-of-cohort prediction of African ancestry samples in BioMe (P = 6 × 10-5). BridgePRS is a computationally efficient, user-friendly and powerful approach for PRS analyses in non-European ancestries.
Collapse
Affiliation(s)
- Clive J Hoggart
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA.
| | - Shing Wan Choi
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Judit García-González
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA
| | - Tade Souaiaia
- Department of Cellular Biology, Suny Downstate Health Sciences, Brooklyn, NY, USA
| | - Michael Preuss
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine, Mount Sinai, New York, NY, USA
| | - Paul F O'Reilly
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA.
| |
Collapse
|
2
|
Wang W, Qi F, Wipf DP, Cai C, Yu T, Li Y, Zhang Y, Yu Z, Wu W. Sparse Bayesian Learning for End-to-End EEG Decoding. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:15632-15649. [PMID: 37506000 DOI: 10.1109/tpami.2023.3299568] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2023]
Abstract
Decoding brain activity from non-invasive electroencephalography (EEG) is crucial for brain-computer interfaces (BCIs) and the study of brain disorders. Notably, end-to-end EEG decoding has gained widespread popularity in recent years owing to the remarkable advances in deep learning research. However, many EEG studies suffer from limited sample sizes, making it difficult for existing deep learning models to effectively generalize to highly noisy EEG data. To address this fundamental limitation, this paper proposes a novel end-to-end EEG decoding algorithm that utilizes a low-rank weight matrix to encode both spatio-temporal filters and the classifier, all optimized under a principled sparse Bayesian learning (SBL) framework. Importantly, this SBL framework also enables us to learn hyperparameters that optimally penalize the model in a Bayesian fashion. The proposed decoding algorithm is systematically benchmarked on five motor imagery BCI EEG datasets ( N=192) and an emotion recognition EEG dataset ( N=45), in comparison with several contemporary algorithms, including end-to-end deep-learning-based EEG decoding algorithms. The classification results demonstrate that our algorithm significantly outperforms the competing algorithms while yielding neurophysiologically meaningful spatio-temporal patterns. Our algorithm therefore advances the state-of-the-art by providing a novel EEG-tailored machine learning tool for decoding brain activity.
Collapse
|
3
|
Vamvourellis K, Kalogeropoulos K, Moustaki I. Assessment of generalised Bayesian structural equation models for continuous and binary data. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2023; 76:559-584. [PMID: 37401608 DOI: 10.1111/bmsp.12314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 04/17/2023] [Indexed: 07/05/2023]
Abstract
The paper proposes a novel model assessment paradigm aiming to address shortcoming of posterior predictivep -values, which provide the default metric of fit for Bayesian structural equation modelling (BSEM). The model framework presented in the paper focuses on the approximate zero approach (Psychological Methods, 17, 2012, 313), which involves formulating certain parameters (such as factor loadings) to be approximately zero through the use of informative priors, instead of explicitly setting them to zero. The introduced model assessment procedure monitors the out-of-sample predictive performance of the fitted model, and together with a list of guidelines we provide, one can investigate whether the hypothesised model is supported by the data. We incorporate scoring rules and cross-validation to supplement existing model assessment metrics for BSEM. The proposed tools can be applied to models for both continuous and binary data. The modelling of categorical and non-normally distributed continuous data is facilitated with the introduction of an item-individual random effect. We study the performance of the proposed methodology via simulation experiments as well as real data on the 'Big-5' personality scale and the Fagerstrom test for nicotine dependence.
Collapse
Affiliation(s)
| | | | - Irini Moustaki
- Department of Statistics, London School of Economics, London, UK
| |
Collapse
|
4
|
Ozelim LCDSM, Ribeiro DB, Schiavon JA, Domingues VR, de Queiroz PIB. HPOSS: A hierarchical portfolio optimization stacking strategy to reduce the generalization error of ensembles of models. PLoS One 2023; 18:e0290331. [PMID: 37651433 PMCID: PMC10470931 DOI: 10.1371/journal.pone.0290331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 08/04/2023] [Indexed: 09/02/2023] Open
Abstract
Surrogate models are frequently used to replace costly engineering simulations. A single surrogate is frequently chosen based on previous experience or by fitting multiple surrogates and selecting one based on mean cross-validation errors. A novel stacking strategy will be presented in this paper. This new strategy results from reinterpreting the model selection process based on the generalization error. For the first time, this problem is proposed to be translated into a well-studied financial problem: portfolio management and optimization. In short, it is demonstrated that the individual residues calculated by leave-one-out procedures are samples from a given random variable ϵi, whose second non-central moment is the i-th model's generalization error. Thus, a stacking methodology based solely on evaluating the behavior of the linear combination of the random variables ϵi is proposed. At first, several surrogate models are calibrated. The Directed Bubble Hierarchical Tree (DBHT) clustering algorithm is then used to determine which models are worth stacking. The stacking weights can be calculated using any financial approach to the portfolio optimization problem. This alternative understanding of the problem enables practitioners to use established financial methodologies to calculate the models' weights, significantly improving the ensemble of models' out-of-sample performance. A study case is carried out to demonstrate the applicability of the new methodology. Overall, a total of 124 models were trained using a specific dataset: 40 Machine Learning models and 84 Polynomial Chaos Expansion models (which considered 3 types of base random variables, 7 least square algorithms for fitting the up to fourth order expansion's coefficients). Among those, 99 models could be fitted without convergence and other numerical issues. The DBHT algorithm with Pearson correlation distance and generalization error similarity was able to select a subgroup of 23 models from the 99 fitted ones, implying a reduction of about 77% in the total number of models, representing a good filtering scheme which still preserves diversity. Finally, it has been demonstrated that the weights obtained by building a Hierarchical Risk Parity (HPR) portfolio perform better for various input random variables, indicating better out-of-sample performance. In this way, an economic stacking strategy has demonstrated its worth in improving the out-of-sample capabilities of stacked models, which illustrates how the new understanding of model stacking methodologies may be useful.
Collapse
|
5
|
Forbes O, Santos-Fernandez E, Wu PPY, Xie HB, Schwenn PE, Lagopoulos J, Mills L, Sacks DD, Hermens DF, Mengersen K. clusterBMA: Bayesian model averaging for clustering. PLoS One 2023; 18:e0288000. [PMID: 37603575 PMCID: PMC10441802 DOI: 10.1371/journal.pone.0288000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/16/2023] [Indexed: 08/23/2023] Open
Abstract
Various methods have been developed to combine inference across multiple sets of results for unsupervised clustering, within the ensemble clustering literature. The approach of reporting results from one 'best' model out of several candidate clustering models generally ignores the uncertainty that arises from model selection, and results in inferences that are sensitive to the particular model and parameters chosen. Bayesian model averaging (BMA) is a popular approach for combining results across multiple models that offers some attractive benefits in this setting, including probabilistic interpretation of the combined cluster structure and quantification of model-based uncertainty. In this work we introduce clusterBMA, a method that enables weighted model averaging across results from multiple unsupervised clustering algorithms. We use clustering internal validation criteria to develop an approximation of the posterior model probability, used for weighting the results from each model. From a combined posterior similarity matrix representing a weighted average of the clustering solutions across models, we apply symmetric simplex matrix factorisation to calculate final probabilistic cluster allocations. In addition to outperforming other ensemble clustering methods on simulated data, clusterBMA offers unique features including probabilistic allocation to averaged clusters, combining allocation probabilities from 'hard' and 'soft' clustering algorithms, and measuring model-based uncertainty in averaged cluster allocation. This method is implemented in an accompanying R package of the same name. We use simulated datasets to explore the ability of the proposed technique to identify robust integrated clusters with varying levels of separation between subgroups, and with varying numbers of clusters between models. Benchmarking accuracy against four other ensemble methods previously demonstrated to be highly effective in the literature, clusterBMA matches or exceeds the performance of competing approaches under various conditions of dimensionality and cluster separation. clusterBMA substantially outperformed other ensemble methods for high dimensional simulated data with low cluster separation, with 1.16 to 7.12 times better performance as measured by the Adjusted Rand Index. We also explore the performance of this approach through a case study that aims to identify probabilistic clusters of individuals based on electroencephalography (EEG) data. In applied settings for clustering individuals based on health data, the features of probabilistic allocation and measurement of model-based uncertainty in averaged clusters are useful for clinical relevance and statistical communication.
Collapse
Affiliation(s)
- Owen Forbes
- Centre for Data Science, School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
| | - Edgar Santos-Fernandez
- Centre for Data Science, School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
| | - Paul Pao-Yen Wu
- Centre for Data Science, School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
| | - Hong-Bo Xie
- Centre for Data Science, School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
- School of Information Science and Engineering, Yunnan University, Kunming, China
| | - Paul E. Schwenn
- UQ Poche Centre for Indigenous Health, The University of Queensland, Brisbane, QLD, Australia
| | - Jim Lagopoulos
- Thompson Institute, University of the Sunshine Coast, Birtinya, QLD, Australia
| | - Lia Mills
- Thompson Institute, University of the Sunshine Coast, Birtinya, QLD, Australia
| | - Dashiell D. Sacks
- Thompson Institute, University of the Sunshine Coast, Birtinya, QLD, Australia
| | - Daniel F. Hermens
- Thompson Institute, University of the Sunshine Coast, Birtinya, QLD, Australia
| | - Kerrie Mengersen
- Centre for Data Science, School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
| |
Collapse
|
6
|
Holmes CC, Walker SG. Statistical inference with exchangeability and martingales. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2023; 381:20220143. [PMID: 36970832 PMCID: PMC10041353 DOI: 10.1098/rsta.2022.0143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
In this paper, we start by reviewing exchangeability and its relevance to the Bayesian approach. We highlight the predictive nature of Bayesian models and the symmetry assumptions implied by beliefs of an underlying exchangeable sequence of observations. By taking a closer look at the Bayesian bootstrap, the parametric bootstrap of Efron and a version of Bayesian thinking about inference uncovered by Doob based on martingales, we introduce a parametric Bayesian bootstrap. Martingales play a fundamental role. Illustrations are presented as is the relevant theory. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'.
Collapse
Affiliation(s)
| | - Stephen G. Walker
- Department of Mathematics, University of Texas at Austin, Austin, TX, USA
- Department of Statistics & Scientific Computation, University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
7
|
He L, Wall D, Reeck C, Bhatia S. Information acquisition and decision strategies in intertemporal choice. Cogn Psychol 2023; 142:101562. [PMID: 36996641 DOI: 10.1016/j.cogpsych.2023.101562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 03/16/2023] [Accepted: 03/21/2023] [Indexed: 03/30/2023]
Abstract
Intertemporal decision models describe choices between outcomes with different delays. While these models mainly focus on predicting choices, they make implicit assumptions about how people acquire and process information. A link between information processing and choice model predictions is necessary for a complete mechanistic account of decision making. We establish this link by fitting 18 intertemporal choice models to experimental datasets with both choice and information acquisition data. First, we show that choice models have highly correlated fits: people that behave according to one model also behave according to other models that make similar information processing assumptions. Second, we develop and fit an attention model to information acquisition data. Critically, the attention model parameters predict which type of intertemporal choice models best describes a participant's choices. Overall, our results relate attentional processes to models of intertemporal choice, providing a stepping stone towards a complete mechanistic account of intertemporal decision making.
Collapse
|
8
|
Hoggart C, Choi SW, García-González J, Souaiaia T, Preuss M, O'Reilly P. BridgePRS : A powerful trans-ancestry Polygenic Risk Score method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.17.528938. [PMID: 36865148 PMCID: PMC9979992 DOI: 10.1101/2023.02.17.528938] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/26/2023]
Abstract
Polygenic Risk Scores (PRS) have huge potential to contribute to biomedical research and to a future of precision medicine, but to date their calculation relies largely on Europeanancestry GWAS data. This global bias makes most PRS substantially less accurate in individuals of non-European ancestry. Here we present BridgePRS , a novel Bayesian PRS method that leverages shared genetic effects across ancestries to increase the accuracy of PRS in non-European populations. The performance of BridgePRS is evaluated in simulated data and real UK Biobank (UKB) data across 19 traits in African, South Asian and East Asian ancestry individuals, using both UKB and Biobank Japan GWAS summary statistics. BridgePRS is compared to the leading alternative, PRS-CSx , and two single-ancestry PRS methods adapted for trans-ancestry prediction. PRS trained in the UK Biobank are then validated out-of-cohort in the independent Mount Sinai (New York) Bio Me Biobank. Simulations reveal that BridgePRS performance, relative to PRS-CSx , increases as uncertainty increases: with lower heritability, higher polygenicity, greater between-population genetic diversity, and when causal variants are not present in the data. Our simulation results are consistent with real data analyses in which BridgePRS has better predictive accuracy in African ancestry samples, especially in out-of-cohort prediction (into Bio Me ), which shows a 60% boost in mean R 2 compared to PRS-CSx ( P = 2 × 10 -6 ). BridgePRS performs the full PRS analysis pipeline, is computationally efficient, and is a powerful method for deriving PRS in diverse and under-represented ancestry populations.
Collapse
|
9
|
Bayes Factors for Mixed Models: Perspective on Responses. COMPUTATIONAL BRAIN & BEHAVIOR 2023; 6:127-139. [PMID: 36879767 PMCID: PMC9981503 DOI: 10.1007/s42113-022-00158-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 10/09/2022] [Indexed: 02/19/2023]
Abstract
In van Doorn et al. (2021), we outlined a series of open questions concerning Bayes factors for mixed effects model comparison, with an emphasis on the impact of aggregation, the effect of measurement error, the choice of prior distributions, and the detection of interactions. Seven expert commentaries (partially) addressed these initial questions. Surprisingly perhaps, the experts disagreed (often strongly) on what is best practice-a testament to the intricacy of conducting a mixed effect model comparison. Here, we provide our perspective on these comments and highlight topics that warrant further discussion. In general, we agree with many of the commentaries that in order to take full advantage of Bayesian mixed model comparison, it is important to be aware of the specific assumptions that underlie the to-be-compared models.
Collapse
|
10
|
Gianola D, Fernando RL, Schön CC. Inference about quantitative traits under selection: a Bayesian revisitation for the post-genomic era. Genet Sel Evol 2022; 54:78. [PMID: 36460973 PMCID: PMC9716705 DOI: 10.1186/s12711-022-00765-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 10/26/2022] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Selection schemes distort inference when estimating differences between treatments or genetic associations between traits, and may degrade prediction of outcomes, e.g., the expected performance of the progeny of an individual with a certain genotype. If input and output measurements are not collected on random samples, inferences and predictions must be biased to some degree. Our paper revisits inference in quantitative genetics when using samples stemming from some selection process. The approach used integrates the classical notion of fitness with that of missing data. Treatment is fully Bayesian, with inference and prediction dealt with, in an unified manner. While focus is on animal and plant breeding, concepts apply to natural selection as well. Examples based on real data and stylized models illustrate how selection can be accounted for in four different situations, and sometimes without success. RESULTS Our flexible "soft selection" setting helps to diagnose the extent to which selection can be ignored. The clear connection between probability of missingness and the concept of fitness in stylized selection scenarios is highlighted. It is not realistic to assume that a fixed selection threshold t holds in conceptual replication, as the chance of selection depends on observed and unobserved data, and on unequal amounts of information over individuals, aspects that a "soft" selection representation addresses explicitly. There does not seem to be a general prescription to accommodate potential distortions due to selection. In structures that combine cross-sectional, longitudinal and multi-trait data such as in animal breeding, balance is the exception rather than the rule. The Bayesian approach provides an integrated answer to inference, prediction and model choice under selection that goes beyond the likelihood-based approach, where breeding values are inferred indirectly. CONCLUSIONS The approach used here for inference and prediction under selection may or may not yield the best possible answers. One may believe that selection has been accounted for diligently, but the central problem of whether statistical inferences are good or bad does not have an unambiguous solution. On the other hand, the quality of predictions can be gauged empirically via appropriate training-testing of competing methods.
Collapse
Affiliation(s)
- Daniel Gianola
- grid.28803.310000 0001 0701 8607Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI USA
| | - Rohan L. Fernando
- grid.34421.300000 0004 1936 7312Department of Animal Science, Iowa State University, Ames, IA USA
| | - Chris C. Schön
- grid.6936.a0000000123222966Department of Plant Breeding, Technical University of Munich, Freising, Germany
| |
Collapse
|
11
|
Ouatu I, Spiers BT, Aboushelbaya R, Feng Q, von der Leyen MW, Paddock RW, Timmis R, Ticos C, Krushelnick KM, Norreys PA. Ionization states for the multipetawatt laser-QED regime. Phys Rev E 2022; 106:015205. [PMID: 35974572 DOI: 10.1103/physreve.106.015205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 06/30/2022] [Indexed: 06/15/2023]
Abstract
A paradigm shift in the physics of laser-plasma interactions is approaching with the commissioning of multipetawatt laser facilities worldwide. Radiation reaction processes will result in the onset of electron-positron pair cascades and, with that, the absorption and partitioning of the incident laser energy, as well as the energy transport throughout the irradiated targets. To accurately quantify these effects, one must know the focused intensity on target in situ. In this work, a way of measuring the focused intensity on target is proposed based upon the ionization of xenon gas at low ambient pressure. The field ionization rates from two works [Phys. Rev. A 59, 569 (1999)1050-294710.1103/PhysRevA.59.569 and Phys. Rev. A 98, 043407 (2018)2469-992610.1103/PhysRevA.98.043407], where the latter rate has been derived using quantum mechanics, have been implemented in the particle-in-cell code SMILEI [Comput. Phys. Commun. 222, 351 (2018)0010-465510.1016/j.cpc.2017.09.024]. A series of one- and two-dimensional simulations are compared and shown to reproduce the charge states without presenting visible differences when increasing the simulation dimensionality. They provide a way to accurately verify the intensity on target using in situ measurements.
Collapse
Affiliation(s)
- I Ouatu
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
| | - B T Spiers
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
- Central Laser Facility, UKRI-STFC Rutherford Appleton Laboratory, Didcot, Oxon OX11 0QX, United Kingdom
| | - R Aboushelbaya
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
| | - Q Feng
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
| | - M W von der Leyen
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
| | - R W Paddock
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
| | - R Timmis
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
| | - C Ticos
- Extreme Light Infrastructure-Nuclear Physics (ELI-NP), Horia Hulubei National Institute for Physics and Nuclear Engineering, Măgurele 077125, Romania
| | - K M Krushelnick
- Center for Ultra-Fast Optics, University of Michigan, Ann Arbor, Michigan, USA
| | - P A Norreys
- Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
- Central Laser Facility, UKRI-STFC Rutherford Appleton Laboratory, Didcot, Oxon OX11 0QX, United Kingdom
- John Adams Institute, Denys Wilkinson Building, Oxford OX1 3RH, United Kingdom
| |
Collapse
|
12
|
Fortuin V. Priors in Bayesian Deep Learning: A Review. Int Stat Rev 2022. [DOI: 10.1111/insr.12502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- Vincent Fortuin
- Department of Computer Science ETH Zürich Zürich Switzerland
| |
Collapse
|
13
|
Jersakova R, Lomax J, Hetherington J, Lehmann B, Nicholson G, Briers M, Holmes C. Bayesian imputation of COVID-19 positive test counts for nowcasting under reporting lag. J R Stat Soc Ser C Appl Stat 2022; 71:RSSC12557. [PMID: 35601481 PMCID: PMC9115539 DOI: 10.1111/rssc.12557] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 02/12/2022] [Indexed: 11/27/2022]
Abstract
Obtaining up to date information on the number of UK COVID-19 regional infections is hampered by the reporting lag in positive test results for people with COVID-19 symptoms. In the UK, for 'Pillar 2' swab tests for those showing symptoms, it can take up to five days for results to be collated. We make use of the stability of the under reporting process over time to motivate a statistical temporal model that infers the final total count given the partial count information as it arrives. We adopt a Bayesian approach that provides for subjective priors on parameters and a hierarchical structure for an underlying latent intensity process for the infection counts. This results in a smoothed time-series representation nowcasting the expected number of daily counts of positive tests with uncertainty bands that can be used to aid decision making. Inference is performed using sequential Monte Carlo.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Chris Holmes
- The Alan Turing InstituteLondonUK
- University of OxfordOxfordUK
| |
Collapse
|
14
|
Lage-Freitas A, Allende-Cid H, Santana O, Oliveira-Lage L. Predicting Brazilian Court Decisions. PeerJ Comput Sci 2022; 8:e904. [PMID: 35494851 PMCID: PMC9044329 DOI: 10.7717/peerj-cs.904] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 02/07/2022] [Indexed: 06/14/2023]
Abstract
Predicting case outcomes is useful for legal professionals to understand case law, file a lawsuit, raise a defense, or lodge appeals, for instance. However, it is very hard to predict legal decisions since this requires extracting valuable information from myriads of cases and other documents. Moreover, legal system complexity along with a huge volume of litigation make this problem even harder. This paper introduces an approach to predicting Brazilian court decisions, including whether they will be unanimous. Our methodology uses various machine learning algorithms, including classifiers and state-of-the-art Deep Learning models. We developed a working prototype whose F1-score performance is ~80.2% by using 4,043 cases from a Brazilian court. To our knowledge, this is the first study to present methods for predicting Brazilian court decision outcomes.
Collapse
Affiliation(s)
- André Lage-Freitas
- Universidade Federal de Alagoas, Maceió, Brazil
- JusPredict, Salvador, Brazil
| | - Héctor Allende-Cid
- JusPredict, Salvador, Brazil
- Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile
| | - Orivaldo Santana
- JusPredict, Salvador, Brazil
- Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | | |
Collapse
|
15
|
Liakoni V, Lehmann MP, Modirshanechi A, Brea J, Lutti A, Gerstner W, Preuschoff K. Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making. Neuroimage 2021; 246:118780. [PMID: 34875383 DOI: 10.1016/j.neuroimage.2021.118780] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 08/03/2021] [Accepted: 12/04/2021] [Indexed: 11/25/2022] Open
Abstract
Learning how to reach a reward over long series of actions is a remarkable capability of humans, and potentially guided by multiple parallel learning modules. Current brain imaging of learning modules is limited by (i) simple experimental paradigms, (ii) entanglement of brain signals of different learning modules, and (iii) a limited number of computational models considered as candidates for explaining behavior. Here, we address these three limitations and (i) introduce a complex sequential decision making task with surprising events that allows us to (ii) dissociate correlates of reward prediction errors from those of surprise in functional magnetic resonance imaging (fMRI); and (iii) we test behavior against a large repertoire of model-free, model-based, and hybrid reinforcement learning algorithms, including a novel surprise-modulated actor-critic algorithm. Surprise, derived from an approximate Bayesian approach for learning the world-model, is extracted in our algorithm from a state prediction error. Surprise is then used to modulate the learning rate of a model-free actor, which itself learns via the reward prediction error from model-free value estimation by the critic. We find that action choices are well explained by pure model-free policy gradient, but reaction times and neural data are not. We identify signatures of both model-free and surprise-based learning signals in blood oxygen level dependent (BOLD) responses, supporting the existence of multiple parallel learning modules in the brain. Our results extend previous fMRI findings to a multi-step setting and emphasize the role of policy gradient and surprise signalling in human learning.
Collapse
Affiliation(s)
- Vasiliki Liakoni
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland.
| | - Marco P Lehmann
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Alireza Modirshanechi
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Johanni Brea
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Antoine Lutti
- Laboratoire de recherche en neuroimagerie (LREN), Department of Clinical Neurosciences, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Wulfram Gerstner
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Kerstin Preuschoff
- Geneva Finance Research Institute & Interfaculty Center for Affective Sciences, University of Geneva, Geneva, Switzerland
| |
Collapse
|
16
|
Abstract
Bayesian approaches for criterion based selection include the marginal likelihood based highest posterior model (HPM) and the deviance information criterion (DIC). The DIC is popular in practice as it can often be estimated from sampling based methods with relative ease and DIC is readily available in various Bayesian software. We find that sensitivity of DIC based selection can be high, in the range of 90 - 100%. However, correct selection by DIC can be in the range of 0 - 2%. These performances persist consistently with increase in sample size. We establish that both marginal likelihood and DIC asymptotically disfavor under-fitted models, explaining the high sensitivities of both criteria. However, mis-selection probability of DIC remains bounded below by a positive constant in linear models with g -priors whereas mis-selection probability by marginal likelihood converges to 0 under certain conditions. A consequence of our results is that not only the DIC cannot asymptotically differentiate between the data-generating and an over-fitted model, but, in fact, it cannot asymptotically differentiate between two over-fitted models as well. We illustrate these results in multiple simulation studies and in a biomarker selection problem on cancer cachexia of non-small cell lung cancer patients. We further study performances of HPM and DIC in generalized linear model as practitioners often choose to use DIC that is readily available in software in such non-conjugate settings.
Collapse
Affiliation(s)
| | - Sanjib Basu
- University of Illinois at Chicago, Chicago, IL
| | | |
Collapse
|
17
|
Xu HA, Modirshanechi A, Lehmann MP, Gerstner W, Herzog MH. Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making. PLoS Comput Biol 2021; 17:e1009070. [PMID: 34081705 PMCID: PMC8205159 DOI: 10.1371/journal.pcbi.1009070] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 06/15/2021] [Accepted: 05/12/2021] [Indexed: 11/19/2022] Open
Abstract
Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants in these environments, we show that RL theories need to include surprise and novelty, each with a distinct role. While novelty drives exploration before the first encounter of a reward, surprise increases the rate of learning of a world-model as well as of model-free action-values. Even though the world-model is available for model-based RL, we find that human decisions are dominated by model-free action choices. The world-model is only marginally used for planning, but it is important to detect surprising events. Our theory predicts human action choices with high probability and allows us to dissociate surprise, novelty, and reward in EEG signals.
Collapse
Affiliation(s)
- He A. Xu
- Laboratory of Psychophysics, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Alireza Modirshanechi
- Brain-Mind Institute, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- School of Computer and Communication Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Marco P. Lehmann
- Brain-Mind Institute, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- School of Computer and Communication Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Wulfram Gerstner
- Brain-Mind Institute, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- School of Computer and Communication Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Michael H. Herzog
- Laboratory of Psychophysics, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Brain-Mind Institute, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
18
|
Demirkaya E, Feng Y, Basu P, Lv J. Large-scale model selection in misspecified generalized linear models. Biometrika 2021. [DOI: 10.1093/biomet/asab005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Summary
Model selection is crucial both to high-dimensional learning and to inference for contemporary big data applications in pinpointing the best set of covariates among a sequence of candidate interpretable models. Most existing work implicitly assumes that the models are correctly specified or have fixed dimensionality, yet both model misspecification and high dimensionality are prevalent in practice. In this paper, we exploit the framework of model selection principles under the misspecified generalized linear models presented in Lv & Liu (2014), and investigate the asymptotic expansion of the posterior model probability in the setting of high-dimensional misspecified models. With a natural choice of prior probabilities that encourages interpretability and incorporates the Kullback–Leibler divergence, we suggest using the high-dimensional generalized Bayesian information criterion with prior probability for large-scale model selection with misspecification. Our new information criterion characterizes the impacts of both model misspecification and high dimensionality on model selection. We further establish the consistency of covariance contrast matrix estimation and the model selection consistency of the new information criterion in ultrahigh dimensions under some mild regularity conditions. Our numerical studies demonstrate that the proposed method enjoys improved model selection consistency over its main competitors.
Collapse
|