Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Hoggart CJ, Choi SW, García-González J, Souaiaia T, Preuss M, O'Reilly PF. BridgePRS leverages shared genetic effects across ancestries to increase polygenic risk score portability. Nat Genet 2024;56:180-186. [PMID: 38123642 PMCID: PMC10786716 DOI: 10.1038/s41588-023-01583-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 10/20/2023] [Indexed: 12/23/2023]

Wang W, Qi F, Wipf DP, Cai C, Yu T, Li Y, Zhang Y, Yu Z, Wu W. Sparse Bayesian Learning for End-to-End EEG Decoding. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023;45:15632-15649. [PMID: 37506000 DOI: 10.1109/tpami.2023.3299568] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2023]

Vamvourellis K, Kalogeropoulos K, Moustaki I. Assessment of generalised Bayesian structural equation models for continuous and binary data. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2023;76:559-584. [PMID: 37401608 DOI: 10.1111/bmsp.12314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 04/17/2023] [Indexed: 07/05/2023]

Ozelim LCDSM, Ribeiro DB, Schiavon JA, Domingues VR, de Queiroz PIB. HPOSS: A hierarchical portfolio optimization stacking strategy to reduce the generalization error of ensembles of models. PLoS One 2023;18:e0290331. [PMID: 37651433 PMCID: PMC10470931 DOI: 10.1371/journal.pone.0290331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 08/04/2023] [Indexed: 09/02/2023] Open

Abstract

Surrogate models are frequently used to replace costly engineering simulations. A single surrogate is frequently chosen based on previous experience or by fitting multiple surrogates and selecting one based on mean cross-validation errors. A novel stacking strategy will be presented in this paper. This new strategy results from reinterpreting the model selection process based on the generalization error. For the first time, this problem is proposed to be translated into a well-studied financial problem: portfolio management and optimization. In short, it is demonstrated that the individual residues calculated by leave-one-out procedures are samples from a given random variable ϵi, whose second non-central moment is the i-th model's generalization error. Thus, a stacking methodology based solely on evaluating the behavior of the linear combination of the random variables ϵi is proposed. At first, several surrogate models are calibrated. The Directed Bubble Hierarchical Tree (DBHT) clustering algorithm is then used to determine which models are worth stacking. The stacking weights can be calculated using any financial approach to the portfolio optimization problem. This alternative understanding of the problem enables practitioners to use established financial methodologies to calculate the models' weights, significantly improving the ensemble of models' out-of-sample performance. A study case is carried out to demonstrate the applicability of the new methodology. Overall, a total of 124 models were trained using a specific dataset: 40 Machine Learning models and 84 Polynomial Chaos Expansion models (which considered 3 types of base random variables, 7 least square algorithms for fitting the up to fourth order expansion's coefficients). Among those, 99 models could be fitted without convergence and other numerical issues. The DBHT algorithm with Pearson correlation distance and generalization error similarity was able to select a subgroup of 23 models from the 99 fitted ones, implying a reduction of about 77% in the total number of models, representing a good filtering scheme which still preserves diversity. Finally, it has been demonstrated that the weights obtained by building a Hierarchical Risk Parity (HPR) portfolio perform better for various input random variables, indicating better out-of-sample performance. In this way, an economic stacking strategy has demonstrated its worth in improving the out-of-sample capabilities of stacked models, which illustrates how the new understanding of model stacking methodologies may be useful.

Collapse

Forbes O, Santos-Fernandez E, Wu PPY, Xie HB, Schwenn PE, Lagopoulos J, Mills L, Sacks DD, Hermens DF, Mengersen K. clusterBMA: Bayesian model averaging for clustering. PLoS One 2023;18:e0288000. [PMID: 37603575 PMCID: PMC10441802 DOI: 10.1371/journal.pone.0288000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/16/2023] [Indexed: 08/23/2023] Open

Abstract

Various methods have been developed to combine inference across multiple sets of results for unsupervised clustering, within the ensemble clustering literature. The approach of reporting results from one 'best' model out of several candidate clustering models generally ignores the uncertainty that arises from model selection, and results in inferences that are sensitive to the particular model and parameters chosen. Bayesian model averaging (BMA) is a popular approach for combining results across multiple models that offers some attractive benefits in this setting, including probabilistic interpretation of the combined cluster structure and quantification of model-based uncertainty. In this work we introduce clusterBMA, a method that enables weighted model averaging across results from multiple unsupervised clustering algorithms. We use clustering internal validation criteria to develop an approximation of the posterior model probability, used for weighting the results from each model. From a combined posterior similarity matrix representing a weighted average of the clustering solutions across models, we apply symmetric simplex matrix factorisation to calculate final probabilistic cluster allocations. In addition to outperforming other ensemble clustering methods on simulated data, clusterBMA offers unique features including probabilistic allocation to averaged clusters, combining allocation probabilities from 'hard' and 'soft' clustering algorithms, and measuring model-based uncertainty in averaged cluster allocation. This method is implemented in an accompanying R package of the same name. We use simulated datasets to explore the ability of the proposed technique to identify robust integrated clusters with varying levels of separation between subgroups, and with varying numbers of clusters between models. Benchmarking accuracy against four other ensemble methods previously demonstrated to be highly effective in the literature, clusterBMA matches or exceeds the performance of competing approaches under various conditions of dimensionality and cluster separation. clusterBMA substantially outperformed other ensemble methods for high dimensional simulated data with low cluster separation, with 1.16 to 7.12 times better performance as measured by the Adjusted Rand Index. We also explore the performance of this approach through a case study that aims to identify probabilistic clusters of individuals based on electroencephalography (EEG) data. In applied settings for clustering individuals based on health data, the features of probabilistic allocation and measurement of model-based uncertainty in averaged clusters are useful for clinical relevance and statistical communication.

Collapse

Holmes CC, Walker SG. Statistical inference with exchangeability and martingales. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2023;381:20220143. [PMID: 36970832 PMCID: PMC10041353 DOI: 10.1098/rsta.2022.0143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 01/30/2023] [Indexed: 06/18/2023]

He L, Wall D, Reeck C, Bhatia S. Information acquisition and decision strategies in intertemporal choice. Cogn Psychol 2023;142:101562. [PMID: 36996641 DOI: 10.1016/j.cogpsych.2023.101562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 03/16/2023] [Accepted: 03/21/2023] [Indexed: 03/30/2023]

Hoggart C, Choi SW, García-González J, Souaiaia T, Preuss M, O'Reilly P. BridgePRS : A powerful trans-ancestry Polygenic Risk Score method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.17.528938. [PMID: 36865148 PMCID: PMC9979992 DOI: 10.1101/2023.02.17.528938] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/26/2023]

Bayes Factors for Mixed Models: Perspective on Responses. COMPUTATIONAL BRAIN & BEHAVIOR 2023;6:127-139. [PMID: 36879767 PMCID: PMC9981503 DOI: 10.1007/s42113-022-00158-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 10/09/2022] [Indexed: 02/19/2023]

Gianola D, Fernando RL, Schön CC. Inference about quantitative traits under selection: a Bayesian revisitation for the post-genomic era. Genet Sel Evol 2022;54:78. [PMID: 36460973 PMCID: PMC9716705 DOI: 10.1186/s12711-022-00765-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 10/26/2022] [Indexed: 12/03/2022] Open

Abstract

BACKGROUND

Selection schemes distort inference when estimating differences between treatments or genetic associations between traits, and may degrade prediction of outcomes, e.g., the expected performance of the progeny of an individual with a certain genotype. If input and output measurements are not collected on random samples, inferences and predictions must be biased to some degree. Our paper revisits inference in quantitative genetics when using samples stemming from some selection process. The approach used integrates the classical notion of fitness with that of missing data. Treatment is fully Bayesian, with inference and prediction dealt with, in an unified manner. While focus is on animal and plant breeding, concepts apply to natural selection as well. Examples based on real data and stylized models illustrate how selection can be accounted for in four different situations, and sometimes without success.

RESULTS

Our flexible "soft selection" setting helps to diagnose the extent to which selection can be ignored. The clear connection between probability of missingness and the concept of fitness in stylized selection scenarios is highlighted. It is not realistic to assume that a fixed selection threshold t holds in conceptual replication, as the chance of selection depends on observed and unobserved data, and on unequal amounts of information over individuals, aspects that a "soft" selection representation addresses explicitly. There does not seem to be a general prescription to accommodate potential distortions due to selection. In structures that combine cross-sectional, longitudinal and multi-trait data such as in animal breeding, balance is the exception rather than the rule. The Bayesian approach provides an integrated answer to inference, prediction and model choice under selection that goes beyond the likelihood-based approach, where breeding values are inferred indirectly.

CONCLUSIONS

The approach used here for inference and prediction under selection may or may not yield the best possible answers. One may believe that selection has been accounted for diligently, but the central problem of whether statistical inferences are good or bad does not have an unambiguous solution. On the other hand, the quality of predictions can be gauged empirically via appropriate training-testing of competing methods.

Collapse

Ouatu I, Spiers BT, Aboushelbaya R, Feng Q, von der Leyen MW, Paddock RW, Timmis R, Ticos C, Krushelnick KM, Norreys PA. Ionization states for the multipetawatt laser-QED regime. Phys Rev E 2022;106:015205. [PMID: 35974572 DOI: 10.1103/physreve.106.015205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 06/30/2022] [Indexed: 06/15/2023]

Affiliation(s)

I Ouatu Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
B T Spiers Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom Central Laser Facility, UKRI-STFC Rutherford Appleton Laboratory, Didcot, Oxon OX11 0QX, United Kingdom
R Aboushelbaya Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
Q Feng Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
M W von der Leyen Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
R W Paddock Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
R Timmis Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom
C Ticos Extreme Light Infrastructure-Nuclear Physics (ELI-NP), Horia Hulubei National Institute for Physics and Nuclear Engineering, Măgurele 077125, Romania
K M Krushelnick Center for Ultra-Fast Optics, University of Michigan, Ann Arbor, Michigan, USA
P A Norreys Department of Physics, Atomic and Laser Physics sub-Department, Clarendon Laboratory, University of Oxford, Parks Road, Oxford OX1 3PU, United Kingdom Central Laser Facility, UKRI-STFC Rutherford Appleton Laboratory, Didcot, Oxon OX11 0QX, United Kingdom John Adams Institute, Denys Wilkinson Building, Oxford OX1 3RH, United Kingdom

Collapse

Fortuin V. Priors in Bayesian Deep Learning: A Review. Int Stat Rev 2022. [DOI: 10.1111/insr.12502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Jersakova R, Lomax J, Hetherington J, Lehmann B, Nicholson G, Briers M, Holmes C. Bayesian imputation of COVID-19 positive test counts for nowcasting under reporting lag. J R Stat Soc Ser C Appl Stat 2022;71:RSSC12557. [PMID: 35601481 PMCID: PMC9115539 DOI: 10.1111/rssc.12557] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 02/12/2022] [Indexed: 11/27/2022]

Lage-Freitas A, Allende-Cid H, Santana O, Oliveira-Lage L. Predicting Brazilian Court Decisions. PeerJ Comput Sci 2022;8:e904. [PMID: 35494851 PMCID: PMC9044329 DOI: 10.7717/peerj-cs.904] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 02/07/2022] [Indexed: 06/14/2023]

Liakoni V, Lehmann MP, Modirshanechi A, Brea J, Lutti A, Gerstner W, Preuschoff K. Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making. Neuroimage 2021;246:118780. [PMID: 34875383 DOI: 10.1016/j.neuroimage.2021.118780] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 08/03/2021] [Accepted: 12/04/2021] [Indexed: 11/25/2022] Open

Maity AK, Basu S, Ghosh S. Bayesian Criterion Based Variable Selection. J R Stat Soc Ser C Appl Stat 2021;70:835-857. [PMID: 38863987 PMCID: PMC11166016 DOI: 10.1111/rssc.12488] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Xu HA, Modirshanechi A, Lehmann MP, Gerstner W, Herzog MH. Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making. PLoS Comput Biol 2021;17:e1009070. [PMID: 34081705 PMCID: PMC8205159 DOI: 10.1371/journal.pcbi.1009070] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 06/15/2021] [Accepted: 05/12/2021] [Indexed: 11/19/2022] Open

Demirkaya E, Feng Y, Basu P, Lv J. Large-scale model selection in misspecified generalized linear models. Biometrika 2021. [DOI: 10.1093/biomet/asab005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open