1
Galstyan V, Saakian DB. Quantifying the stochasticity of policy parameters in reinforcement learning problems. Phys Rev E 2023; 107:034112. [PMID: 37072940] [DOI: 10.1103/physreve.107.034112]
Abstract
The stochastic dynamics of reinforcement learning is studied using a master equation formalism. We consider two different problems: Q-learning for a two-agent game, and the multiarmed bandit problem with policy gradient as the learning method. The master equation is constructed by introducing a probability distribution over continuous policy parameters or over both continuous policy parameters and discrete state variables (a more advanced case). We use a version of the moment closure approximation to solve for the stochastic dynamics of the models. Our method gives accurate estimates for the mean and the (co)variance of policy variables. For the case of the two-agent game, we find that the variance terms are finite at steady state and derive a system of algebraic equations for computing them directly.
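As a rough sketch of the moment-closure step this abstract refers to (the notation and the second-order Gaussian closure are illustrative assumptions, not taken from the paper), the moment hierarchy for a single policy parameter θ with drift f and diffusion D can be truncated by discarding third and higher central moments:

```latex
% Generic second-order (Gaussian) moment closure, illustrative only:
% third and higher central moments of theta are set to zero.
\begin{aligned}
\frac{d\langle\theta\rangle}{dt} &\approx f(\langle\theta\rangle)
    + \tfrac{1}{2}\, f''(\langle\theta\rangle)\,\sigma^{2}, \\
\frac{d\sigma^{2}}{dt} &\approx 2 f'(\langle\theta\rangle)\,\sigma^{2}
    + D(\langle\theta\rangle),
\end{aligned}
```

where σ² = Var(θ). Closing the hierarchy at second order is what makes the mean and (co)variance directly computable.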
Affiliation(s)
- Vahe Galstyan
- AMOLF, Science Park 104, 1098 XG Amsterdam, Netherlands
- A.I. Alikhanyan National Science Laboratory (Yerevan Physics Institute) Foundation, 2 Alikhanian Brothers Street, Yerevan 375036, Armenia
- David B Saakian
- A.I. Alikhanyan National Science Laboratory (Yerevan Physics Institute) Foundation, 2 Alikhanian Brothers Street, Yerevan 375036, Armenia
2
Leonardos S, Sakos J, Courcoubetis C, Piliouras G. Catastrophe by Design in Population Games: A Mechanism to Destabilize Inefficient Locked-in Technologies. ACM TRANSACTIONS ON ECONOMICS AND COMPUTATION 2023. [DOI: 10.1145/3583782]
Abstract
In multi-agent environments in which coordination is desirable, the history of play often causes lock-in at sub-optimal outcomes. Notoriously, technologies with significant environmental footprint or high social cost persist despite the successful development of more environmentally friendly and/or socially efficient alternatives. The displacement of the status quo is hindered by entrenched economic interests and network effects. To exacerbate matters, the standard mechanism design approaches based on centralized authorities with the capacity to use preferential subsidies to effectively dictate system outcomes are not always applicable to modern decentralised economies. What other types of mechanisms are feasible?
In this paper, we develop and analyze a mechanism which induces transitions from inefficient lock-ins to superior alternatives. This mechanism does not exogenously favor one option over another; instead, the phase transition emerges endogenously via a standard evolutionary learning model, Q-learning, where agents trade off exploration and exploitation. Exerting the same transient influence on both the efficient and inefficient technologies encourages exploration and results in irreversible phase transitions and permanent stabilization of the efficient one. On a technical level, our work is based on bifurcation and catastrophe theory, a branch of mathematics that deals with changes in the number and stability properties of equilibria. Critically, our analysis is shown to be structurally robust to significant and even adversarially chosen perturbations to the parameters of both our game and our behavioral model.
Affiliation(s)
- Joseph Sakos
- Singapore University of Technology and Design, Singapore
3
Banisch S, Gaisbauer F, Olbrich E. Modelling Spirals of Silence and Echo Chambers by Learning from the Feedback of Others. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1484. [PMID: 37420504] [DOI: 10.3390/e24101484]
Abstract
What are the mechanisms by which groups with certain opinions gain public voice and force others holding a different view into silence? Furthermore, how does social media play into this? Drawing on neuroscientific insights into the processing of social feedback, we develop a theoretical model that allows us to address these questions. In repeated interactions, individuals learn whether their opinion meets public approval and refrain from expressing their standpoint if it is socially sanctioned. In a social network sorted around opinions, an agent forms a distorted impression of public opinion enforced by the communicative activity of the different camps. Even strong majorities can be forced into silence if a minority acts as a cohesive whole. On the other hand, the strong social organisation around opinions enabled by digital platforms favours collective regimes in which opposing voices are expressed and compete for primacy in public. This paper highlights the role that the basic mechanisms of social information processing play in massive computer-mediated interactions on opinions.
Affiliation(s)
- Sven Banisch
- Institute of Technology Futures, Karlsruhe Institute of Technology, 76133 Karlsruhe, Germany
- Max Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany
- Felix Gaisbauer
- Max Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany
- Eckehard Olbrich
- Max Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany
4
Barfuss W. Dynamical systems as a level of cognitive analysis of multi-agent learning: Algorithmic foundations of temporal-difference learning dynamics. Neural Comput Appl 2022; 34:1653-1671. [PMID: 35221541] [PMCID: PMC8827307] [DOI: 10.1007/s00521-021-06117-0]
Abstract
A dynamical systems perspective on multi-agent learning, based on the link between evolutionary game theory and reinforcement learning, provides an improved, qualitative understanding of the emerging collective learning dynamics. However, confusion exists with respect to how this dynamical systems account of multi-agent learning should be interpreted. In this article, I propose to embed the dynamical systems description of multi-agent learning into different abstraction levels of cognitive analysis. The purpose of this work is to make the connections between these levels explicit in order to gain improved insight into multi-agent learning. I demonstrate the usefulness of this framework with the general and widespread class of temporal-difference reinforcement learning. I find that its deterministic dynamical systems description follows a minimum free-energy principle and unifies a boundedly rational account of game theory with decision-making under uncertainty. I then propose an on-line sample-batch temporal-difference algorithm which is characterized by the combination of applying a memory-batch and separated state-action value estimation. I find that this algorithm serves as a micro-foundation of the deterministic learning equations by showing that its learning trajectories approach the ones of the deterministic learning equations under large batch sizes. Ultimately, this framework of embedding a dynamical systems description into different abstraction levels gives guidance on how to unleash the full potential of the dynamical systems approach to multi-agent learning.
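As a toy illustration of the kind of learner these dynamics describe, here is a minimal stateless TD(0)/Q-learning sketch with a Boltzmann (softmax) policy. It is not the paper's sample-batch algorithm; the reward values, learning rate, and intensity-of-choice parameter are illustrative assumptions.

```python
import math
import random

def softmax_policy(q, beta):
    """Boltzmann (softmax) action probabilities with intensity of choice beta."""
    weights = [math.exp(beta * qi) for qi in q]
    total = sum(weights)
    return [w / total for w in weights]

def sample(probs, rng):
    """Draw an index from a discrete probability distribution."""
    r, acc = rng.random(), 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1  # guard against floating-point round-off

def run_td_bandit(rewards, beta=2.0, alpha=0.1, steps=5000, seed=0):
    """Stateless TD(0)/Q-learning; rewards[a] is the deterministic payoff of action a."""
    rng = random.Random(seed)
    q = [0.0] * len(rewards)
    for _ in range(steps):
        a = sample(softmax_policy(q, beta), rng)
        q[a] += alpha * (rewards[a] - q[a])  # TD error with zero bootstrap term
    return q

q = run_td_bandit([1.0, 0.5])
```

With deterministic payoffs, each Q-value converges to the payoff of its action, while the softmax policy keeps exploring both actions at a rate set by beta.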
Affiliation(s)
- Wolfram Barfuss
- School of Mathematics, University of Leeds, Leeds, UK
- Tübingen AI Center, University of Tübingen, Tübingen, Germany
5
Leonardos S, Piliouras G. Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory. ARTIF INTELL 2022. [DOI: 10.1016/j.artint.2021.103653]
6
Lauffenburger JC, Yom-Tov E, Keller PA, McDonnell ME, Bessette LG, Fontanet CP, Sears ES, Kim E, Hanken K, Buckley JJ, Barlev RA, Haff N, Choudhry NK. REinforcement learning to improve non-adherence for diabetes treatments by Optimising Response and Customising Engagement (REINFORCE): study protocol of a pragmatic randomised trial. BMJ Open 2021; 11:e052091. [PMID: 34862289] [PMCID: PMC8647547] [DOI: 10.1136/bmjopen-2021-052091]
Abstract
INTRODUCTION: Achieving optimal diabetes control requires several daily self-management behaviours, especially adherence to medication. Evidence supports the use of text messages to support adherence, but there remains much opportunity to improve their effectiveness. One key limitation is that message content has been generic. By contrast, reinforcement learning is a machine learning method that can be used to identify individuals' patterns of responsiveness by observing their response to cues and then optimising them accordingly. Despite its demonstrated benefits outside of healthcare, its application to tailoring communication for patients has received limited attention. The objective of this trial is to test the impact of a reinforcement learning-based text messaging programme on adherence to medication for patients with type 2 diabetes.
METHODS AND ANALYSIS: In the REinforcement learning to Improve Non-adherence For diabetes treatments by Optimising Response and Customising Engagement (REINFORCE) trial, we are randomising 60 patients with suboptimal diabetes control treated with oral diabetes medications to receive a reinforcement learning intervention or control. Subjects in both arms will receive electronic pill bottles to use, and those in the intervention arm will receive up to daily text messages. The messages will be individually adapted using a reinforcement learning prediction algorithm based on daily adherence measurements from the pill bottles. The trial's primary outcome is average adherence to medication over the 6-month follow-up period. Secondary outcomes include diabetes control, measured by glycated haemoglobin A1c, and self-reported adherence. In sum, the REINFORCE trial will evaluate the effect of personalising the framing of text messages for patients to support medication adherence and provide insight into how this could be adapted at scale to improve other self-management interventions.
ETHICS AND DISSEMINATION: This study was approved by the Mass General Brigham Institutional Review Board (IRB) (USA). Findings will be disseminated through peer-reviewed journals, clinicaltrials.gov reporting and conferences.
TRIAL REGISTRATION NUMBER: Clinicaltrials.gov (NCT04473326).
Affiliation(s)
- Julie C Lauffenburger
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Elad Yom-Tov
- Microsoft Research, Microsoft, Herzeliya, Israel
- Punam A Keller
- Tuck School of Business, Dartmouth College, Hanover, NH, USA
- Marie E McDonnell
- Endocrinology, Diabetes and Hypertension, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Lily G Bessette
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Constance P Fontanet
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Ellen S Sears
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Erin Kim
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Kaitlin Hanken
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- J Joseph Buckley
- Division of Sleep Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Renee A Barlev
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Nancy Haff
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Niteesh K Choudhry
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
7
Gaisbauer F, Olbrich E, Banisch S. Dynamics of opinion expression. Phys Rev E 2020; 102:042303. [PMID: 33212677] [DOI: 10.1103/physreve.102.042303]
Abstract
Modeling efforts in opinion dynamics have to a large extent ignored that opinion exchange between individuals can also affect how willing they are to express their opinion publicly. Here, we introduce a model of public opinion expression. Two groups of agents with different opinions on an issue interact with each other, changing their willingness to express their opinion according to whether they perceive themselves as part of the majority or minority. We formulate the model as a multigroup majority game and investigate the Nash equilibria. We also provide a dynamical systems perspective: using the reinforcement learning algorithm of Q-learning, we reduce the N-agent system in a mean-field approach to two dimensions which represent the two opinion groups. This two-dimensional system is analyzed in a comprehensive bifurcation analysis of its parameters. The model identifies social-structural conditions for public opinion predominance of different groups. Among other findings, we show under which circumstances a minority can dominate public discourse.
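Mean-field reductions of Q-learning of the kind mentioned here are commonly built on the continuous-time limit of Boltzmann Q-learning, which couples a replicator (exploitation) term with an entropic exploration term. The constants and notation below follow the common convention in that literature and are illustrative, not necessarily the paper's:

```latex
% Continuous-time Boltzmann Q-learning dynamics (illustrative notation):
% x, y are the two groups' mixed strategies, A a payoff matrix,
% beta an intensity-of-choice (inverse exploration) parameter.
\dot{x}_i \;=\;
  \underbrace{x_i\,\beta\big[(A y)_i - x^{\mathsf T} A y\big]}_{\text{exploitation (replicator)}}
  \;+\;
  \underbrace{x_i \sum_j x_j \ln\frac{x_j}{x_i}}_{\text{exploration (entropy)}}
```

Varying beta is what typically produces the bifurcations studied in such models.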
Affiliation(s)
- Felix Gaisbauer
- Max Planck Institute for Mathematics in the Sciences, Inselstrasse 22, 04103 Leipzig, Germany
- Eckehard Olbrich
- Max Planck Institute for Mathematics in the Sciences, Inselstrasse 22, 04103 Leipzig, Germany
- Sven Banisch
- Max Planck Institute for Mathematics in the Sciences, Inselstrasse 22, 04103 Leipzig, Germany
8
Zhang SP, Dong JQ, Liu L, Huang ZG, Huang L, Lai YC. Reinforcement learning meets minority game: Toward optimal resource allocation. Phys Rev E 2019; 99:032302. [PMID: 30999513] [DOI: 10.1103/physreve.99.032302]
Abstract
This paper shows that reinforcement learning (RL) from artificial intelligence (AI) can eliminate herding, without any external control, in complex resource allocation systems. In particular, we demonstrate that when agents are empowered with RL (e.g., the popular Q-learning algorithm in AI), in that they gradually become familiar with the unknown game environment and attempt to deliver the optimal actions to maximize the payoff, herding can effectively be eliminated. Furthermore, computations reveal the striking phenomenon that, regardless of the initial state, the system evolves persistently and relentlessly toward the optimal state in which all resources are used efficiently. However, the evolution process is not without interruptions: large fluctuations occur, but only intermittently in time. The statistical distribution of the time between two successive fluctuating events is found to depend on the parity of the evolution, i.e., whether the number of time steps in between is odd or even. We develop a physical analysis and derive mean-field equations to gain an understanding of these phenomena. Since AI is becoming increasingly widespread, we expect our RL-empowered minority game system to have broad applications.
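A minimal toy version of RL agents in a minority game can make the setup concrete. This is not the paper's model: the 5-agent size, epsilon-greedy exploration, and all parameter values are illustrative assumptions.

```python
import random

class QAgent:
    """Stateless epsilon-greedy Q-learner with two actions (0 and 1)."""
    def __init__(self, rng, alpha=0.1, epsilon=0.05):
        self.q = [0.0, 0.0]
        self.rng = rng
        self.alpha = alpha
        self.epsilon = epsilon

    def act(self):
        # explore with probability epsilon, or break ties randomly
        if self.rng.random() < self.epsilon or self.q[0] == self.q[1]:
            return self.rng.randrange(2)
        return 0 if self.q[0] > self.q[1] else 1

    def learn(self, action, reward):
        self.q[action] += self.alpha * (reward - self.q[action])

def play_minority_game(n_agents=5, rounds=4000, seed=1):
    """Repeated minority game: agents on the minority side earn reward 1.

    Returns the minority-side size for each round (optimal is n_agents // 2).
    """
    rng = random.Random(seed)
    agents = [QAgent(random.Random(seed + i + 1)) for i in range(n_agents)]
    history = []
    for _ in range(rounds):
        actions = [a.act() for a in agents]
        ones = sum(actions)
        minority = 1 if ones < n_agents - ones else 0
        for agent, action in zip(agents, actions):
            agent.learn(action, 1.0 if action == minority else 0.0)
        history.append(min(ones, n_agents - ones))
    return history

history = play_minority_game()
```

Tracking the minority size over time is a simple way to see whether learning pushes the population away from herding (minority size 0) and toward the efficient split.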
Affiliation(s)
- Si-Ping Zhang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, The Key Laboratory of Neuro-informatics & Rehabilitation Engineering of Ministry of Civil Affairs, and Institute of Health and Rehabilitation Science, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
- Institute of Computational Physics and Complex Systems, Lanzhou University, Lanzhou 730000, China
- Jia-Qi Dong
- Institute of Computational Physics and Complex Systems, Lanzhou University, Lanzhou 730000, China
- Institute of Theoretical Physics, Key Laboratory of Theoretical Physics, Chinese Academy of Sciences, P.O. Box 2735, Beijing 100190, China
- Li Liu
- School of Software Engineering, Chongqing University, Chongqing 400044, China
- Zi-Gang Huang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, The Key Laboratory of Neuro-informatics & Rehabilitation Engineering of Ministry of Civil Affairs, and Institute of Health and Rehabilitation Science, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
- Liang Huang
- Institute of Computational Physics and Complex Systems, Lanzhou University, Lanzhou 730000, China
- Ying-Cheng Lai
- School of Electrical, Computer and Energy Engineering, Department of Physics, Arizona State University, Tempe, Arizona 85287, USA
9
Path planning of a mobile robot in a free-space environment using Q-learning. PROGRESS IN ARTIFICIAL INTELLIGENCE 2018. [DOI: 10.1007/s13748-018-00168-6]
10
11
Bifurcation Mechanism Design—From Optimal Flat Taxes to Better Cancer Treatments. GAMES 2018. [DOI: 10.3390/g9020021]
12
Li X, Cao R, Hao J. An Adaptive Learning Based Network Selection Approach for 5G Dynamic Environments. ENTROPY 2018; 20:e20040236. [PMID: 33265327] [PMCID: PMC7512751] [DOI: 10.3390/e20040236]
Abstract
Networks will continue to become increasingly heterogeneous as we move toward 5G. Meanwhile, the intelligent programming of the core network makes the available radio resources changeable rather than static. In such a dynamic and heterogeneous network environment, helping terminal users select the optimal network to access is challenging. Prior implementations of network selection are usually applicable to environments with static radio resources and cannot handle the unpredictable dynamics of 5G network environments. To this end, this paper considers both the fluctuation of radio resources and the variation of user demand. We model the access network selection scenario as a multiagent coordination problem, in which a set of rational terminal users compete to maximize their benefits with incomplete information about the environment (no prior knowledge of network resources or other users' choices). Then, an adaptive learning based strategy is proposed, which enables users to adaptively adjust their selections in response to the gradually or abruptly changing environment. The system is experimentally shown to converge to a Nash equilibrium, which also turns out to be both Pareto optimal and socially optimal. Extensive simulation results show that our approach achieves significantly better performance compared with two learning and non-learning based approaches in terms of load balancing, user payoff and overall bandwidth utilization efficiency. In addition, the system remains robust in the presence of non-compliant terminal users.
Affiliation(s)
- Xiaohong Li
- School of Computer Science and Technology, Tianjin University, Tianjin 300000, China
- Ru Cao
- School of Computer Science and Technology, Tianjin University, Tianjin 300000, China
- Jianye Hao
- School of Software, Tianjin University, Tianjin 300000, China
13
Zhang Z, Zhao D, Gao J, Wang D, Dai Y. FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:1367-1379. [PMID: 27101627] [DOI: 10.1109/tcyb.2016.2544866]
Abstract
In this paper, we propose a multiagent reinforcement learning algorithm for fully cooperative tasks, called frequency of the maximum reward Q-learning (FMRQ). FMRQ aims to achieve one of the optimal Nash equilibria so as to optimize the performance index in multiagent systems. The frequency of obtaining the highest global immediate reward, rather than the immediate reward itself, is used as the reinforcement signal. With FMRQ, each agent does not need to observe the other agents' actions and only shares its state and reward at each step. We validate FMRQ through case studies of repeated games: four cases of two-player two-action games and one case of a three-player two-action game. It is demonstrated that FMRQ can converge to one of the optimal Nash equilibria in these cases. Moreover, comparison experiments on tasks with multiple states and finite steps are conducted: one is box-pushing and the other is a distributed sensor network problem. Experimental results show that the proposed algorithm outperforms the alternatives.
14
Abstract
Melioration learning is an empirically well-grounded model of reinforcement learning. By means of computer simulations, this paper derives predictions for several repeatedly played two-person games from this model. The results indicate a likely convergence to a pure Nash equilibrium of the game. If no pure equilibrium exists, the relative frequencies of choice may approach the predictions of the mixed Nash equilibrium. Yet in some games, no stable state is reached.
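To make the melioration rule concrete, here is a minimal stateless sketch: the learner tracks the average reward earned per choice of each action and shifts choice probability toward whichever action currently has the higher average. This is a generic illustration, not the paper's simulation model; the payoffs, step size, and round count are illustrative assumptions.

```python
import random

def melioration_play(payoffs, rounds=3000, delta=0.02, seed=3):
    """Melioration learning on a two-action task with deterministic payoffs.

    Returns the final probability of choosing action 0.
    """
    rng = random.Random(seed)
    p = 0.5                               # probability of choosing action 0
    totals = [payoffs[0], payoffs[1]]     # prime each action with one sample
    counts = [1.0, 1.0]
    for _ in range(rounds):
        a = 0 if rng.random() < p else 1
        totals[a] += payoffs[a]
        counts[a] += 1
        means = [totals[i] / counts[i] for i in range(2)]
        # melioration: shift toward the action with the higher mean reward
        if means[0] > means[1]:
            p = min(1.0, p + delta)
        elif means[1] > means[0]:
            p = max(0.0, p - delta)
    return p

p = melioration_play([1.0, 0.2])
```

With a strictly dominant action, this rule converges to a pure choice, which mirrors the paper's finding that play often settles on a pure Nash equilibrium.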
Affiliation(s)
- Johannes Zschache
- Institute of Sociology, Leipzig University, Leipzig, Germany
15
Sun J, Wang L. The interaction between BIM's promotion and interest game under information asymmetry. Journal of Industrial and Management Optimization 2015. [DOI: 10.3934/jimo.2015.11.1301]
16
Kianercy A, Veltri R, Pienta KJ. Critical transitions in a game theoretic model of tumour metabolism. Interface Focus 2014; 4:20140014. [PMID: 25097747] [PMCID: PMC4071509] [DOI: 10.1098/rsfs.2014.0014]
Abstract
Tumour proliferation is promoted by an intratumoral metabolic symbiosis in which lactate from stromal cells fuels energy generation in the oxygenated domain of the tumour. Furthermore, empirical data show that tumour cells adopt an intermediate metabolic state between lactate respiration and glycolysis. This study models the metabolic symbiosis in the tumour through the formalism of evolutionary game theory. Our game model of metabolic symbiosis in cancer considers two types of tumour cells, hypoxic and oxygenated, while glucose and lactate are considered as the two main sources of energy within the tumour. The model confirms the presence of multiple intermediate stable states and hybrid energy strategies in the tumour. It predicts that nonlinear interaction between two subpopulations leads to tumour metabolic critical transitions and that tumours can obtain different intermediate states between glycolysis and respiration which can be regulated by the genomic mutation rate. The model can be applied to epithelial-stromal metabolic decoupling therapy.
Affiliation(s)
- Ardeshir Kianercy
- Brady Urological Institute, Johns Hopkins Hospital, Baltimore, MD 21287, USA
- Robert Veltri
- Brady Urological Institute, Johns Hopkins Hospital, Baltimore, MD 21287, USA
- Kenneth J Pienta
- Brady Urological Institute, Johns Hopkins Hospital, Baltimore, MD 21287, USA
17
Juul J, Kianercy A, Bernhardsson S, Pigolotti S. Replicator dynamics with turnover of players. Phys Rev E 2013; 88:022806. [PMID: 24032882] [DOI: 10.1103/physreve.88.022806]
Abstract
We study adaptive dynamics in games where players abandon the population at a given rate and are replaced by naive players characterized by a prior distribution over the admitted strategies. We demonstrate how such a process leads macroscopically to a variant of the replicator equation, with an additional term accounting for player turnover. We study how Nash equilibria and the dynamics of the system are modified by this additional term for prototypical examples such as the rock-paper-scissors game and different classes of two-action games played between two distinct populations. We conclude by showing how player turnover can account for nontrivial departures from Nash equilibria observed in data from lowest unique bid auctions.
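One plausible shape for a turnover-corrected replicator equation of the kind this abstract describes is a standard replicator term plus a relaxation toward the entrants' prior. The symbols λ and p below are illustrative; the paper's exact form may differ:

```latex
% Replicator dynamics with player turnover (illustrative sketch):
% lambda is the departure/replacement rate, p the prior strategy
% distribution of incoming naive players, A the payoff matrix.
\dot{x}_i \;=\; x_i\big[(A x)_i - x^{\mathsf T} A x\big]
  \;+\; \lambda\,(p_i - x_i)
```

For λ = 0 the ordinary replicator equation is recovered, while λ > 0 pulls rest points away from the Nash equilibria toward the entrants' prior, which is the kind of departure the abstract attributes to turnover.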
Affiliation(s)
- Jeppe Juul
- Niels Bohr Institute, Blegdamsvej 17, DK-2100 Copenhagen, Denmark
18
Kianercy A, Galstyan A. Coevolutionary networks of reinforcement-learning agents. Phys Rev E 2013; 88:012815. [PMID: 23944526] [DOI: 10.1103/physreve.88.012815]
Abstract
This paper presents a model of network formation in repeated games where the players adapt their strategies and network ties simultaneously using a simple reinforcement-learning scheme. It is demonstrated that the coevolutionary dynamics of such systems can be described via coupled replicator equations. We provide a comprehensive analysis for three-player two-action games, which is the minimum system size with nontrivial structural dynamics. In particular, we characterize the Nash equilibria (NE) in such games and examine the local stability of the rest points corresponding to those equilibria. We also study general n-player networks via both simulations and analytical methods and find that, in the absence of exploration, the stable equilibria consist of star motifs as the main building blocks of the network. Furthermore, in all stable equilibria the agents play pure strategies, even when the game allows mixed NE. Finally, we study the impact of exploration on learning outcomes and observe that there is a critical exploration rate above which the symmetric and uniformly connected network topology becomes stable.
Collapse
Affiliation(s)
- Ardeshir Kianercy
- Information Sciences Institute, University of Southern California, Marina del Rey, California 90292, USA