1. Liu X, Zhang T, Liu M. Joint estimation of pose, depth, and optical flow with a competition-cooperation transformer network. Neural Netw 2024;171:263-275. PMID: 38103436; DOI: 10.1016/j.neunet.2023.12.020.
Abstract
Estimating depth, ego-motion, and optical flow from consecutive frames is a critical task in robot navigation and has received significant attention in recent years. In this study, we propose PDF-Former, an unsupervised joint estimation network comprising a fully transformer-based framework together with a competition-cooperation mechanism. The transformer framework captures global feature dependencies and is customized for the different task types, thereby improving performance on sequential tasks. The competition and cooperation mechanisms enable the network to obtain additional supervisory information at different training stages. Specifically, the competition mechanism is applied early in training to iteratively optimize, in a competitive manner, the estimates of 6-DOF poses (rotation and translation from the target image to the two reference images), the depth of the target image, and optical flow (from the target image to the two reference images). In contrast, the cooperation mechanism is applied later in training to pass results among the three networks so that they mutually refine one another's estimates. We conducted experiments on the KITTI dataset, and the results indicate that PDF-Former has significant potential to enhance the accuracy and robustness of sequential tasks in robot navigation.
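The two-phase schedule described in the abstract maps naturally onto a phase-dependent loss. Below is a minimal PyTorch sketch of that idea: in the competition phase each network minimizes only its own objective, and in the cooperation phase a cross-task consistency term (predicted flow versus the rigid flow implied by pose and depth, a standard construction in unsupervised depth/ego-motion work) couples the three estimates. The switch epoch, the loss composition, and the helper names are illustrative assumptions, not the paper's exact formulation.

```python
import torch

def rigid_flow(depth, R, t, K, K_inv):
    """Flow implied by depth and a 6-DOF pose: back-project pixels using depth,
    apply the rigid motion (R, t), re-project, and subtract the pixel grid."""
    B, H, W = depth.shape
    ys, xs = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    pix = torch.stack([xs, ys, torch.ones_like(xs)]).float().reshape(3, -1)
    cam = (K_inv @ pix) * depth.reshape(B, 1, -1)      # 3-D points, camera frame
    cam2 = R @ cam + t.reshape(B, 3, 1)                # points after rigid motion
    pix2 = K @ cam2
    pix2 = pix2[:, :2] / pix2[:, 2:].clamp(min=1e-6)   # perspective divide
    return (pix2 - pix[:2]).reshape(B, 2, H, W)

def total_loss(pose_loss, depth_loss, flow_loss, flow_pred,
               flow_from_pose_depth, epoch, switch_epoch=10):
    # Competition phase: each network minimizes only its own objective,
    # so the three estimators improve independently.
    loss = pose_loss + depth_loss + flow_loss
    if epoch >= switch_epoch:
        # Cooperation phase: results are exchanged, and a consistency term
        # between the predicted flow and the flow implied by pose + depth
        # lets each network refine the others.
        loss = loss + (flow_pred - flow_from_pose_depth).abs().mean()
    return loss
```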
Affiliation(s)
- Xiaochen Liu
- School of Instrument Science & Engineering, Southeast University, Nanjing, 210096, China; Key Laboratory of Micro-Inertial Instrument and Advanced Navigation Technology, Ministry of Education, Southeast University, Nanjing, 210096, Jiangsu, China
- Tao Zhang
- School of Instrument Science & Engineering, Southeast University, Nanjing, 210096, China; Key Laboratory of Micro-Inertial Instrument and Advanced Navigation Technology, Ministry of Education, Southeast University, Nanjing, 210096, Jiangsu, China
- Mingming Liu
- Department of Orthopedic Surgery, The Second People's Hospital of Lianyungang, Lianyungang, 222003, Jiangsu, China; Department of Orthopedic Surgery, The First People's Hospital of Xining, Xining, 810000, Qinghai, China
2. Blackwell KT, Doya K. Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks. PLoS Comput Biol 2023;19:e1011385. PMID: 37594982; PMCID: PMC10479916; DOI: 10.1371/journal.pcbi.1011385.
Abstract
A major advance in understanding learning behavior stems from experiments showing that reward learning requires dopamine inputs to striatal neurons and arises from synaptic plasticity of cortico-striatal synapses. Numerous reinforcement learning models mimic this dopamine-dependent synaptic plasticity by using the reward prediction error, which resembles dopamine neuron firing, to learn the best action in response to a set of cues. Though these models can explain many facets of behavior, reproducing some types of goal-directed behavior, such as renewal and reversal, requires additional model components. Here we present a reinforcement learning model, TD2Q, which better corresponds to the basal ganglia, with two Q matrices, one representing direct-pathway neurons (G) and another representing indirect-pathway neurons (N). Unlike previous two-Q architectures, a novel and critical aspect of TD2Q is that the G and N matrices are updated using the temporal-difference reward prediction error. A best action is selected for N and G using a softmax with a reward-dependent adaptive exploration parameter, and differences are then resolved by a second selection step applied to the two action probabilities. The model is tested on a range of multi-step tasks, including extinction, renewal, and discrimination; switching-reward-probability learning; and sequence learning. Simulations show that TD2Q produces behaviors similar to rodents' in choice and sequence-learning tasks, and that use of the temporal-difference reward prediction error is required to learn multi-step tasks. Blocking the update rule on the N matrix blocks discrimination learning, as observed experimentally. Performance on the sequence-learning task is dramatically improved with two matrices. These results suggest that including additional aspects of basal ganglia physiology can improve the performance of reinforcement learning models, better reproduce animal behaviors, and provide insight into the roles of direct- and indirect-pathway striatal neurons.
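The abstract describes the algorithm precisely enough to sketch its core loop: two Q matrices updated from the same temporal-difference reward prediction error, plus a two-stage action selection that combines the softmax policies of the two pathways. The sketch below follows that description; the sign conventions, the opponent update on N, and the rule for resolving the two action distributions are illustrative assumptions, not the authors' exact equations.

```python
import numpy as np

class TD2QSketch:
    def __init__(self, n_states, n_actions, alpha=0.1, gamma=0.9):
        self.G = np.zeros((n_states, n_actions))  # direct pathway ("Go")
        self.N = np.zeros((n_states, n_actions))  # indirect pathway ("NoGo")
        self.alpha, self.gamma = alpha, gamma
        self.beta = 1.0  # inverse temperature; assumed adapted from recent reward

    def _softmax(self, q):
        z = np.exp(self.beta * (q - q.max()))
        return z / z.sum()

    def act(self, s):
        p_g = self._softmax(self.G[s])    # first stage: each matrix proposes
        p_n = self._softmax(-self.N[s])   # N scores suppress actions (assumption)
        p = p_g * p_n                     # second stage: resolve the two proposals
        p /= p.sum()
        return np.random.choice(len(p), p=p)

    def update(self, s, a, r, s_next):
        # Both matrices learn from the same TD reward prediction error.
        delta = r + self.gamma * self.G[s_next].max() - self.G[s, a]
        self.G[s, a] += self.alpha * delta
        self.N[s, a] -= self.alpha * delta  # opponent-sign update (assumption)
```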
Affiliation(s)
- Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason University, Fairfax, Virginia, United States of America
- Kenji Doya
- Neural Computation Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
3. Tsantekidis A, Passalis N, Tefas A. Modeling limit order trading with a continuous action policy for deep reinforcement learning. Neural Netw 2023;165:506-515. PMID: 37348431; DOI: 10.1016/j.neunet.2023.05.051.
Abstract
Limit orders allow buyers and sellers to set a "limit price" they are willing to accept in a trade. Market orders, on the other hand, allow immediate execution at any price. Thus, market orders are susceptible to slippage: the additional cost incurred due to the unfavorable execution of a trade order. As a result, limit orders are often preferred, since they protect traders from excessive slippage costs caused by larger-than-expected price fluctuations. Despite their price guarantees, limit orders are more complex than market orders: orders with overly optimistic limit prices might never be executed, which increases the risk of employing limit orders in machine learning (ML)-based trading systems. Indeed, the current ML literature for trading relies almost exclusively on market orders. To overcome this limitation, a deep reinforcement learning (DRL) approach is proposed to model trading agents that use limit orders. The proposed method (a) employs a continuous probability distribution to model limit prices, and (b) provides the ability to place market orders when the risk of no execution outweighs the cost of slippage. Extensive experiments are conducted on multiple currency pairs using hourly price intervals, validating the effectiveness of the proposed method and paving the way for limit-order modeling in DRL-based trading.
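A policy with the two properties the abstract names, (a) a continuous distribution over limit prices and (b) an explicit option to fall back to a market order, can be sketched with one Gaussian head and one Bernoulli head. The network shape, the Gaussian parameterization, and all names below are assumptions for illustration, not the paper's architecture.

```python
import torch
import torch.nn as nn

class LimitOrderPolicy(nn.Module):
    def __init__(self, obs_dim, hidden=64):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(obs_dim, hidden), nn.Tanh())
        self.mu = nn.Linear(hidden, 1)            # mean limit-price offset from mid-price
        self.log_std = nn.Parameter(torch.zeros(1))
        self.market_logit = nn.Linear(hidden, 1)  # probability of a market order instead

    def forward(self, obs):
        h = self.body(obs)
        price_dist = torch.distributions.Normal(self.mu(h), self.log_std.exp())
        market_dist = torch.distributions.Bernoulli(logits=self.market_logit(h))
        return price_dist, market_dist

# Usage: sample an action and its log-probability for a policy-gradient update.
policy = LimitOrderPolicy(obs_dim=10)
obs = torch.randn(1, 10)
price_dist, market_dist = policy(obs)
offset = price_dist.sample()        # continuous limit-price offset
use_market = market_dist.sample()   # 1 -> market order, 0 -> limit order
log_prob = price_dist.log_prob(offset) + market_dist.log_prob(use_market)
```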
Affiliation(s)
- Avraam Tsantekidis
- School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece.
- Nikolaos Passalis
- School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece.
- Anastasios Tefas
- School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece.
4. Dulberg Z, Dubey R, Berwian IM, Cohen JD. Having multiple selves helps learning agents explore and adapt in complex changing worlds. Proc Natl Acad Sci U S A 2023;120:e2221180120. PMID: 37399387; PMCID: PMC10334746; DOI: 10.1073/pnas.2221180120.
Abstract
Satisfying a variety of conflicting needs in a changing environment is a fundamental challenge for any adaptive agent. Here, we show that designing an agent in a modular fashion, as a collection of subagents each dedicated to a separate need, powerfully enhances the agent's capacity to satisfy its overall needs. We used the formalism of deep reinforcement learning to investigate a biologically relevant multiobjective task: continually maintaining homeostasis of a set of physiologic variables. We then conducted simulations in a variety of environments and compared how modular agents performed relative to standard monolithic agents (i.e., agents that aimed to satisfy all needs in an integrated manner using a single aggregate measure of success). Simulations revealed that modular agents a) exhibited a form of exploration that was intrinsic and emergent rather than extrinsically imposed; b) were robust to changes in nonstationary environments; and c) scaled gracefully in their ability to maintain homeostasis as the number of conflicting objectives increased. Supporting analysis suggested that the robustness to changing environments and to increasing numbers of needs was due to the intrinsic exploration and the efficiency of representation afforded by the modular architecture. These results suggest that the normative principles by which agents have adapted to complex changing environments may also explain why humans have long been described as consisting of "multiple selves."
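A minimal tabular sketch of the modular-versus-monolithic contrast: the monolithic agent learns one value table over an aggregate reward, while the modular agent learns one table per need, each trained only on its own reward, with actions chosen by pooling the sub-agents' preferences. The pooling rule (summing Q-values before a softmax) is one simple assumption; the paper's deep-RL architecture and selection mechanism differ.

```python
import numpy as np

n_states, n_actions, n_needs = 16, 4, 3
rng = np.random.default_rng(0)

# Monolithic: one table, trained on the sum of all need-specific rewards.
Q_mono = np.zeros((n_states, n_actions))

# Modular: one "sub-agent" table per need, each trained only on its own reward.
Q_mod = np.zeros((n_needs, n_states, n_actions))

def monolithic_update(s, a, rewards, s2, alpha=0.1, gamma=0.9):
    target = sum(rewards) + gamma * Q_mono[s2].max()
    Q_mono[s, a] += alpha * (target - Q_mono[s, a])

def modular_update(s, a, rewards, s2, alpha=0.1, gamma=0.9):
    for k in range(n_needs):  # each sub-agent sees only its own need's reward
        target = rewards[k] + gamma * Q_mod[k, s2].max()
        Q_mod[k, s, a] += alpha * (target - Q_mod[k, s, a])

def modular_action(s, beta=1.0):
    # Sub-agents vote by summing their Q-values; disagreement flattens the
    # softmax, giving the emergent exploration the abstract describes.
    prefs = Q_mod[:, s, :].sum(axis=0)
    p = np.exp(beta * (prefs - prefs.max()))
    p /= p.sum()
    return rng.choice(n_actions, p=p)
```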
Affiliation(s)
- Zack Dulberg
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544
- Rachit Dubey
- Department of Computer Science, Princeton University, Princeton, NJ 08544
- Isabel M. Berwian
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544
- Jonathan D. Cohen
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544
5. Short WD, Olutoye OO, Padon BW, Parikh UM, Colchado D, Vangapandu H, Shams S, Chi T, Jung JP, Balaji S. Advances in non-invasive biosensing measures to monitor wound healing progression. Front Bioeng Biotechnol 2022;10:952198. PMID: 36213059; PMCID: PMC9539744; DOI: 10.3389/fbioe.2022.952198.
Abstract
Impaired wound healing is a significant financial and medical burden. The synthesis and deposition of extracellular matrix (ECM) in a new wound is a dynamic process, constantly changing and adapting to biochemical and biomechanical signaling from the extracellular microenvironments of the wound. This drives either a regenerative or a fibrotic, scar-forming healing outcome. Disruptions in ECM deposition, structure, and composition lead to impaired healing in diseased states, such as diabetes. Valid measures of the principal determinants of successful ECM deposition and wound healing include lack of bacterial contamination, good tissue perfusion, and reduced mechanical injury and strain. These measures are used by wound-care providers to intervene upon the healing wound, steering healing toward a more functional phenotype with improved structural integrity and better outcomes, and preventing adverse developments. In this review, we discuss bioengineering advances in 1) non-invasive detection of biologic and physiologic factors of the healing wound, 2) visualizing and modeling the ECM, and 3) computational tools that efficiently evaluate the complex data acquired from wounds in basic science, preclinical, translational, and clinical studies, allowing us to prognosticate healing outcomes and intervene effectively. We focus on bioelectronics and biologic interfaces of the sensors and actuators for real-time biosensing and actuation of the tissues. We also discuss high-resolution, advanced imaging techniques, which go beyond traditional confocal and fluorescence microscopy to visualize microscopic details of the composition of the wound matrix, the linearity of collagen, and live tracking of components within the wound microenvironment. Finally, we discuss computational modeling of the wound matrix, including partial differential equation-based models as well as machine learning models, which can serve as powerful tools to guide physicians' decision-making.
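As an illustration of the PDE-based wound-matrix modeling the review surveys, the sketch below integrates a one-dimensional Fisher-KPP reaction-diffusion equation, a classic starting point for wound-closure simulations in which cells both migrate into and proliferate within the denuded region. The parameters and geometry are generic placeholders, not values taken from any study in the review.

```python
import numpy as np

# 1-D Fisher-KPP model: du/dt = D * d2u/dx2 + r * u * (1 - u), where u is a
# normalized cell density. Diffusion models migration; the logistic term
# models proliferation into the cell-free "wound" region.
D, r = 0.01, 1.0            # generic diffusivity and proliferation rate
N, L = 100, 1.0             # grid points and domain length
dx, dt, steps = L / (N - 1), 1e-4, 50_000

u = np.ones(N)
u[N // 3 : 2 * N // 3] = 0.0   # the wound: a cell-free central region

for _ in range(steps):
    lap = (np.roll(u, 1) - 2 * u + np.roll(u, -1)) / dx**2
    lap[0] = lap[-1] = 0.0     # crude no-flux boundaries
    u = u + dt * (D * lap + r * u * (1.0 - u))

print(f"residual wound area: {np.sum(1.0 - u) * dx:.3f}")
```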
Affiliation(s)
- Walker D. Short
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Oluyinka O. Olutoye
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Benjamin W. Padon
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Umang M. Parikh
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Daniel Colchado
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Hima Vangapandu
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Shayan Shams
- Department of Applied Data Science, San Jose State University, San Jose, CA, United States
- School of Biomedical Informatics, University of Texas Health Science Center, Houston, TX, United States
- Taiyun Chi
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, United States
- Jangwook P. Jung
- Department of Biological Engineering, Louisiana State University, Baton Rouge, LA, United States
- Swathi Balaji
- Laboratory for Regenerative Tissue Repair, Division of Pediatric Surgery, Department of Surgery, Texas Children’s Hospital and Baylor College of Medicine, Houston, TX, United States
- Correspondence: Swathi Balaji
6. Mobile robot application with hierarchical start position DQN. Comput Intell Neurosci 2022;2022:4115767. PMID: 36105641; PMCID: PMC9467786; DOI: 10.1155/2022/4115767.
Abstract
Advances in deep learning have significantly affected reinforcement learning, leading to the emergence of deep RL (DRL). DRL does not need a data set and has the potential to exceed the performance of human experts, resulting in significant developments in the field of artificial intelligence. However, because a DRL agent must interact with the environment extensively while it is trained, it is difficult to train directly in a real environment owing to the long training time, high cost, and risk of material damage. Therefore, most or all of the training of DRL agents for real-world applications is conducted in virtual environments. This study focuses on the difficulty a mobile robot faces in reaching its target by planning a path in a real-world environment. The Minimalistic Gridworld virtual environment was used to train the DRL agent, and to our knowledge, this is the first real-world implementation for this environment. A DRL algorithm with higher performance than the classical deep Q-network (DQN) algorithm was created with the expanded environment. A mobile robot was designed for use in a real-world application. To match the virtual environment with the real environment, algorithms were created to detect the positions of the mobile robot and the target, as well as the rotation of the mobile robot. As a result, a DRL-based mobile robot was developed that uses only the top view of the environment and can reach its target regardless of its initial position and rotation.
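For reference, the core DQN update the study builds on can be sketched in a few lines of PyTorch. The network size, replay settings, and dummy transitions below are generic placeholders; the paper's hierarchical start-position variant modifies how training episodes are initialized (as the title suggests), not the update itself.

```python
import random
from collections import deque
import torch
import torch.nn as nn

gamma = 0.99
q_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 3))
target_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 3))
target_net.load_state_dict(q_net.state_dict())
opt = torch.optim.Adam(q_net.parameters(), lr=1e-3)
replay = deque(maxlen=10_000)

def dqn_update(batch_size=32):
    if len(replay) < batch_size:
        return
    s, a, r, s2, done = map(torch.stack, zip(*random.sample(list(replay), batch_size)))
    q = q_net(s).gather(1, a.long().unsqueeze(1)).squeeze(1)
    with torch.no_grad():  # bootstrapped target from the frozen target network
        target = r + gamma * (1.0 - done) * target_net(s2).max(dim=1).values
    loss = nn.functional.mse_loss(q, target)
    opt.zero_grad()
    loss.backward()
    opt.step()

# Fill the buffer with dummy (state, action, reward, next_state, done)
# transitions in place of real environment steps, then run one update.
for _ in range(64):
    replay.append((torch.randn(4), torch.tensor(0), torch.tensor(1.0),
                   torch.randn(4), torch.tensor(0.0)))
dqn_update()
```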
7. Yuan Y, Hua L, Cheng Y, Li J, Sang X, Zhang L, Wei W. A novel model-based reinforcement learning algorithm for solving the problem of unbalanced reward. J Intell Fuzzy Syst 2022. DOI: 10.3233/jifs-210956.
Abstract
Reinforcement learning algorithms driven by reward signals can be used to solve sequential learning problems. In practice, however, they still suffer from the problem of reward imbalance, which limits their use in many contexts. To solve this unbalanced-reward problem, we propose a novel model-based reinforcement learning algorithm called expected n-step value iteration (EnVI). Unlike traditional model-based reinforcement learning algorithms, the proposed method uses a new return function that changes the discounting of future rewards while reducing the influence of the current reward. We evaluated the performance of the proposed algorithm on a Treasure-Hunting game and a Hill-Walking game. The results demonstrate that the proposed algorithm reduces the negative impact of unbalanced rewards and greatly improves the performance of traditional reinforcement learning algorithms.
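The abstract does not give EnVI's exact return function, but its description, conventional discounting of future rewards combined with a reduced weight on the current reward, can be sketched as follows. The weighting scheme (a factor w0 < 1 on the immediate reward) is an illustrative assumption, not the paper's definition.

```python
def reweighted_n_step_return(rewards, bootstrap_value, gamma=0.9, w0=0.5):
    """rewards: the next n rewards r_t..r_{t+n-1}; bootstrap_value: V(s_{t+n})."""
    g = w0 * rewards[0]                   # reduced influence of the current reward
    for k in range(1, len(rewards)):
        g += gamma ** k * rewards[k]      # conventional discounting afterwards
    return g + gamma ** len(rewards) * bootstrap_value

# Example: a three-step lookahead from a model rollout.
print(reweighted_n_step_return([1.0, 0.0, 2.0], bootstrap_value=0.5))
```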
Affiliation(s)
- Yinlong Yuan
- College of Electrical Engineering, Nantong University, Nantong, China
- Liang Hua
- College of Electrical Engineering, Nantong University, Nantong, China
- Yun Cheng
- College of Electrical Engineering, Nantong University, Nantong, China
- Junhong Li
- College of Electrical Engineering, Nantong University, Nantong, China
- Xiaohu Sang
- College of Electrical Engineering, Nantong University, Nantong, China
- Lei Zhang
- College of Electrical Engineering, Nantong University, Nantong, China
- Wu Wei
- College of Automation Science and Engineering, South China University of Technology, Guangzhou, China
8. Application of an adapted FMEA framework for robot-inclusivity of built environments. Sci Rep 2022;12:3408. PMID: 35233018; PMCID: PMC8888750; DOI: 10.1038/s41598-022-06902-4.
Abstract
Mobile robots are being deployed in the built environment at increasing rates. However, a lack of consideration for robot-inclusive planning has led to physical spaces that can pose hazards to robots and contribute to an overall productivity decline for mobile service robots. This research proposes an adapted Failure Mode and Effects Analysis (FMEA) as a structured tool to evaluate a building's level of robot-inclusivity and safety for service robot deployments. This Robot-Inclusive FMEA (RIFMEA) framework is used to identify failures in the built environment that compromise the workflow of service robots, assess their effects and causes, and provide recommended actions to alleviate these problems. The method was supported by a case study of deploying telepresence robots on a university campus. The study concluded that common failures were related to poor furniture design, a lack of clearance and hazard indicators, and sub-optimal interior planning.
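For readers unfamiliar with FMEA, the scoring the RIFMEA framework adapts can be sketched briefly: each failure mode is rated for severity, occurrence, and detection, and the product of the three, the risk priority number (RPN), ranks which failures to address first. The example failure modes and ratings below are illustrative, not the paper's actual case-study values.

```python
from dataclasses import dataclass

@dataclass
class FailureMode:
    description: str
    severity: int     # impact on the robot's workflow if the failure occurs (1-10)
    occurrence: int   # how often the failure is expected (1-10)
    detection: int    # how hard the failure is to catch before deployment (1-10)

    @property
    def rpn(self) -> int:
        # Risk priority number: the standard FMEA ranking score.
        return self.severity * self.occurrence * self.detection

# Hypothetical built-environment failure modes for a service robot.
modes = [
    FailureMode("chair legs below the lidar scan plane", 7, 6, 5),
    FailureMode("glass wall invisible to the depth camera", 8, 4, 7),
    FailureMode("threshold strip exceeds wheel clearance", 6, 3, 3),
]

# Rank failure modes so the highest-risk items are addressed first.
for m in sorted(modes, key=lambda m: m.rpn, reverse=True):
    print(f"RPN {m.rpn:3d}  {m.description}")
```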