1
Mushtaq A, Haq IU, Sarwar MA, Khan A, Khalil W, Mughal MA. Multi-Agent Reinforcement Learning for Traffic Flow Management of Autonomous Vehicles. Sensors (Basel). 2023;23:2373. doi:10.3390/s23052373. PMID: 36904577; PMCID: PMC10007156.
Abstract
Intelligent traffic management systems have become one of the main applications of Intelligent Transportation Systems (ITS). There is growing interest in Reinforcement Learning (RL)-based control methods for ITS applications such as autonomous driving and traffic management. Deep learning helps approximate highly complex nonlinear functions from complicated data sets and tackle difficult control problems. In this paper, we propose an approach based on Multi-Agent Reinforcement Learning (MARL) and smart routing to improve the flow of autonomous vehicles on road networks. We evaluate Multi-Agent Advantage Actor-Critic (MA2C) and Independent Advantage Actor-Critic (IA2C), two recently proposed MARL techniques, combined with smart routing for traffic signal optimization to determine their potential. We investigate the framework offered by non-Markov decision processes, enabling a more in-depth understanding of the algorithms, and conduct a critical analysis of the method's robustness and effectiveness. Its efficacy and reliability are demonstrated through simulations in SUMO, a software tool for modeling traffic, on a road network containing seven intersections. Our findings show that MA2C, when trained on pseudo-random vehicle flows, is a viable methodology that outperforms competing techniques.
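The advantage actor-critic scheme that MA2C and IA2C build on can be sketched in a few lines. The fragment below is a generic, single-agent tabular illustration on a toy chain MDP (the environment and all names are hypothetical), not the paper's multi-agent, SUMO-based implementation:

```python
import numpy as np

# Tabular advantage actor-critic sketch on a toy 5-state chain MDP.
# Action 1 moves right, action 0 moves left; reward is earned at the
# rightmost state. MA2C/IA2C layer per-intersection agents (and, for
# MA2C, neighbor communication) on top of this basic update.
rng = np.random.default_rng(0)
n_states, n_actions, gamma = 5, 2, 0.99
theta = np.zeros((n_states, n_actions))  # actor: policy logits
V = np.zeros(n_states)                   # critic: state values

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

def step(s, a):
    s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    return s2, float(s2 == n_states - 1)

alpha_pi, alpha_v = 0.1, 0.2
for _ in range(2000):
    s = 0
    for _ in range(20):
        pi = softmax(theta[s])
        a = rng.choice(n_actions, p=pi)
        s2, r = step(s, a)
        adv = r + gamma * V[s2] - V[s]   # advantage estimate = TD error
        V[s] += alpha_v * adv            # critic update
        grad = -pi
        grad[a] += 1.0                   # d log pi(a|s) / d theta[s]
        theta[s] += alpha_pi * adv * grad  # advantage-weighted actor update
        s = s2
```

IA2C trains one such learner per agent independently, while MA2C additionally shares neighborhood information; both keep the advantage-weighted policy gradient at their core.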
Affiliation(s)
- Anum Mushtaq: Pakistan Institute of Engineering and Applied Sciences, Islamabad 44000, Pakistan
- Irfan Ul Haq: Pakistan Institute of Engineering and Applied Sciences, Islamabad 44000, Pakistan
- Asifullah Khan: Pakistan Institute of Engineering and Applied Sciences, Islamabad 44000, Pakistan; PIEAS Artificial Intelligence Center (PAIC), Islamabad 44000, Pakistan
- Wajeeha Khalil: Department of CS and IT, University of Engineering and Technology, Peshawar 25000, Pakistan
- Muhammad Abid Mughal: Pakistan Institute of Engineering and Applied Sciences, Islamabad 44000, Pakistan
2
Reinforcement Learning. Mach Learn. 2021. doi:10.1007/978-981-15-1967-3_16.
3
Abstract
Trial-to-trial variability in the execution of movements and motor skills is ubiquitous and widely considered to be the unwanted consequence of a noisy nervous system. However, recent studies have suggested that motor variability may also be a feature of how sensorimotor systems operate and learn. This view, rooted in reinforcement learning theory, equates motor variability with purposeful exploration of motor space that, when coupled with reinforcement, can drive motor learning. Here we review studies that explore the relationship between motor variability and motor learning in both humans and animal models. We discuss neural circuit mechanisms that underlie the generation and regulation of motor variability and consider the implications that this work has for our understanding of motor learning.
Affiliation(s)
- Ashesh K Dhawale: Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138; Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138
- Maurice A Smith: Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138; John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138
- Bence P Ölveczky: Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138; Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138
4
Kraemer L, Banerjee B. Multi-agent reinforcement learning as a rehearsal for decentralized planning. Neurocomputing. 2016. doi:10.1016/j.neucom.2016.01.031.
6
Mobbs D, Hagan CC, Dalgleish T, Silston B, Prévost C. The ecology of human fear: survival optimization and the nervous system. Front Neurosci. 2015;9:55. doi:10.3389/fnins.2015.00055. PMID: 25852451; PMCID: PMC4364301.
Abstract
We propose a Survival Optimization System (SOS) to account for the strategies that humans and other animals use to defend against recurring and novel threats. The SOS attempts to merge ecological models that define a repertoire of contextually relevant threat-induced survival behaviors with contemporary approaches to human affective science. We first propose that the goal of the nervous system is to reduce surprise and optimize actions by (i) predicting the sensory landscape by simulating possible encounters with threat and selecting the appropriate pre-encounter action, and (ii) prevention strategies in which the organism manufactures safe environments. When a potential threat is encountered, the (iii) threat orienting system is engaged to determine whether the organism ignores the stimulus or switches into a process of (iv) threat assessment, where the organism monitors the stimulus, weighs the threat value, predicts the actions of the threat, searches for safety, and guides behavioral actions crucial to directed escape. When under imminent attack, (v) defensive systems evoke fast reflexive indirect escape behaviors (i.e., fight or flight). This cascade of responses to threats of increasing magnitude is underwritten by an interconnected neural architecture that extends from cortical and hippocampal circuits to attention, action, and threat systems, including the amygdala, striatum, and hard-wired defensive systems in the midbrain. The SOS also includes a modulatory feature consisting of cognitive appraisal systems that flexibly guide perception, risk, and action. Moreover, personal and vicarious threat encounters fine-tune avoidance behaviors via model-based learning, with higher organisms bridging data to reduce face-to-face encounters with predators. Our model attempts to unify the divergent field of human affective science, proposing a highly integrated nervous system that has evolved to increase the organism's chances of survival.
Affiliation(s)
- Dean Mobbs: Department of Psychology, Columbia University, New York, NY, USA
- Cindy C Hagan: Department of Psychology, Columbia University, New York, NY, USA
- Tim Dalgleish: Medical Research Council, Cognition and Brain Sciences Unit, Cambridge, UK
- Brian Silston: Department of Psychology, Columbia University, New York, NY, USA
7
Combining Learning Algorithms: An Approach to Markov Decision Processes. Enterp Inf Syst. 2013. doi:10.1007/978-3-642-40654-6_11.
8
Learning domain structure through probabilistic policy reuse in reinforcement learning. Prog Artif Intell. 2012. doi:10.1007/s13748-012-0026-6.
9
Takahashi S, Takahashi Y, Maeda Y, Nakamura T. Kicking Motion Imitation of Inverted-Pendulum Mobile Robot and Development of Body Mapping from Human Demonstrator. J Adv Comput Intell Intell Inform. 2011. doi:10.20965/jaciii.2011.p1030.
Abstract
This paper proposes a new method for learning the dynamic motion of an inverted-pendulum mobile robot from observation of a human player's demonstration. First, an inverted-pendulum mobile robot with upper and lower body links observes the human demonstration with a camera and extracts the human region in the images. Second, the robot maps the region to its own two links and estimates link posture trajectories. The robot then starts learning to kick based on the trajectory parameters for imitation. Through this process, the robot can learn the dynamic kicking shown by a human. The mapping parameter plays an important role in successful imitation. A reasonable and feasible procedure of learning from observation for an inverted-pendulum robot is proposed, its learning performance is investigated, and the development of body mapping is then proposed and examined.
10
Takano T, Takase H, Kawanaka H, Tsuruoka S. Merging with Extraction Method for Transfer Learning in Actor-Critic. J Adv Comput Intell Intell Inform. 2011. doi:10.20965/jaciii.2011.p0814.
Abstract
This paper aims to accelerate the learning process of the actor-critic method, one of the major reinforcement learning algorithms, through transfer learning. Transfer learning accelerates learning on the target task by reusing knowledge of source policies from each source task. In general, it consists of a selection phase and a training phase: agents select source policies that are similar to the target one without trial and error, and train on the target task by referring to the selected policies. In this paper, we discuss the training phase; the rest of the training algorithm is based on our previous method. We propose an effective transfer method that consists of an extraction method and a merging method. Agents extract action preferences that are related to reliable states, and state values that lead to preferred states. Extracted parameters are merged into the current parameters by taking a weighted average. We apply the proposed algorithm to simple maze tasks and show the effectiveness of the proposed method: it reduces episodes by 16% and failures by 55% compared with learning without transfer.
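The merging step described above, weighted averaging of extracted parameters into the current ones, can be sketched as follows. The weight `w` and the reliability mask are assumptions for illustration; the paper's actual extraction criteria are not reproduced here:

```python
import numpy as np

# Illustrative sketch of merging extracted source-policy parameters into
# the current ones by weighted average, restricted to states judged
# reliable. Function name and arguments are hypothetical.
def merge_parameters(current, extracted, reliable_mask, w=0.5):
    """Blend extracted source parameters into the current parameters.

    current, extracted : arrays of action preferences or state values
    reliable_mask      : boolean array marking the states the transfer touches
    w                  : weight given to the extracted knowledge
    """
    merged = current.copy()
    merged[reliable_mask] = (1 - w) * current[reliable_mask] \
                          + w * extracted[reliable_mask]
    return merged

current = np.zeros(5)                          # untrained target values
extracted = np.array([1.0, 2.0, 3.0, 4.0, 5.0])  # knowledge from a source task
mask = np.array([True, True, False, False, True])
merged = merge_parameters(current, extracted, mask)
# Entries outside the mask are left untouched.
```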
12
Tamura Y, Takahashi Y, Asada M. Observed Body Clustering for Imitation Based on Value System. J Adv Comput Intell Intell Inform. 2010. doi:10.20965/jaciii.2010.p0802.
Abstract
In order to develop skills, actions, and behavior in a human symbiotic environment, a robot must learn from observing the behavior of predecessors or humans. Recently, robotic imitation methods based on many approaches have been proposed. We have proposed reinforcement learning based approaches to imitation and investigated them under the assumption that an observer recognizes the body parts of the performer and maps them to its own. However, this assumption is not always applicable because of physical differences between the performer and the observer. In order to learn various behaviors from observation, the robot has to cluster the observed body area of the performer on the camera image, map the clustered parts to its own body parts based on a criterion that is reasonable for itself, and feed the data back into the imitation. This paper shows that the robot can cluster the body area on the camera image into its own body parts based on state value estimation in a reinforcement learning framework, while it imitates the observed behavior based on the same state value estimation. Clustering parameters are updated based on the temporal difference error, analogously to how the parameters of the state value function of the behavior are updated. The validity of the proposed method is investigated by applying it to the imitation of a dynamic throwing motion by an inverted pendulum robot and a human.
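The temporal-difference error that drives both the value estimation and, by analogy, the clustering updates is delta = r + gamma·V(s') − V(s). Below is a minimal TD(0) sketch on a hypothetical two-state loop, not the authors' robot setup:

```python
import numpy as np

# TD(0) value estimation on a toy 2-state loop. The abstract's clustering
# parameters are updated from this same temporal-difference error, so the
# core quantity is delta = r + gamma * V[s'] - V[s].
gamma, alpha = 0.9, 0.1
V = np.zeros(2)
transitions = [(0, 1, 0.0), (1, 0, 1.0)]  # (s, s', reward), visited in a loop

for _ in range(500):
    for s, s2, r in transitions:
        delta = r + gamma * V[s2] - V[s]  # TD error
        V[s] += alpha * delta             # value update; clustering
                                          # parameters follow the same error

# Fixed point satisfies V[0] = gamma * V[1] and V[1] = 1 + gamma * V[0].
```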
14
Kartoun U, Stern H, Edan Y. A Human-Robot Collaborative Reinforcement Learning Algorithm. J Intell Robot Syst. 2010. doi:10.1007/s10846-010-9422-y.
15
Kormushev PS, Nomoto K, Dong F, Hirota K. Eligibility Propagation to Speed up Time Hopping for Reinforcement Learning. J Adv Comput Intell Intell Inform. 2009. doi:10.20965/jaciii.2009.p0600.
Abstract
A mechanism called Eligibility Propagation is proposed to speed up the Time Hopping technique used for faster Reinforcement Learning in simulations. Eligibility Propagation provides Time Hopping with abilities similar to those that eligibility traces provide for conventional Reinforcement Learning: it propagates values from one state to all of its temporal predecessors using a state transitions graph. Experiments on a simulated biped crawling robot confirm that Eligibility Propagation accelerates the learning process more than 3 times.
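One plausible reading of the mechanism can be sketched as follows: after a state's value changes, the change is pushed backward through a recorded state-transitions graph to all temporal predecessors, discounted at each hop. The names and the decay scheme below are assumptions for illustration, not the paper's algorithm:

```python
from collections import defaultdict, deque

# Sketch of propagating a value change to temporal predecessors via a
# state-transitions graph, breadth-first, with per-hop discounting.
gamma = 0.9
predecessors = defaultdict(list)  # state -> states observed to precede it

def record_transition(s, s2):
    predecessors[s2].append(s)

def propagate(values, state, delta, decay=gamma, min_delta=1e-4):
    """Push a value change `delta` at `state` back to its predecessors."""
    queue = deque([(state, delta)])
    seen = {state}
    while queue:
        s2, d = queue.popleft()
        for s in predecessors[s2]:
            if s not in seen and abs(decay * d) > min_delta:
                seen.add(s)
                values[s] += decay * d       # discounted share of the change
                queue.append((s, decay * d))

# Chain 0 -> 1 -> 2; a reward update at state 2 reaches 1 and then 0.
values = {0: 0.0, 1: 0.0, 2: 0.0}
record_transition(0, 1)
record_transition(1, 2)
values[2] += 1.0
propagate(values, 2, 1.0)  # values[1] becomes ~0.9, values[0] ~0.81
```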
17
Busoniu L, Babuska R, De Schutter B. A Comprehensive Survey of Multiagent Reinforcement Learning. IEEE Trans Syst Man Cybern C Appl Rev. 2008. doi:10.1109/tsmcc.2007.913919.