1. Dulberg Z, Dubey R, Berwian IM, Cohen JD. Having multiple selves helps learning agents explore and adapt in complex changing worlds. Proc Natl Acad Sci U S A 2023; 120:e2221180120. [PMID: 37399387; PMCID: PMC10334746; DOI: 10.1073/pnas.2221180120]
Abstract
Satisfying a variety of conflicting needs in a changing environment is a fundamental challenge for any adaptive agent. Here, we show that designing an agent in a modular fashion, as a collection of subagents each dedicated to a separate need, powerfully enhanced the agent's capacity to satisfy its overall needs. We used the formalism of deep reinforcement learning to investigate a biologically relevant multiobjective task: continually maintaining homeostasis of a set of physiologic variables. We then conducted simulations in a variety of environments and compared how modular agents performed relative to standard monolithic agents (i.e., agents that aimed to satisfy all needs in an integrated manner using a single aggregate measure of success). Simulations revealed that modular agents a) exhibited a form of exploration that was intrinsic and emergent rather than extrinsically imposed; b) were robust to changes in nonstationary environments; and c) scaled gracefully in their ability to maintain homeostasis as the number of conflicting objectives increased. Supporting analysis suggested that the robustness to changing environments and to increasing numbers of needs was due to the intrinsic exploration and efficiency of representation afforded by the modular architecture. These results suggest that the normative principles by which agents have adapted to complex changing environments may also explain why humans have long been described as consisting of "multiple selves."
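The modular architecture lends itself to a compact illustration. The sketch below is a minimal tabular stand-in for the paper's setup, assuming a toy two-need environment and "greatest-mass" action selection (summing subagent Q-values before acting); the paper itself uses deep Q-networks and richer homeostatic dynamics, so every constant here is an illustrative assumption.

```python
import numpy as np

rng = np.random.default_rng(0)

N_NEEDS, N_ACTIONS, N_LEVELS = 2, 3, 10   # actions 0-1 replenish a need; 2 = wait

def step(levels, action):
    levels = levels - 1                              # metabolic drift
    if action < N_NEEDS:
        levels[action] = levels[action] + 3          # replenish one need
    levels = np.clip(levels, 0, N_LEVELS - 1)
    rewards = -(levels - (N_LEVELS - 1)) ** 2        # one reward per need
    return levels, rewards

# One Q-table per need (the modular agent); a monolithic agent would
# train a single table on rewards.sum() instead.
Q = np.zeros((N_NEEDS, N_LEVELS, N_LEVELS, N_ACTIONS))
alpha, gamma, eps = 0.1, 0.9, 0.1

levels = np.full(N_NEEDS, N_LEVELS - 1)
for t in range(20000):
    s0, s1 = levels
    # Greatest-mass selection: sum module Q-values, then act greedily.
    prefs = Q[:, s0, s1, :].sum(axis=0)
    a = rng.integers(N_ACTIONS) if rng.random() < eps else int(prefs.argmax())
    levels, rewards = step(levels, a)
    for m in range(N_NEEDS):                         # independent module updates
        target = rewards[m] + gamma * Q[m, levels[0], levels[1]].max()
        Q[m, s0, s1, a] += alpha * (target - Q[m, s0, s1, a])

print("final need levels:", levels)
```

Summing module values is only one simple aggregation rule; the paper's contribution is the comparison of such modular agents against monolithic baselines trained on an aggregate reward.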
Affiliation(s)
- Zack Dulberg, Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544
- Rachit Dubey, Department of Computer Science, Princeton University, Princeton, NJ 08544
- Isabel M. Berwian, Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544
- Jonathan D. Cohen, Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544
2.
Abstract
Learning from demonstration, or imitation learning, is the process of learning to act in an environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a specific form of learning from demonstration that attempts to estimate the reward function of a Markov decision process from examples provided by the teacher. The reward function is often considered the most succinct description of a task. In simple applications, the reward function may be known or easily derived from properties of the system and hard-coded into the learning process. However, in complex applications, this may not be possible, and it may be easier to learn the reward function by observing the actions of the teacher. This paper provides a comprehensive survey of the literature on IRL. The survey outlines the differences between IRL and two similar methods, apprenticeship learning and inverse optimal control; organizes the IRL literature based on the principal method; describes applications of IRL algorithms; and identifies areas for future research.
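To make the problem statement concrete, here is a minimal sketch of likelihood-based IRL on a five-state chain MDP with known dynamics: a simulated teacher acts via a softmax policy under a hidden reward, and a reward vector is recovered by hill-climbing the likelihood of its demonstrations. The chain MDP, the Boltzmann teacher, and the crude coordinate-ascent optimizer are all illustrative assumptions; the surveyed algorithms (max-margin, max-entropy, Bayesian IRL) are far more sophisticated.

```python
import numpy as np

rng = np.random.default_rng(1)
N_S, N_A, GAMMA = 5, 2, 0.9                # chain MDP; actions: 0 left, 1 right

def nxt(s, a):
    return max(s - 1, 0) if a == 0 else min(s + 1, N_S - 1)

def softmax_policy(reward, beta=5.0, iters=100):
    """Value iteration on a candidate reward, then a Boltzmann policy."""
    Q = np.zeros((N_S, N_A))
    for _ in range(iters):
        V = Q.max(axis=1)
        Q = np.array([[reward[nxt(s, a)] + GAMMA * V[nxt(s, a)]
                       for a in range(N_A)] for s in range(N_S)])
    P = np.exp(beta * (Q - Q.max(axis=1, keepdims=True)))
    return P / P.sum(axis=1, keepdims=True)

# The "teacher": a hidden reward at the right end of the chain.
true_reward = np.array([0., 0., 0., 0., 1.])
expert = softmax_policy(true_reward)
demos = [(int(s), int(rng.choice(N_A, p=expert[s])))
         for s in rng.integers(0, N_S, 500)]

def log_lik(reward):
    pi = softmax_policy(reward)
    return sum(np.log(pi[s, a]) for s, a in demos)

# Crude coordinate ascent with two-point numerical gradients; adequate
# only at this toy scale.
r_hat = np.zeros(N_S)
for _ in range(30):
    for i in range(N_S):
        up = log_lik(r_hat + 0.1 * np.eye(N_S)[i])
        dn = log_lik(r_hat - 0.1 * np.eye(N_S)[i])
        r_hat[i] += 0.05 * np.sign(up - dn)

print("recovered reward (shifted):", np.round(r_hat - r_hat.min(), 2))
```

Note that the recovered reward is only identified up to transformations that preserve the policy, which is one of the central ambiguities the IRL literature addresses.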
3. Arora S, Doshi P. A survey of inverse reinforcement learning: Challenges, methods and progress. Artif Intell 2021. [DOI: 10.1016/j.artint.2021.103500]
4. Ballard DH, Zhang R. The Hierarchical Evolution in Human Vision Modeling. Top Cogn Sci 2021; 13:309-328. [PMID: 33838010; PMCID: PMC9462461; DOI: 10.1111/tops.12527]
Abstract
Computational models of primate vision advanced significantly with David Marr's tripartite separation of the vision enterprise into problem formulation, algorithm, and neural implementation; however, many subsequent parallel developments in robotics and modeling greatly refined the algorithm descriptions into very distinct levels that complement each other. This review traces the time course of these developments and shows how the current perspective evolved to have its alternative internal hierarchical organization.
Affiliation(s)
- Dana H Ballard, Department of Computer Science, The University of Texas at Austin
- Ruohan Zhang, Department of Computer Science, The University of Texas at Austin
5. Muryy A, Siddharth N, Nardelli N, Glennerster A, Torr PHS. Lessons from reinforcement learning for biological representations of space. Vision Res 2020; 174:79-93. [PMID: 32683096; DOI: 10.1016/j.visres.2020.05.009]
Abstract
Neuroscientists postulate 3D representations in the brain in a variety of different coordinate frames (e.g. 'head-centred', 'hand-centred' and 'world-based'). Recent advances in reinforcement learning demonstrate a quite different approach that may provide a more promising model for biological representations underlying spatial perception and navigation. In this paper, we focus on reinforcement learning methods that reward an agent for arriving at a target image without any attempt to build up a 3D 'map'. We test the ability of this type of representation to support geometrically consistent spatial tasks such as interpolating between learned locations using decoding of feature vectors. We introduce a hand-crafted representation that has, by design, a high degree of geometric consistency and demonstrate that, in this case, information about the persistence of features as the camera translates (e.g. distant features persist) can improve performance on the geometric tasks. These examples avoid Cartesian (in this case, 2D) representations of space. Non-Cartesian, learned representations provide an important stimulus in neuroscience to the search for alternatives to a 'cognitive map'.
Affiliation(s)
- Alex Muryy, School of Psychology and Clinical Language Sciences, University of Reading, UK
- N Siddharth, Department of Engineering Science, University of Oxford, UK
- Andrew Glennerster, School of Psychology and Clinical Language Sciences, University of Reading, UK
6. Bhattacharyya R, Hazarika SM. A knowledge-driven layered inverse reinforcement learning approach for recognizing human intents. J Exp Theor Artif Intell 2020. [DOI: 10.1080/0952813x.2020.1718773]
Affiliation(s)
- R. Bhattacharyya, Computer Science and Engineering, Indian Institute of Information Technology Bhagalpur, Bihar, India
- S. M. Hazarika, Biomimetic Robotics and Artificial Intelligence Lab, Mechanical Engineering, Indian Institute of Technology Guwahati, Assam, India
7. Zhang R, Zhang S, Tong MH, Cui Y, Rothkopf CA, Ballard DH, Hayhoe MM. Modeling sensory-motor decisions in natural behavior. PLoS Comput Biol 2018; 14:e1006518. [PMID: 30359364; PMCID: PMC6219815; DOI: 10.1371/journal.pcbi.1006518]
Abstract
Although a standard reinforcement learning model can capture many aspects of reward-seeking behaviors, it may not be practical for modeling human natural behaviors because of the richness of dynamic environments and limitations in cognitive resources. We propose a modular reinforcement learning model that addresses these factors. Based on this model, a modular inverse reinforcement learning algorithm is developed to estimate both the rewards and discount factors from human behavioral data, which allows predictions of human navigation behaviors in virtual reality with high accuracy across different subjects and with different tasks. Complex human navigation trajectories in novel environments can be reproduced by an artificial agent that is based on the modular model. This model provides a strategy for estimating the subjective value of actions and how they influence sensory-motor decisions in natural behavior.
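A distinctive feature of this model is that each module carries its own discount factor as well as its own reward, and both are estimated from behavior. The sketch below illustrates the discount-estimation half of that idea under strong simplifying assumptions: one module, a five-state chain with a known reward, and a softmax behaver whose hidden discount is recovered by scanning candidates for the one that maximizes demonstration likelihood. The paper's algorithm jointly estimates rewards and discounts for several modules from human trajectories; everything here is a toy assumption.

```python
import numpy as np

rng = np.random.default_rng(2)
N_S, N_A = 5, 2

def soft_q_policy(reward, gamma, beta=5.0, iters=150):
    """Value iteration under a candidate discount, then a Boltzmann policy."""
    nxt = lambda s, a: max(s - 1, 0) if a == 0 else min(s + 1, N_S - 1)
    Q = np.zeros((N_S, N_A))
    for _ in range(iters):
        V = Q.max(axis=1)
        Q = np.array([[reward[nxt(s, a)] + gamma * V[nxt(s, a)]
                       for a in range(N_A)] for s in range(N_S)])
    P = np.exp(beta * (Q - Q.max(axis=1, keepdims=True)))
    return P / P.sum(axis=1, keepdims=True)

reward = np.array([0., 0., 0., 0., 1.])               # known (unlike the paper)
behaver = soft_q_policy(reward, gamma=0.6)            # hidden discount
demos = [(int(s), int(rng.choice(N_A, p=behaver[s])))
         for s in rng.integers(0, N_S, 300)]

# Scan candidate discounts; the best one maximizes demo likelihood,
# because the discount shapes how sharply far-from-goal choices differ.
candidates = np.linspace(0.1, 0.9, 17)
lls = []
for g in candidates:
    pi = soft_q_policy(reward, g)
    lls.append(sum(np.log(pi[s, a]) for s, a in demos))
print("recovered discount:", round(candidates[int(np.argmax(lls))], 2))
```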
Affiliation(s)
- Ruohan Zhang, Department of Computer Science, The University of Texas at Austin, Austin, TX, USA
- Shun Zhang, Computer Science and Engineering, University of Michigan, Ann Arbor, MI, USA
- Matthew H. Tong, Center for Perceptual Systems, The University of Texas at Austin, Austin, TX, USA
- Yuchen Cui, Department of Computer Science, The University of Texas at Austin, Austin, TX, USA
- Constantin A. Rothkopf, Cognitive Science Center and Institute of Psychology, Technical University Darmstadt, Darmstadt, Germany
- Dana H. Ballard, Department of Computer Science, The University of Texas at Austin, Austin, TX, USA
- Mary M. Hayhoe, Center for Perceptual Systems, The University of Texas at Austin, Austin, TX, USA
8. Hayhoe MM, Matthis JS. Control of gaze in natural environments: effects of rewards and costs, uncertainty and memory in target selection. Interface Focus 2018; 8:20180009. [PMID: 29951189; DOI: 10.1098/rsfs.2018.0009]
Abstract
The development of better eye and body tracking systems and of more flexible virtual environments has allowed more systematic exploration of natural vision and contributed a number of insights. In natural visually guided behaviour, humans make continuous sequences of sensory-motor decisions to satisfy current goals, and the role of vision is to provide the relevant information to achieve those goals. This paper reviews the factors that control gaze in natural visually guided actions such as locomotion, including the rewards and costs associated with the immediate behavioural goals, uncertainty about the state of the world, and prior knowledge of the environment. These general features of human gaze control may inform the development of artificial systems.
Affiliation(s)
- Mary M Hayhoe, Center for Perceptual Systems, University of Texas at Austin, Austin, TX, USA
9. Yamaguchi S, Naoki H, Ikeda M, Tsukada Y, Nakano S, Mori I, Ishii S. Identification of animal behavioral strategies by inverse reinforcement learning. PLoS Comput Biol 2018; 14:e1006122. [PMID: 29718905; PMCID: PMC5951592; DOI: 10.1371/journal.pcbi.1006122]
Abstract
Animals are able to reach a desired state in an environment by controlling various behavioral patterns. Identification of the behavioral strategy used for this control is important for understanding animals' decision-making and is fundamental to dissecting the information processing performed by the nervous system. However, methods for quantifying such behavioral strategies have not been fully established. In this study, we developed an inverse reinforcement-learning (IRL) framework to identify an animal's behavioral strategy from behavioral time-series data. We applied this framework to C. elegans thermotactic behavior; after cultivation at a constant temperature with or without food, fed worms prefer, while starved worms avoid, the cultivation temperature on a thermal gradient. Our IRL approach revealed that the fed worms used both the absolute temperature and its temporal derivative and that their behavior involved two strategies: directed migration (DM) and isothermal migration (IM). With DM, worms efficiently reached specific temperatures, which explains their thermotactic behavior when fed. With IM, worms moved along a constant temperature, reflecting the isothermal tracking well observed in previous studies. In contrast to fed animals, starved worms escaped the cultivation temperature using only the absolute temperature, not its temporal derivative. We also investigated the neural basis underlying these strategies by applying our method to thermosensory neuron-deficient worms. Our IRL-based approach is thus useful for identifying animal strategies from behavioral time-series data and could be applied to a wide range of behavioral studies, including studies of decision-making, in other organisms.

Understanding animal decision-making has been a fundamental problem in neuroscience and behavioral ecology. Many studies have analyzed the actions representing decision-making in behavioral tasks, in which rewards are artificially designed with specific objectives. However, such artificially designed experiments cannot be extended to natural environments, where the rewards for freely behaving animals cannot be clearly defined. We therefore sought to reverse the current paradigm so that rewards could be identified from behavioral data. Here, we propose a reverse-engineering approach (inverse reinforcement learning) that can estimate a behavioral strategy from time-series data of freely behaving animals. By applying this technique to C. elegans thermotaxis, we successfully identified the respective reward-based behavioral strategies.
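The core representational move is to express a strategy as a reward over sensory features. The sketch below makes that concrete with a feature vector of (closeness to the cultivation temperature, temporal derivative of temperature); the specific features and all weight values are invented for illustration, and the paper estimates such rewards from time-series data rather than positing them. Fed and starved "strategies" then become different weight vectors over the same features.

```python
import numpy as np

def reward(T, dT, w, T_cult=20.0):
    """Linear-in-features reward over temperature and its derivative."""
    phi = np.array([-(T - T_cult) ** 2,   # closeness to cultivation temperature
                    dT])                  # temporal derivative of temperature
    return float(w @ phi)

w_fed = np.array([1.0, 0.5])        # fed: seek T_cult, also use the gradient
w_starved = np.array([-1.0, 0.0])   # starved: avoid T_cult; derivative unused

for T in (17.0, 20.0, 23.0):
    print(f"T={T}: fed={reward(T, 0.0, w_fed):+.1f}  "
          f"starved={reward(T, 0.0, w_starved):+.1f}")
```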
Affiliation(s)
- Shoichiro Yamaguchi, Integrated Systems Biology Laboratory, Graduate School of Informatics, Kyoto University, Sakyo, Kyoto, Japan
- Honda Naoki, Laboratory of Theoretical Biology, Graduate School of Biostudies, Kyoto University, Yoshidakonoecho, Sakyo, Kyoto, Japan; Data-driven Modeling Team, Research Center for Dynamic Living Systems, Graduate School of Biostudies, Kyoto University, Yoshidakonoecho, Sakyo, Kyoto, Japan
- Muneki Ikeda, Group of Molecular Neurobiology, Graduate School of Science, Nagoya University, Furoucho, Chikusa, Nagoya, Aichi, Japan
- Yuki Tsukada, Group of Molecular Neurobiology, Graduate School of Science, Nagoya University, Furoucho, Chikusa, Nagoya, Aichi, Japan
- Shunji Nakano, Group of Molecular Neurobiology, Graduate School of Science, Nagoya University, Furoucho, Chikusa, Nagoya, Aichi, Japan
- Ikue Mori, Group of Molecular Neurobiology, Graduate School of Science, Nagoya University, Furoucho, Chikusa, Nagoya, Aichi, Japan
- Shin Ishii, Integrated Systems Biology Laboratory, Graduate School of Informatics, Kyoto University, Sakyo, Kyoto, Japan
10. Hayhoe MM. Davida Teller Award Lecture 2017: What can be learned from natural behavior? J Vis 2018; 18:10. [PMID: 29710300; PMCID: PMC5895074; DOI: 10.1167/18.4.10]
Abstract
The essentially active nature of vision has long been acknowledged but has been difficult to investigate because of limitations in the available instrumentation, both for measuring eye and body movements and for presenting realistic stimuli in the context of active behavior. These limitations have been substantially reduced in recent years, opening up a wider range of contexts where experimental control is possible. Given this, it is important to examine just what the benefits of exploring natural vision are, despite its attendant disadvantages. Work over the last two decades provides insights into these benefits. Natural behavior turns out to be a rich domain for investigation: it is remarkably stable, it opens up new questions, and the behavioral context helps specify the momentary visual computations and their temporal evolution.
Affiliation(s)
- Mary M Hayhoe, Center for Perceptual Systems, University of Texas at Austin, Austin, TX, USA
11. Muelling K, Boularias A, Mohler B, Schölkopf B, Peters J. Learning strategies in table tennis using inverse reinforcement learning. Biol Cybern 2014; 108:603-619. [PMID: 24756167; DOI: 10.1007/s00422-014-0599-1]
Abstract
Learning a complex task such as table tennis is a challenging problem for both robots and humans. Even after acquiring the necessary motor skills, a strategy is needed to choose where and how to return the ball to the opponent's court in order to win the game. The data-driven identification of basic strategies in interactive tasks, such as table tennis, is a largely unexplored problem. In this paper, we suggest a computational model for representing and inferring strategies, based on a Markov decision problem, in which the reward function models the goal of the task as well as the strategic information. We show how this reward function can be discovered from demonstrations of table tennis matches using model-free inverse reinforcement learning. The resulting framework makes it possible to identify the basic elements on which the selection of striking movements is based. We tested our approach on data collected from players with different playing styles and under different playing conditions. The estimated reward function captured expert-specific strategic information that sufficed to distinguish the expert among players with different skill levels as well as different playing styles.
Affiliation(s)
- Katharina Muelling, Max Planck Institute for Intelligent Systems, Spemannstr. 38, 72076 Tuebingen, Germany
12. Sullivan BT, Johnson L, Rothkopf CA, Ballard D, Hayhoe M. The role of uncertainty and reward on eye movements in a virtual driving task. J Vis 2012; 12:19. [PMID: 23262151; DOI: 10.1167/12.13.19]
Abstract
Eye movements during natural tasks are well coordinated with ongoing task demands, and many variables could influence gaze strategies. Sprague and Ballard (2003) proposed a gaze-scheduling model that uses a utility-weighted uncertainty metric to prioritize fixations on task-relevant objects, predicting that human gaze should be influenced by both reward structure and task-relevant uncertainties. To test this conjecture, we tracked the eye movements of participants in a simulated driving task in which uncertainty and implicit reward (via task priority) were varied. Participants were instructed to simultaneously perform a Follow Task, in which they followed a lead car at a specific distance, and a Speed Task, in which they drove at an exact speed. We varied implicit reward by instructing participants to emphasize one task over the other, and we varied uncertainty in the Speed Task by the presence or absence of uniform noise added to the car's velocity. Subjects' gaze data were classified for the image content near fixation and segmented into looks. Gaze measures, including look proportion, duration, and interlook interval, showed that drivers monitored the speedometer more closely when it had a high level of uncertainty, but only if it was also associated with high task priority or implicit reward. The observed interaction appears to be an example of a simple mechanism whereby the reduction of visual uncertainty is gated by behavioral relevance. This lends qualitative support to the primary variables controlling gaze allocation proposed in the Sprague and Ballard model.
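The gaze-scheduling logic under test reduces to a simple loop: each task's state uncertainty grows while unattended, and gaze goes to the task with the largest utility-weighted uncertainty. The sketch below captures that interaction with invented weights and noise levels; it is a caricature of the Sprague and Ballard model, not the authors' implementation, but it reproduces the qualitative prediction that looks concentrate on tasks that are both uncertain and important.

```python
import numpy as np

TASKS = ["follow", "speed"]
weight = np.array([1.0, 0.5])     # implicit reward: Follow Task emphasized
drift = np.array([0.05, 0.20])    # added velocity noise inflates speed-task drift

variance = np.ones(2)             # per-task state uncertainty
looks = np.zeros(2, dtype=int)
for t in range(1000):
    variance += drift                            # uncertainty grows while unattended
    target = int(np.argmax(weight * variance))   # utility-weighted uncertainty
    variance[target] *= 0.3                      # a fixation shrinks uncertainty
    looks[target] += 1

for name, n in zip(TASKS, looks):
    print(f"{name}: {n} looks")
```

Raising either a task's weight or its drift rate increases its share of looks, which mirrors the interaction of task priority and uncertainty reported in the study.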
Affiliation(s)
- Brian T Sullivan, Smith-Kettlewell Eye Research Institute, San Francisco, CA, USA