Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Matsuo Y, LeCun Y, Sahani M, Precup D, Silver D, Sugiyama M, Uchibe E, Morimoto J. Deep learning, reinforcement learning, and world models. Neural Netw 2022;152:267-275. [DOI: 10.1016/j.neunet.2022.03.037] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 02/19/2022] [Accepted: 03/28/2022] [Indexed: 12/01/2022]

Manifold-based multi-objective policy search with sample reuse. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.11.094] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Importance sampling policy gradient algorithms in reproducing kernel Hilbert space. Artif Intell Rev 2017. [DOI: 10.1007/s10462-017-9579-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Wang J, Uchibe E, Doya K. Adaptive Baseline Enhances EM-Based Policy Search: Validation in a View-Based Positioning Task of a Smartphone Balancer. Front Neurorobot 2017;11:1. [PMID: 28167910 PMCID: PMC5256123 DOI: 10.3389/fnbot.2017.00001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Accepted: 01/03/2017] [Indexed: 11/28/2022] Open

Model-based reinforcement learning with dimension reduction. Neural Netw 2016;84:1-16. [DOI: 10.1016/j.neunet.2016.08.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Revised: 08/02/2016] [Accepted: 08/16/2016] [Indexed: 11/23/2022]

Hwangbo J, Gehring C, Sommer H, Siegwart R, Buchli J. Policy Learning with an Efficient Black-Box Optimization Algorithm. INT J HUM ROBOT 2015. [DOI: 10.1142/s0219843615500292] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation. Neural Netw 2014;57:128-40. [PMID: 24995917 DOI: 10.1016/j.neunet.2014.06.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Revised: 04/17/2014] [Accepted: 06/11/2014] [Indexed: 11/23/2022]

Sugiyama M, Yamada M, du Plessis MC. Learning under nonstationarity: covariate shift and class-balance change. ACTA ACUST UNITED AC 2013. [DOI: 10.1002/wics.1275] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]