• Reference Citation Analysis
Cited by Other Article(s)
1. Matsuo Y, LeCun Y, Sahani M, Precup D, Silver D, Sugiyama M, Uchibe E, Morimoto J. Deep learning, reinforcement learning, and world models. Neural Netw 2022;152:267-275. [DOI: 10.1016/j.neunet.2022.03.037] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 08/15/2021] [Revised: 02/19/2022] [Accepted: 03/28/2022] [Indexed: 12/01/2022]
2. Manifold-based multi-objective policy search with sample reuse. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.11.094] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Indexed: 11/19/2022]
3. Importance sampling policy gradient algorithms in reproducing kernel Hilbert space. Artif Intell Rev 2017. [DOI: 10.1007/s10462-017-9579-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Indexed: 11/26/2022]
4. Wang J, Uchibe E, Doya K. Adaptive Baseline Enhances EM-Based Policy Search: Validation in a View-Based Positioning Task of a Smartphone Balancer. Front Neurorobot 2017;11:1. [PMID: 28167910 PMCID: PMC5256123 DOI: 10.3389/fnbot.2017.00001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 07/29/2016] [Accepted: 01/03/2017] [Indexed: 11/28/2022]
5. Model-based reinforcement learning with dimension reduction. Neural Netw 2016;84:1-16. [DOI: 10.1016/j.neunet.2016.08.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Received: 02/03/2016] [Revised: 08/02/2016] [Accepted: 08/16/2016] [Indexed: 11/23/2022]
6. Hwangbo J, Gehring C, Sommer H, Siegwart R, Buchli J. Policy Learning with an Efficient Black-Box Optimization Algorithm. Int J Hum Robot 2015. [DOI: 10.1142/s0219843615500292] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 11/18/2022]
7. Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation. Neural Netw 2014;57:128-40. [PMID: 24995917 DOI: 10.1016/j.neunet.2014.06.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Received: 07/11/2013] [Revised: 04/17/2014] [Accepted: 06/11/2014] [Indexed: 11/23/2022]
8. Sugiyama M, Yamada M, du Plessis MC. Learning under nonstationarity: covariate shift and class-balance change. WIREs Comput Stat 2013. [DOI: 10.1002/wics.1275] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Indexed: 11/09/2022]
© 2004-2024 Baishideng Publishing Group Inc. All rights reserved. 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA