1
Wang Z, Chen C, Dong D. Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments. IEEE Transactions on Neural Networks and Learning Systems 2023; 34:9742-9756. [PMID: 35349452] [DOI: 10.1109/tnnls.2022.3160173]
Abstract
Evolution strategies (ESs), as a family of black-box optimization algorithms, have recently emerged as a scalable alternative to reinforcement learning (RL) approaches such as Q-learning or policy gradient, and are much faster when many central processing units (CPUs) are available owing to better parallelization. In this article, we propose a systematic incremental learning method for ES in dynamic environments. The goal is to adjust a previously learned policy to a new one incrementally whenever the environment changes. We incorporate an instance weighting mechanism into ES to facilitate its learning adaptation while retaining the scalability of ES. During parameter updating, higher weights are assigned to instances that contain more new knowledge, thus encouraging the search distribution to move toward new promising areas of the parameter space. We propose two easy-to-implement metrics to calculate the weights: instance novelty and instance quality. Instance novelty measures an instance's difference from the previous optimum in the original environment, while instance quality corresponds to how well an instance performs in the new environment. The resulting algorithm, instance weighted incremental evolution strategies (IW-IES), is verified to achieve significantly improved performance on challenging RL tasks ranging from robot navigation to locomotion. This article thus introduces a family of scalable ES algorithms for RL domains that enables rapid learning adaptation to dynamic environments.
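The core idea is a weighted ES update in which each sampled perturbation (an "instance") is weighted by its novelty with respect to the previous optimum and its quality in the changed environment. The sketch below is a minimal illustration of that idea on a toy quadratic objective; the specific weighting rule and all hyperparameters are assumptions for this example, not the authors' exact formulation.

```python
import numpy as np

def fitness(theta, target):
    """Toy stand-in for an RL return: higher is better near `target`."""
    return -np.sum((theta - target) ** 2)

def iw_es_step(theta, theta_old_opt, target_new, rng, sigma=0.1, lr=0.05, pop=64):
    eps = rng.standard_normal((pop, theta.size))        # sampled perturbations
    instances = theta + sigma * eps                     # candidate parameter vectors

    quality = np.array([fitness(x, target_new) for x in instances])  # performance in the new environment
    novelty = np.linalg.norm(instances - theta_old_opt, axis=1)      # distance from the previous optimum

    def norm01(v):                                      # rescale both signals to [0, 1]
        return (v - v.min()) / (np.ptp(v) + 1e-8)

    weights = norm01(quality) * norm01(novelty)         # instances with more "new knowledge" weigh more
    weights /= weights.sum() + 1e-8

    grad = (weights[:, None] * eps).sum(axis=0) / sigma # weighted ES update direction
    return theta + lr * grad

rng = np.random.default_rng(0)
theta = np.zeros(5)        # policy parameters learned in the original environment
old_opt = theta.copy()     # previous optimum
new_target = np.ones(5)    # the environment has changed
for _ in range(200):
    theta = iw_es_step(theta, old_opt, new_target, rng)
print(theta.round(2))
```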
2
Zaniolo M, Giuliani M, Castelletti A. Neuro-Evolutionary Direct Policy Search for Multiobjective Optimal Control. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:5926-5938. [PMID: 33882008] [DOI: 10.1109/tnnls.2021.3071960]
Abstract
Direct policy search (DPS) is emerging as one of the most effective and widely applied reinforcement learning (RL) methods for designing optimal control policies for multiobjective Markov decision processes (MOMDPs). Traditionally, DPS defines the control policy within a preselected functional class and searches for its optimal parameterization with respect to a given set of objectives. The functional class should be tailored to the problem at hand, and its selection is crucial, as it determines the search space within which solutions can be found. In MOMDP problems, each objective tradeoff determines a different fitness landscape, requiring a tradeoff-dynamic selection of the functional class. Yet, in state-of-the-art applications, the policy class is generally selected a priori and kept constant across the multidimensional objective space. In this work, we present a novel policy search routine called neuro-evolutionary multiobjective DPS (NEMODPS), which extends the DPS problem formulation to jointly search the policy functional class and its parameterization in a hyperspace containing policy architectures and coefficients. NEMODPS begins with a population of minimally structured approximating networks and progressively builds more sophisticated architectures through topological and parametrical mutation and crossover, and selection of the fittest individuals with respect to multiple objectives. We tested NEMODPS on the problem of designing the control policy of a multipurpose water system. Numerical results show that the tradeoff-dynamic structural and parametrical policy search of NEMODPS is consistent across multiple runs and outperforms the solutions designed via traditional DPS with predefined policy topologies.
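To make the topology-and-weights search concrete, here is a small, mutation-only sketch of the general pattern the abstract describes: a population of minimal networks whose hidden layer can grow through topological mutation, whose weights are perturbed through parametric mutation, and whose survivors are chosen by Pareto dominance on two objectives. The toy objectives (fit error vs. network size), the mutation rates, and the selection scheme are assumptions for illustration, not the NEMODPS operators, and crossover is omitted.

```python
import numpy as np

rng = np.random.default_rng(1)

def make_policy(n_hidden, n_in=3, n_out=1):
    return {"W1": 0.1 * rng.standard_normal((n_hidden, n_in)),
            "W2": 0.1 * rng.standard_normal((n_out, n_hidden))}

def act(policy, obs):
    return np.tanh(policy["W2"] @ np.tanh(policy["W1"] @ obs))

def objectives(policy):
    """Two toy, conflicting objectives (both minimised): fit error and network size."""
    obs = np.linspace(-1.0, 1.0, 16).reshape(-1, 1) * np.ones((1, 3))
    preds = np.array([act(policy, o) for o in obs])
    err = float(np.mean((preds - np.sin(obs[:, :1])) ** 2))
    size = (policy["W1"].size + policy["W2"].size) / 100.0
    return err, size

def mutate(policy):
    child = {k: v.copy() for k, v in policy.items()}
    if rng.random() < 0.3:   # topological mutation: add one hidden unit
        child["W1"] = np.vstack([child["W1"], 0.1 * rng.standard_normal((1, child["W1"].shape[1]))])
        child["W2"] = np.hstack([child["W2"], 0.1 * rng.standard_normal((child["W2"].shape[0], 1))])
    for k in child:          # parametric mutation: Gaussian weight noise
        child[k] = child[k] + 0.05 * rng.standard_normal(child[k].shape)
    return child

def dominates(a, b):
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

pop = [make_policy(1) for _ in range(20)]   # start from minimally structured networks
for _ in range(30):
    pop = pop + [mutate(p) for p in pop]
    scored = [(objectives(p), p) for p in pop]
    front = [p for s, p in scored if not any(dominates(t, s) for t, _ in scored)]
    rest = [p for s, p in scored if any(dominates(t, s) for t, _ in scored)]
    pop = (front + rest)[:20]               # keep the Pareto front first, fill with the rest

print("hidden-layer sizes surviving selection:", sorted({p["W1"].shape[0] for p in pop}))
```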
3
Kapoor A, Nukala E, Chandra R. Bayesian neuroevolution using distributed swarm optimization and tempered MCMC. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109528]
4
Adaptive evolution strategy with ensemble of mutations for Reinforcement Learning. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108624]
5
Salih A, Moshaiov A. Evolving topology and weights of specialized and non-specialized neuro-controllers for robot motion in various environments. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07357-4]
7
Radaideh MI, Forget B, Shirvan K. Large-scale design optimisation of boiling water reactor bundles with neuroevolution. Ann Nucl Energy 2021. [DOI: 10.1016/j.anucene.2021.108355]
8
Li S, Li M, Su J, Chen S, Yuan Z, Ye Q. PP-PG: Combining Parameter Perturbation with Policy Gradient Methods for Effective and Efficient Explorations in Deep Reinforcement Learning. ACM Trans Intell Syst Technol 2021. [DOI: 10.1145/3452008]
Abstract
Efficient and stable exploration remains a key challenge for deep reinforcement learning (DRL) operating in high-dimensional action and state spaces. Recently, a promising line of work has combined exploration in the action space with exploration in the parameter space to get the best of both approaches. In this article, we propose a new iterative, closed-loop framework that combines an evolutionary algorithm (EA), which explores the parameter space directly in a gradient-free manner, with the actor-critic deep deterministic policy gradient (DDPG) algorithm, which explores the action space in a gradient-based manner, so that the two methods cooperate in a more balanced and efficient way. In our framework, the policies represented by the EA population (the parameter perturbation part) evolve in a guided manner by utilizing the gradient information provided by DDPG, while the policy gradient part (DDPG) is used only as a fine-tuning tool for the best individual in the EA population to improve sample efficiency. In particular, we propose a criterion to determine the number of training steps required by DDPG, ensuring that useful gradient information can be generated from the EA-generated samples and that the DDPG and EA parts work together in a balanced way during each generation. Furthermore, within the DDPG part, the algorithm can flexibly switch between fine-tuning the previous RL actor and fine-tuning a new one generated by the EA, according to the situation, to further improve efficiency. Experiments on a range of challenging continuous control benchmarks demonstrate that our algorithm outperforms related works and offers a satisfactory trade-off between stability and sample efficiency.
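The cooperation loop can be summarized as: evaluate the EA population, fine-tune the best individual with a few gradient steps, then rebuild the population by perturbing the fine-tuned actor. The sketch below shows that loop on a toy reward whose analytic gradient stands in for the DDPG update; the injection scheme and all constants are simplifying assumptions, not the PP-PG implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
TARGET = np.array([1.0, -2.0, 0.5])

def reward(theta):
    return -np.sum((theta - TARGET) ** 2)

def reward_grad(theta):
    # Stand-in for the gradient that the DDPG critic/actor would provide.
    return -2.0 * (theta - TARGET)

pop = [rng.standard_normal(3) for _ in range(10)]     # EA population (parameter perturbations)
for generation in range(50):
    fitnesses = [reward(p) for p in pop]
    elite = pop[int(np.argmax(fitnesses))]

    # Gradient-based fine-tuning of the elite individual (the "policy gradient" part).
    actor = elite.copy()
    for _ in range(5):                                # a few gradient steps per generation
        actor = actor + 0.05 * reward_grad(actor)

    # Guided evolution: inject the fine-tuned actor back into the population and
    # let the rest of the population explore around it in parameter space.
    pop = [actor] + [actor + 0.2 * rng.standard_normal(3) for _ in range(len(pop) - 1)]

print("best reward:", round(reward(pop[0]), 4))
```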
Affiliation(s)
- Shilei Li, Department of Information Security, Naval University of Engineering, Wuhan, China
- Meng Li, Army Academy of Artillery and Air Defense, Hefei, China
- Jiongming Su, College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
- Shaofei Chen, College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
- Zhimin Yuan, Department of Information Security, Naval University of Engineering, Wuhan, China
- Qing Ye, Department of Information Security, Naval University of Engineering, Wuhan, China
9
Cuccu G, Togelius J, Cudré-Mauroux P. Playing Atari with few neurons: Improving the efficacy of reinforcement learning by decoupling feature extraction and decision making. Autonomous Agents and Multi-Agent Systems 2021; 35:17. [PMID: 34720684] [PMCID: PMC8550197] [DOI: 10.1007/s10458-021-09497-8]
Abstract
We propose a new method for learning compact state representations and policies separately but simultaneously for policy approximation in vision-based applications such as Atari games. Approaches based on deep reinforcement learning typically map pixels directly to actions to enable end-to-end training. Internally, however, the deep neural network bears the responsibility of both extracting useful information and making decisions based on it, two objectives which can be addressed independently. Separating the image processing from the action selection allows for a better understanding of each task individually, as well as potentially finding smaller policy representations, which is interesting in its own right. Our approach learns state representations using a compact encoder based on two novel algorithms: (i) Increasing Dictionary Vector Quantization builds a dictionary of state representations that grows in size over time, allowing our method to address new observations as they appear in an open-ended online-learning context; and (ii) Direct Residuals Sparse Coding encodes observations as a function of the dictionary, aiming for the highest information inclusion by disregarding reconstruction error and maximizing code sparsity. As the dictionary size increases, however, the encoder produces increasingly larger inputs for the neural network; this issue is addressed with a new variant of the Exponential Natural Evolution Strategies algorithm which adapts the dimensionality of its probability distribution along the run. We test our system on a selection of Atari games using tiny neural networks of only 6 to 18 neurons (depending on each game's controls). These are still capable of achieving results that are not much worse than, and occasionally superior to, the state of the art in direct policy search, which uses two orders of magnitude more neurons.
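The decoupling can be illustrated with a growing dictionary and a greedy residual-based binary code feeding a tiny policy. The sketch below is a rough approximation of that pipeline; the thresholds, the greedy rule, and the random "policy" weights (which the paper instead evolves with its NES variant) are assumptions for this example rather than the published IDVQ/DRSC definitions.

```python
import numpy as np

class GrowingEncoder:
    """Online dictionary + greedy binary code (a rough stand-in for IDVQ/DRSC)."""

    def __init__(self, new_atom_threshold=0.5, max_active_atoms=3):
        self.dictionary = []               # flattened observations stored as atoms
        self.tau = new_atom_threshold
        self.k = max_active_atoms

    def encode(self, obs):
        x = obs.ravel().astype(float)
        code = np.zeros(len(self.dictionary))
        residual = x.copy()
        for _ in range(self.k):            # greedily pick atoms that explain the residual
            if not self.dictionary:
                break
            sims = [float(residual @ atom) for atom in self.dictionary]
            best = int(np.argmax(sims))
            if sims[best] <= 0.0:
                break
            code[best] = 1.0               # sparse, binary code entry
            residual = np.clip(residual - self.dictionary[best], 0.0, None)
        if np.linalg.norm(residual) > self.tau * np.linalg.norm(x):
            self.dictionary.append(residual)   # grow the dictionary with what is unexplained
            code = np.append(code, 1.0)
        return code                        # code length grows with the dictionary

enc = GrowingEncoder()
rng = np.random.default_rng(0)
n_actions = 4
for step in range(5):
    frame = rng.random((8, 8))             # stand-in for a downsampled game frame
    code = enc.encode(frame)
    weights = rng.standard_normal((n_actions, code.size))  # tiny policy; would be evolved by NES
    action = int(np.argmax(weights @ code))
    print(f"step {step}: dictionary size = {len(enc.dictionary)}, action = {action}")
```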
Affiliation(s)
- Giuseppe Cuccu, eXascale Infolab, Department of Computer Science, University of Fribourg, Fribourg, Switzerland
- Julian Togelius, Game Innovation Lab, Tandon School of Engineering, New York University, New York, NY, USA
- Philippe Cudré-Mauroux, eXascale Infolab, Department of Computer Science, University of Fribourg, Fribourg, Switzerland
10
Radaideh MI, Shirvan K. Rule-based reinforcement learning methodology to inform evolutionary algorithms for constrained optimization of engineering applications. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106836]
11
Zhang W, Zhou Q. Software test data generation technology based on polymorphic particle swarm evolutionary algorithm. Journal of Intelligent & Fuzzy Systems 2021. [DOI: 10.3233/jifs-189811]
Abstract
Combinatorial testing is a specification-based software testing method that aims to select a small number of valid test cases from the large combinatorial space of the software under test, generating a test suite with high coverage and strong fault-detecting ability. However, combinatorial test case generation is an NP-hard problem that cannot, in general, be solved exactly in polynomial time, so meta-heuristic search algorithms are needed. Compared with other meta-heuristic search algorithms, the particle swarm algorithm is more competitive in terms of covering-array size and execution time. In this paper, we systematically review and summarize the existing research on generating combinatorial test suites with the particle swarm algorithm, and propose a combinatorial test case generation method that can handle arbitrary coverage strengths by combining an improved one-test-at-a-time strategy with an adaptive particle swarm algorithm, addressing both the variable-strength combinatorial testing problem and the parameter selection problem of the particle swarm algorithm. To address the parameter configuration problem of the particle swarm algorithm, the four parameters of inertia weight, learning factor, population size, and iteration number are set appropriately, making the particle swarm algorithm better suited to covering-array generation.
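A minimal sketch of the one-test-at-a-time pattern with a standard PSO update follows: each particle encodes one candidate test, its fitness is the number of still-uncovered 2-way value pairs it would cover, and the best particle is appended to the suite until every pair is covered. The model of the system under test and the parameter values (inertia weight, learning factors, swarm size, iteration count) are illustrative assumptions, not the tuned configuration from the paper.

```python
import itertools
import numpy as np

LEVELS = [3, 3, 2, 2]          # number of values for each of four input parameters
rng = np.random.default_rng(0)

def all_pairs():
    """Every 2-way combination (param i = value vi, param j = value vj) to be covered."""
    return {(i, vi, j, vj)
            for i, j in itertools.combinations(range(len(LEVELS)), 2)
            for vi in range(LEVELS[i]) for vj in range(LEVELS[j])}

def pairs_of(test):
    return {(i, test[i], j, test[j]) for i, j in itertools.combinations(range(len(test)), 2)}

def decode(position):
    return tuple(int(np.clip(np.rint(p), 0, l - 1)) for p, l in zip(position, LEVELS))

def pso_one_test(uncovered, swarm=20, iters=40, w=0.7, c1=1.5, c2=1.5):
    """Search for one test case that covers as many still-uncovered pairs as possible."""
    dim = len(LEVELS)
    pos = rng.random((swarm, dim)) * (np.array(LEVELS) - 1)
    vel = np.zeros((swarm, dim))
    pbest = pos.copy()
    pbest_fit = np.array([len(pairs_of(decode(p)) & uncovered) for p in pos])
    gbest = pbest[int(np.argmax(pbest_fit))].copy()
    for _ in range(iters):
        r1, r2 = rng.random((swarm, dim)), rng.random((swarm, dim))
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)  # PSO velocity update
        pos = pos + vel
        fit = np.array([len(pairs_of(decode(p)) & uncovered) for p in pos])
        improved = fit > pbest_fit
        pbest[improved], pbest_fit[improved] = pos[improved], fit[improved]
        gbest = pbest[int(np.argmax(pbest_fit))].copy()
    return decode(gbest)

uncovered, suite = all_pairs(), []
while uncovered:                            # one-test-at-a-time: add tests until all pairs are covered
    test = pso_one_test(uncovered)
    if not pairs_of(test) & uncovered:      # safety net: build a covering test directly if PSO stalls
        i, vi, j, vj = next(iter(uncovered))
        test = tuple(vi if k == i else vj if k == j else 0 for k in range(len(LEVELS)))
    suite.append(test)
    uncovered -= pairs_of(test)
print(f"{len(suite)} tests cover all 2-way combinations")
```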
Affiliation(s)
- Wenning Zhang, State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou, Henan, China; Software College, Zhongyuan University of Technology, Zhengzhou, Henan, China
- Qinglei Zhou, School of Information Engineering, Zhengzhou University, Zhengzhou, Henan, China
12
Santos I, Castro L, Rodriguez-Fernandez N, Torrente-Patiño Á, Carballal A. Artificial Neural Networks and Deep Learning in the Visual Arts: a review. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05565-4]
13
Abstract
In general, games pose interesting and complex problems for the implementation of intelligent agents and are a popular domain in the study of artificial intelligence. In fact, games have been at the center of some of the most well-known achievements in artificial intelligence. From classical board games such as chess, checkers, backgammon and Go, to video games such as Dota 2 and StarCraft II, artificial intelligence research has devised computer programs that can play at the level of a human master and even at a human world champion level. Planning and learning, two well-known and successful paradigms of artificial intelligence, have greatly contributed to these achievements. Although representing distinct approaches, planning and learning try to solve similar problems and share some similarities. They can even complement each other. This has led to research on methodologies to combine the strengths of both approaches to derive better solutions. This paper presents a survey of the multiple methodologies that have been proposed to integrate planning and learning in the context of games. In order to provide a richer contextualization, the paper also presents learning and planning techniques commonly used in games, both in terms of their theoretical foundations and applications.
14
Chen D, Wang Y, Gao W. Combining a gradient-based method and an evolution strategy for multi-objective reinforcement learning. Appl Intell 2020. [DOI: 10.1007/s10489-020-01702-7]
16
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI. Künstliche Intelligenz 2020. [DOI: 10.1007/s13218-020-00647-w]
17
Reinforcement Learning and Neuroevolution in Flappy Bird Game. Pattern Recognition and Image Analysis 2019. [DOI: 10.1007/978-3-030-31332-6_20]
18
Yang X, Deng S, Ji M, Zhao J, Zheng W. Neural Network Evolving Algorithm Based on the Triplet Codon Encoding Method. Genes (Basel) 2018; 9:626. [PMID: 30551648] [PMCID: PMC6315701] [DOI: 10.3390/genes9120626]
Abstract
Artificial intelligence research has been receiving more and more attention in recent years. Neuroevolution (NE) is an important branch of AI that harnesses the power of evolutionary algorithms to generate artificial neural networks (ANNs). How to exploit the evolution of both network topology and weights in applications of ANNs is the main problem in the field of NE. In this paper, a novel DNA encoding method based on the triplet codon is proposed. Additionally, an NE algorithm based on this encoding method, the Triplet Codon Encoding Neural Network Evolving Algorithm (TCENNE), is presented to verify the rationality and validity of the coding design. The results show that TCENNE is very effective and more robust than existing NE algorithms, owing to the coding design. It is also shown that TCENNE can realize the co-evolution of network topology and weights and outperform other neuroevolution systems on challenging reinforcement learning tasks.
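As an illustration of a codon-style genome, the sketch below reads a string over four bases three codons at a time and decodes each group into a connection gene (source node, target node, weight), with a simple point-mutation operator. The codon-to-value mapping and the gene layout are assumptions made for this example; the actual TCENNE encoding is defined in the paper.

```python
import numpy as np

BASES = "ATGC"

def codon_value(codon):
    """Map a three-letter codon to an integer in [0, 63] via base-4 digits."""
    return sum(BASES.index(b) * 4 ** i for i, b in enumerate(codon))

def decode(genome, n_nodes=6):
    """Read three codons (nine bases) at a time as one connection gene (src, dst, weight)."""
    usable = len(genome) - len(genome) % 3
    codons = [genome[i:i + 3] for i in range(0, usable, 3)]
    connections = []
    for k in range(0, len(codons) - 2, 3):
        src = codon_value(codons[k]) % n_nodes
        dst = codon_value(codons[k + 1]) % n_nodes
        weight = codon_value(codons[k + 2]) / 63.0 * 2.0 - 1.0   # scale to [-1, 1]
        connections.append((src, dst, round(weight, 3)))
    return connections

def point_mutate(genome, rng, rate=0.05):
    """Flip individual bases; a single base change can alter topology or a weight."""
    return "".join(str(rng.choice(list(BASES))) if rng.random() < rate else b for b in genome)

rng = np.random.default_rng(0)
genome = "".join(str(rng.choice(list(BASES))) for _ in range(9 * 4))   # four connection genes
print("parent genes:", decode(genome))
print("mutant genes:", decode(point_mutate(genome, rng)))
```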
Affiliation(s)
- Xu Yang, School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Songgaojun Deng, School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Mengyao Ji, School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Jinfeng Zhao, School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Wenhao Zheng, School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China