1
|
Abstract
In general, games pose interesting and complex problems for the implementation of intelligent agents and are a popular domain in the study of artificial intelligence. In fact, games have been at the center of some of the most well-known achievements in artificial intelligence. From classical board games such as chess, checkers, backgammon and Go, to video games such as Dota 2 and StarCraft II, artificial intelligence research has devised computer programs that can play at the level of a human master and even at a human world champion level. Planning and learning, two well-known and successful paradigms of artificial intelligence, have greatly contributed to these achievements. Although representing distinct approaches, planning and learning try to solve similar problems and share some similarities. They can even complement each other. This has led to research on methodologies to combine the strengths of both approaches to derive better solutions. This paper presents a survey of the multiple methodologies that have been proposed to integrate planning and learning in the context of games. In order to provide a richer contextualization, the paper also presents learning and planning techniques commonly used in games, both in terms of their theoretical foundations and applications.
Collapse
|
2
|
Kirsch A. Shakey Ever After? Questioning Tacit Assumptions in Robotics and Artificial Intelligence. KUNSTLICHE INTELLIGENZ 2019. [DOI: 10.1007/s13218-019-00626-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
3
|
|
4
|
Abstract
AbstractGeneralized planningstudies the representation, computation and evaluation of solutions that are valid for multiple planning instances. These are topics studied since the early days of AI. However, in recent years, we are experiencing the appearance of novel formalisms to compactly represent generalized planning tasks, the solutions to these tasks (calledgeneralized plans) and efficient algorithms to compute generalized plans. The paper reviews recent advances in generalized planning and relates them to existing planning formalisms, such asplanning with domain control knowledgeand approaches forplanning under uncertainty, that also aim at generality.
Collapse
|
5
|
Abstract
One of the central issues in cognitive science is the nature of human representations. We argue that symbolic representations are essential for capturing human cognitive capabilities. We start by examining some common misconceptions found in discussions of representations and models. Next we examine evidence that symbolic representations are essential for capturing human cognitive capabilities, drawing on the analogy literature. Then we examine fundamental limitations of feature vectors and other distributed representations that, despite their recent successes on various practical problems, suggest that they are insufficient to capture many aspects of human cognition. After that, we describe the implications for cognitive architecture of our view that analogy is central, and we speculate on roles for hybrid approaches. We close with an analogy that might help bridge the gap.
Collapse
Affiliation(s)
| | - Chen Liang
- Department of Computer Science, Northwestern University
| | - Irina Rabkina
- Department of Computer Science, Northwestern University
| |
Collapse
|
6
|
|
7
|
|
8
|
|
9
|
Jiménez S, Fernández F, Borrajo D. INTEGRATING PLANNING, EXECUTION, AND LEARNING TO IMPROVE PLAN EXECUTION. Comput Intell 2012. [DOI: 10.1111/j.1467-8640.2012.00447.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
10
|
|
11
|
Fernández Arregui S, Jiménez Celorrio S, de la Rosa Turbides T. Improving Automated Planning with Machine Learning. Mach Learn 2012. [DOI: 10.4018/978-1-60960-818-7.ch510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This chapter reports the last machine learning techniques for the assistance of automated planning. Recent discoveries in automated planning have opened the scope of planners, from toy problems to real-world applications, making new challenges come into focus. The planning community believes that machine learning can assist to address these new challenges. The chapter collects the last machine learning techniques for assisting automated planners classified in: techniques for the improvement of the planning search processes and techniques for the automatic definition of planning action models. For each technique, the chapter provides an in-depth analysis of their domain, advantages and disadvantages. Finally, the chapter draws the outline of the new promising avenues for research in learning for planning systems.
Collapse
|
12
|
KERKEZ BORIS, COX MICHAELT. INCREMENTAL CASE-BASED PLAN RECOGNITION WITH LOCAL PREDICTIONS. INT J ARTIF INTELL T 2011. [DOI: 10.1142/s0218213003001307] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We present a novel case-based plan recognition method that interprets observations of plan behavior using an incrementally constructed case library of past observations. The technique is novel in several ways. It combines plan recognition with case-based reasoning and leverages the strengths of both. The representation of a plan is a sequence of action-state pairs rather than only the actions. The technique compensates for the additional complexity with a unique abstraction scheme augmented by pseudo-isomorphic similarity relations to represent indices into the case base. Past cases are used to predict subsequent actions by adapting old actions and their arguments. Moreover, the technique makes predictions despite observations of unknown actions. This paper evaluates the algorithms and their implementation both analytically and empirically. The evaluation criteria include prediction accuracy at both an abstract and a concrete level and across multiple domains with and without case-adaptation. In each domain the system starts with an empty case base that grows to include thousands of past observations. Results demonstrate that this new method is accurate, robust, scalable, and general across domains.
Collapse
Affiliation(s)
- BORIS KERKEZ
- Department of Computer Science and Engineering, Wright State University, Dayton, OH, USA
| | - MICHAEL T. COX
- Department of Computer Science and Engineering, Wright State University, Dayton, OH, USA
| |
Collapse
|
13
|
KOŠMERLJ ALJAŽ, BRATKO IVAN, ŽABKAR JURE. EMBODIED CONCEPT DISCOVERY THROUGH QUALITATIVE ACTION MODELS. INT J UNCERTAIN FUZZ 2011. [DOI: 10.1142/s0218488511007088] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We present a novel approach to embodied learning of qualitative models. We introduce algorithm STRUDEL that enables an autonomous robot to discover new concepts by performing experiments in its environment. The robot collects data about its actions and its observations of the environment. From the obtained data, the robot learns qualitative descriptive models of the effects that its actions have in the environment. Models are learned using inductive logic programming. We describe two experiments with a humanoid robot Nao in which Nao learns descriptive qualitative models which contain what can be interpreted as simple definitions of the concepts of movability and stability.
Collapse
Affiliation(s)
- ALJAŽ KOŠMERLJ
- Artificial Intelligence Laboratory, University of Ljubljana, Faculty of Computer and Information Science, Tržaska cesta 25, Ljubljana, SI-1001, Slovenia
| | - IVAN BRATKO
- Artificial Intelligence Laboratory, University of Ljubljana, Faculty of Computer and Information Science, Tržaska cesta 25, Ljubljana, SI-1001, Slovenia
| | - JURE ŽABKAR
- Artificial Intelligence Laboratory, University of Ljubljana, Faculty of Computer and Information Science, Tržaska cesta 25, Ljubljana, SI-1001, Slovenia
| |
Collapse
|
14
|
Including cognitive biases and distance-based rewards in a connectionist model of complex problem solving. Neural Netw 2011; 25:41-56. [PMID: 21840172 DOI: 10.1016/j.neunet.2011.06.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2010] [Revised: 06/29/2011] [Accepted: 06/29/2011] [Indexed: 11/23/2022]
Abstract
We present a cognitive, connectionist-based model of complex problem solving that integrates cognitive biases and distance-based and environmental rewards under a temporal-difference learning mechanism. The model is tested against experimental data obtained in a well-defined and planning-intensive problem. We show that incorporating cognitive biases (symmetry and simplicity) in a temporal-difference learning rule (SARSA) increases model adequacy-the solution space explored by biased models better fits observed human solutions. While learning from explicit rewards alone is intrinsically slow, adding distance-based rewards, a measure of closeness to goal, to the learning rule significantly accelerates learning. Finally, the model correctly predicts that explicit rewards have little impact on problem solvers' ability to discover optimal solutions.
Collapse
|
15
|
|
16
|
|
17
|
|
18
|
Murdock JW, Goel AK. Meta-case-based reasoning: self-improvement through self-understanding. J EXP THEOR ARTIF IN 2008. [DOI: 10.1080/09528130701472416] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
19
|
Abstract
AbsractArtificial intelligence (AI) planning solves the problem of generating a correct and efficient ordered set of instantiated activities, from a knowledge base of generic actions, which when executed will transform some initial state into some desirable end-state. There is a long tradition of work in AI for developing planners that make use of heuristics that are shown to improve their performance in many real world and artificial domains. The developers of planners have chosen between two extremes when defining those heuristics. The domain-independent planners use domain-independent heuristics, which exploit information only from the ‘syntactic’ structure of the problem space and of the search tree. Therefore, they do not need any ‘semantic’ information from a given domain in order to guide the search. From a knowledge engineering (KE) perspective, the planners that use this type of heuristics have the advantage that the users of this technology need only focus on defining the domain theory and not on defining how to make the planner efficient (how to obtain ‘good’ solutions with the minimal computational resources). However, the domain-dependent planners require users to manually represent knowledge not only about the domain theory, but also about how to make the planner efficient. This approach has the advantage of using either better domain-theory formulations or using domain knowledge for defining the heuristics, thus potentially making them more efficient. However, the efficiency of these domain-dependent planners strongly relies on the KE and planning expertise of the user. When the user is an expert on these two types of knowledge, domain-dependent planners clearly outperform domain-independent planners in terms of number of solved problems and quality of solutions. Machine-learning (ML) techniques applied to solve the planning problems have focused on providing middle-ground solutions as compared to the aforementioned two extremes. Here, the user first defines a domain theory, and then executes the ML techniques that automatically modify or generate new knowledge with respect to both the domain theory and the heuristics. In this paper, we present our work on building a tool, PLTOOL (planning and learning tool), to help users interact with a set of ML techniques and planners. The goal is to provide a KE framework for mixed-initiative generation of efficient and good planning knowledge.
Collapse
|
20
|
|
21
|
|
22
|
|
23
|
|
24
|
Ilghami O, Nau DS, Munoz-Avila H, Aha DW. LEARNING PRECONDITIONS FOR PLANNING FROM PLAN TRACES AND HTN STRUCTURE. Comput Intell 2005. [DOI: 10.1111/j.1467-8640.2005.00279.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
25
|
Phung T, Winikoff M, Padgham L. Learning Within the BDI Framework: An Empirical Analysis. LECTURE NOTES IN COMPUTER SCIENCE 2005. [DOI: 10.1007/11553939_41] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
26
|
AI Planning Technology as a Component of Computerised Clinical Practice Guidelines. Artif Intell Med 2005. [DOI: 10.1007/11527770_26] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
27
|
Wickramasinghe LK, Amarasiri R, Alahakoon LD. A Hybrid Intelligent Multiagent System for E-Business. Comput Intell 2004. [DOI: 10.1111/j.0824-7935.2004.00256.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
28
|
Galindo C, Fernandez-Madrigal JA, Gonzalez J. Improving Efficiency in Mobile Robot Task Planning Through World Abstraction. IEEE T ROBOT 2004. [DOI: 10.1109/tro.2004.829480] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
29
|
Fink E. Automatic evaluation and selection of problem-solving methods: theory and experiments. J EXP THEOR ARTIF IN 2004. [DOI: 10.1080/09528130410001724797] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
30
|
Baier JA, Pinto JA. Planning under uncertainty as GOLOGprograms. J EXP THEOR ARTIF IN 2003. [DOI: 10.1080/0952813031000064567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
31
|
Aler R, Borrajo D, Camacho D, Sierra-Alonso A. A knowledge-based approach for business process reengineering, SHAMASH. Knowl Based Syst 2002. [DOI: 10.1016/s0950-7051(02)00032-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
32
|
|
33
|
Aler R, Borrajo D, Isasi P. Learning to solve planning problems efficiently by means of genetic programming. EVOLUTIONARY COMPUTATION 2001; 9:387-420. [PMID: 11709102 DOI: 10.1162/10636560152642841] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Declarative problem solving, such as planning, poses interesting challenges for Genetic Programming (GP). There have been recent attempts to apply GP to planning that fit two approaches: (a) using GP to search in plan space or (b) to evolve a planner. In this article, we propose to evolve only the heuristics to make a particular planner more efficient. This approach is more feasible than (b) because it does not have to build a planner from scratch but can take advantage of already existing planning systems. It is also more efficient than (a) because once the heuristics have been evolved, they can be used to solve a whole class of different planning problems in a planning domain, instead of running GP for every new planning problem. Empirical results show that our approach (EvoCK) is able to evolve heuristics in two planning domains (the blocks world and the logistics domain) that improve PRODIGY4.0 performance. Additionally, we experiment with a new genetic operator --Instance-Based Crossover--that is able to use traces of the base planner as raw genetic material to be injected into the evolving population.
Collapse
Affiliation(s)
- R Aler
- Department of Computer Science, Universidad Carlos III de Madrid, 28911 Leganés, Madrid, Spain.
| | | | | |
Collapse
|
34
|
|
35
|
|
36
|
|
37
|
ZITA KAREN, JONATHAN HAIGH, SHEWCHUK RICHARD, VELOSO MANUELAM. Exploiting domain geometry in analogical route planning. J EXP THEOR ARTIF IN 1997. [DOI: 10.1080/095281397147013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
38
|
Controlling for unexpected goals when planning in a mixed-initiative setting. PROGRESS IN ARTIFICIAL INTELLIGENCE 1997. [DOI: 10.1007/bfb0023933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
39
|
Fink E, Yang Q. Automatically selecting and using primary effects in planning: theory and experiments. ARTIF INTELL 1997. [DOI: 10.1016/s0004-3702(96)00020-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
40
|
Veloso MM. Merge strategies for multiple case plan replay. CASE-BASED REASONING RESEARCH AND DEVELOPMENT 1997. [DOI: 10.1007/3-540-63233-6_511] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|