1
Deshpande SV, Harikrishnan R, Sampe J, Patwa A. An algorithm to create model file for Partially Observable Markov Decision Process for mobile robot path planning. MethodsX 2024; 12:102552. PMID: 38299041; PMCID: PMC10828799; DOI: 10.1016/j.mex.2024.102552.
Abstract
The Partially Observable Markov Decision Process (POMDP), a mathematical framework for decision-making in uncertain environments, suffers from the curse of dimensionality. Various methods can handle huge POMDP matrices to produce approximate solutions, but no serious effort has been reported to effectively control the size of the POMDP matrices themselves. Manually creating the high-dimension matrices of a POMDP model is a cumbersome and sometimes even impossible task. The PCMRPP (POMDP file Creator for Mobile Robot Path Planning) software package implements a novel algorithm to programmatically generate these matrices such that:
- The sizes of the matrices can be controlled by configuring the granularity of discretization of the components of the state, and
- The sparseness of the matrices can be controlled by configuring the spread of the observation probability distribution.
This flexibility allows one to achieve a trade-off between time complexity and the level of robustness of the POMDP solution.
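As a rough illustration of the trade-off the abstract describes, the following sketch (hypothetical state components and bin counts, not taken from PCMRPP) shows how discretization granularity drives matrix size:

```python
# Hypothetical sketch, not the PCMRPP implementation: suppose the robot
# state is a tuple (x, y, heading).  Discretizing each component on a
# configurable grid sets the total number of POMDP states, and hence the
# size of the |S| x |S| transition matrix needed for each action.
def pomdp_state_count(x_bins, y_bins, heading_bins):
    """Number of discrete states for an (x, y, heading) robot state."""
    return x_bins * y_bins * heading_bins

coarse = pomdp_state_count(x_bins=5, y_bins=5, heading_bins=4)    # small, fast model
fine = pomdp_state_count(x_bins=20, y_bins=20, heading_bins=8)    # large, robust model

# Each action's transition matrix is |S| x |S|, so memory and solve time
# grow quadratically with the discretization granularity.
coarse_matrix_entries = coarse ** 2
fine_matrix_entries = fine ** 2
```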
Affiliation(s)
- Shripad V. Deshpande
- Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune, India
- R. Harikrishnan
- Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune, India
- Jahariah Sampe
- Institute of Microengineering and Nanoelectronics, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
- Abhimanyu Patwa
- Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune, India
2
Leong KH, Xiu Y, Chen B, Chan WK(V). Neural Causal Information Extractor for Unobserved Causes. Entropy (Basel) 2023; 26:46. PMID: 38248172; PMCID: PMC11154551; DOI: 10.3390/e26010046.
Abstract
Causal inference aims to faithfully depict the causal relationships between given variables. However, in many practical systems, variables are often partially observed, and some unobserved variables could carry significant information and induce causal effects on a target. Identifying these unobserved causes remains a challenge, and existing works have not considered extracting the unobserved causes while retaining the causes that have already been observed and included. In this work, we aim to construct the implicit variables with a generator-discriminator framework named the Neural Causal Information Extractor (NCIE), which can complement the information of unobserved causes and thus provide a complete set of causes with both observed causes and the representations of unobserved causes. By maximizing the mutual information between the targets and the union of observed causes and implicit variables, the implicit variables we generate could complement the information that the unobserved causes should have provided. The synthetic experiments show that the implicit variables preserve the information and dynamics of the unobserved causes. In addition, extensive real-world time series prediction tasks show improved precision after introducing implicit variables, thus indicating their causality to the targets.
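As a minimal illustration of the objective described above, the sketch below scores candidate variables by a simple histogram-based mutual-information estimate; NCIE itself uses a neural generator-discriminator estimator, and the data here are synthetic:

```python
import numpy as np

def mutual_information(x, y, bins=8):
    """Histogram estimate of I(X; Y) in nats for 1-D samples x, y."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()
    px = pxy.sum(axis=1, keepdims=True)       # marginal of x
    py = pxy.sum(axis=0, keepdims=True)       # marginal of y
    nz = pxy > 0                              # avoid log(0) on empty cells
    return float((pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])).sum())

rng = np.random.default_rng(0)
cause = rng.normal(size=5000)
target = cause + 0.1 * rng.normal(size=5000)  # strongly driven by the cause
noise = rng.normal(size=5000)                 # unrelated variable

# A variable carrying causal information shares far more MI with the target.
mi_cause = mutual_information(cause, target)
mi_noise = mutual_information(noise, target)
```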
Affiliation(s)
- Keng-Hou Leong
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen 518055, China
- Yuxuan Xiu
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen 518055, China
- Bokui Chen
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
- Peng Cheng Laboratory, Shenzhen 518055, China
- Wai Kin (Victor) Chan
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen 518055, China
- International Science and Technology Information Center, Shenzhen 518055, China
3
Williams BK, Brown ED. Four conservation challenges and a synthesis. Ecol Evol 2023; 13:e10052. PMID: 37153016; PMCID: PMC10154884; DOI: 10.1002/ece3.10052.
Abstract
Conservation and management of biological systems involves decision-making over time, with a generic goal of sustaining systems and their capacity to function in the future. We address four persistent and difficult conservation challenges: (1) prediction of future consequences of management, (2) uncertainty about the system's structure, (3) inability to observe ecological systems fully, and (4) nonstationary system dynamics. We describe these challenges in terms of dynamic systems subject to different sources of uncertainty, and we present a basic Markovian framework that can encompass approaches to all four challenges. Finding optimal conservation strategies for each challenge requires issue-specific structural features, including adaptations of state transition models, uncertainty metrics, valuation of accumulated returns, and solution methods. Strategy valuation exhibits not only some remarkable similarities among approaches but also some important operational differences. Technical linkages among the models highlight synergies in solution approaches, as well as possibilities for combining them in particular conservation problems. As methodology and computing software advance, such an integrated conservation framework offers the potential to improve conservation outcomes with strategies to allocate management resources efficiently and avoid negative consequences.
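The Markovian framework the authors build on can be sketched with textbook value iteration for a fully observable MDP; the conservation states, actions, and all numbers below are illustrative, not from the paper:

```python
import numpy as np

# Hypothetical three-state conservation system ("degraded", "stable",
# "healthy") with two actions ("protect", "do nothing").
T = np.array([  # T[a, s, s']: transition probabilities under action a
    [[0.6, 0.4, 0.0], [0.1, 0.6, 0.3], [0.0, 0.2, 0.8]],  # protect
    [[0.9, 0.1, 0.0], [0.4, 0.5, 0.1], [0.1, 0.4, 0.5]],  # do nothing
])
R = np.array([[0.0, 1.0, 2.0],    # R[a, s]: reward; protecting costs a little,
              [0.5, 1.5, 2.5]])   # but doing nothing degrades the system
gamma = 0.95                      # discount on accumulated returns

V = np.zeros(3)
for _ in range(500):              # Bellman backups until convergence
    Q = R + gamma * T @ V         # Q[a, s] = R[a, s] + gamma * sum_s' T * V[s']
    V = Q.max(axis=0)             # value of acting greedily
policy = Q.argmax(axis=0)         # optimal action in each state
```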
Affiliation(s)
- Eleanor D. Brown
- Science and Decisions Center, U.S. Geological Survey, Reston, Virginia, USA
4
Situation assessment in air combat considering incomplete frame of discernment in the generalized evidence theory. Sci Rep 2022; 12:22639. PMID: 36587044; PMCID: PMC9805455; DOI: 10.1038/s41598-022-27076-z.
Abstract
For situation assessment in air combat, information may be incomplete because of new technologies and unknown or uncertain targets and threats. In this paper, an improved situation-assessment method for the air combat environment, based on the incomplete frame of discernment in evidence theory, is proposed to obtain a more accurate fusion result for decision-making on the battlefield. First, the air combat situation is assessed with knowledge. Then, the incomplete frame of discernment in the generalized evidence theory, an extension of Dempster-Shafer evidence theory, is adopted to model incomplete and unknown situation assessments. After that, the generalized combination rule of the generalized evidence theory is used to fuse situations in intelligent air combat. Finally, real-time decisions on which actions to take can be reached. Experiments on air combat situation assessment with incomplete and uncertain situations show the rationality and effectiveness of the proposed method.
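The generalized combination rule extends classic Dempster combination to incomplete frames (where mass may sit on the empty set). The sketch below implements only the classic rule over a complete frame, as a point of reference; the focal elements and masses are illustrative:

```python
from itertools import product

# Classic Dempster combination over a frame of discernment, with focal
# elements represented as frozensets.  The generalized rule used in the
# paper additionally allows mass on the empty set; this minimal version
# assumes a complete frame.
def dempster_combine(m1, m2):
    """Combine two mass functions {frozenset: mass} by Dempster's rule."""
    combined, conflict = {}, 0.0
    for (a, wa), (b, wb) in product(m1.items(), m2.items()):
        inter = a & b
        if inter:                          # compatible evidence reinforces
            combined[inter] = combined.get(inter, 0.0) + wa * wb
        else:                              # contradictory evidence is conflict
            conflict += wa * wb
    return {s: w / (1.0 - conflict) for s, w in combined.items()}

A = frozenset({"attack"})
B = frozenset({"retreat"})
m1 = {A: 0.8, A | B: 0.2}                  # sensor 1: strong evidence of attack
m2 = {A: 0.6, B: 0.3, A | B: 0.1}          # sensor 2: mixed evidence
fused = dempster_combine(m1, m2)
```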
5
Gu Y, Zhu Z, Lv J, Shi L, Hou Z, Xu S. DM-DQN: Dueling Munchausen deep Q network for robot path planning. Complex Intell Syst 2022. DOI: 10.1007/s40747-022-00948-7.
Abstract
In order to achieve collision-free path planning in a complex environment, the Munchausen deep Q-learning network (M-DQN) is applied to a mobile robot to learn the best decisions. On the basis of Soft-DQN, M-DQN adds the scaled log-policy to the immediate reward, which allows the agent to explore more. However, the M-DQN algorithm converges slowly. This paper proposes a new and improved M-DQN algorithm (DM-DQN) to address the problem. First, the network structure of M-DQN is decomposed into a value function and an advantage function, decoupling action selection from action evaluation; this speeds up convergence, gives better generalization performance, and lets the agent learn the best decisions faster. Second, to keep the robot's trajectory from passing too close to the edges of obstacles, a reward function based on an artificial potential field is proposed to drive the trajectory away from the vicinity of obstacles. Simulation results show that the method learns more efficiently and converges faster than DQN, Dueling DQN, and M-DQN in both static and dynamic environments, and plans collision-free paths that stay clear of obstacles.
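The dueling decomposition described above can be sketched in a few lines; the value and advantage figures below are stand-ins for the outputs of a trained network:

```python
import numpy as np

# Dueling head: the network output is split into a scalar state value V(s)
# and per-action advantages A(s, a), recombined with a mean-advantage
# baseline so the decomposition is identifiable.
def dueling_q(value, advantages):
    """Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""
    return value + advantages - advantages.mean()

v = 1.5                                  # V(s) from the value stream
a = np.array([0.2, -0.1, 0.4, 0.0])      # A(s, a) from the advantage stream
q = dueling_q(v, a)
best_action = int(q.argmax())            # action selection uses only advantages
```

Subtracting the mean advantage pins the average of Q(s, a) to V(s), so the value stream alone carries "how good is this state" while the advantage stream carries "which action is best", which is exactly the decoupling the abstract credits for faster convergence.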
6
Williams BK, Brown ED. Partial observability and management of ecological systems. Ecol Evol 2022; 12:e9197. PMID: 36172296; PMCID: PMC9468910; DOI: 10.1002/ece3.9197.
Abstract
The actual state of ecological systems is rarely known with certainty, but management actions must often be taken regardless of imperfect measurement (partial observability). Because of the difficulties in accounting for partial observability, it is usually treated in an ad hoc fashion, or simply ignored altogether. Yet incorporating partial observability into decision processes lends a realism that has the potential to improve ecological outcomes significantly. We review frameworks for dealing with partial observability, focusing specifically on dynamic ecological systems with Markovian transitions, i.e., transitions among system states that are influenced by the current system state and management action over time. Fully observable states are represented in an observable Markov decision process (MDP), whereas obscure or hidden states are represented in a partially observable process (POMDP). POMDPs can be seen as a natural extension of observable MDPs. Management under partial observability generalizes the situation for complete observability, by recognizing uncertainty about the system's state and incorporating sequential observations associated with, but not the same as, the states themselves. Decisions that otherwise would depend on the actual state must be based instead on state probability distributions ("belief states"). Partial observability requires adaptation of the entire decision process, including the use of belief states and Bayesian updates, valuation that includes expectations over observations, and optimal strategy that identifies actions for belief states over a continuous belief space. We compare MDPs and POMDPs and highlight POMDP applications to some common ecological problems. We clarify the structure and operations, approaches for finding solutions, and analytic challenges of POMDPs for practicing ecologists. Both observable and partially observable MDPs can use an inductive approach to identify optimal strategies and values, with a considerable increase in mathematical complexity with POMDPs. Better understanding of POMDPs can help decision makers manage imperfectly measured ecological systems more effectively.
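The Bayesian belief update at the center of the POMDP machinery described above can be sketched as follows, with an illustrative two-state system:

```python
import numpy as np

# After taking action a and receiving observation o, the belief over hidden
# states becomes  b'(s') ∝ O(o | s', a) * sum_s T(s' | s, a) * b(s).
# The two-state dynamics and observation likelihoods below are illustrative.
def belief_update(b, T_a, O_ao):
    """b: belief vector; T_a[s, s']: transitions under a; O_ao[s']: P(o | s', a)."""
    predicted = b @ T_a                 # predict: push belief through dynamics
    unnormalized = O_ao * predicted     # correct: weight by observation likelihood
    return unnormalized / unnormalized.sum()

b = np.array([0.5, 0.5])                    # uncertain prior over two states
T_a = np.array([[0.9, 0.1], [0.2, 0.8]])    # dynamics under the chosen action
O_ao = np.array([0.7, 0.1])                 # likelihood of the observation seen
b_new = belief_update(b, T_a, O_ao)         # belief shifts toward state 0
```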
Affiliation(s)
- Eleanor D. Brown
- Science and Decisions Center, U.S. Geological Survey, Reston, Virginia, USA
7
Mallela A, Hastings A. Optimal management of stochastic invasion in a metapopulation with Allee effects. J Theor Biol 2022; 549:111221. PMID: 35843441; DOI: 10.1016/j.jtbi.2022.111221.
Abstract
Invasive species account for incalculable damage worldwide, in both ecological and bioeconomic terms. The question of how a network of invasive populations can be optimally managed deserves further exploration. A study accounting for partial observability and imperfect detection, in particular, could yield useful insights into species eradication efforts. Here, we generalized a simple model system that we developed in previous work. The model consists of three interacting populations with underlying strong Allee effects and stochastic dynamics, inhabiting distinct locations connected by dispersal, which can generate bistability. To explore the stochastic dynamics, we formulated an individual-based modeling approach. Next, using the theory of continuous-time Markov chains, we approximated the original high-dimensional model by a Markov chain with eight states, each state corresponding to a combination of population thresholds. We then used the reduced model as the core of a powerful decision-making tool, referred to as a Partially Observable Markov Decision Process (POMDP). Analysis of this POMDP indicates when management of the system yields optimal outcomes.
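The eight-state reduction described above can be sketched as follows; the single-flip transition structure and the rates are illustrative assumptions, not the authors' fitted model:

```python
import numpy as np
from itertools import product

# Three patches, each either below (0) or above (1) its Allee-effect
# threshold, give 2**3 = 8 states for the reduced Markov chain.
states = list(product([0, 1], repeat=3))         # (patch1, patch2, patch3)
index = {s: i for i, s in enumerate(states)}

# Generator matrix Q of a continuous-time Markov chain in which one patch
# crosses its threshold at a time (hypothetical single-flip transitions).
up_rate, down_rate = 0.3, 0.1                    # invasion / die-back rates
Q = np.zeros((8, 8))
for s in states:
    for patch in range(3):
        flipped = list(s)
        flipped[patch] ^= 1                      # toggle one patch's status
        rate = up_rate if s[patch] == 0 else down_rate
        Q[index[s], index[tuple(flipped)]] = rate
np.fill_diagonal(Q, -Q.sum(axis=1))              # generator rows sum to zero
```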
Affiliation(s)
- Abhishek Mallela
- Department of Mathematics, University of California, Davis, CA 95616, USA
- Alan Hastings
- Department of Environmental Science and Policy, University of California, Davis, CA 95616, USA; Santa Fe Institute, Santa Fe, NM 87501, USA
8
Learning Dynamics and Control of a Stochastic System under Limited Sensing Capabilities. Sensors (Basel) 2022; 22:4491. PMID: 35746272; PMCID: PMC9230096; DOI: 10.3390/s22124491.
Abstract
The operation of a variety of natural or man-made systems subject to uncertainty is maintained within a range of safe behavior through run-time sensing of the system state and control actions selected according to some strategy. When the system is observed from an external perspective, the control strategy may not be known; it must instead be reconstructed by jointly observing the applied control actions and the corresponding evolution of the system state. This is largely hindered by limitations in the sensing of the system state and by different levels of noise. We address the problem of optimally selecting control actions for a stochastic system with unknown dynamics operating under a controller with unknown strategy, for which we can observe trajectories made of sequences of control actions and noisy observations of the system state, labeled with the exact values of some reward functions. To this end, we present an approach to train an Input-Output Hidden Markov Model (IO-HMM) as the generative stochastic model that describes the state dynamics of a POMDP, using a novel optimization objective adopted from the literature. The learning task is hindered by two restrictions: the only available data are a limited number of trajectories of applied actions and noisy observations of the system state; and high failure costs prevent interaction with the online environment, ruling out exploratory testing. Traditionally, stochastic generative models have been used to learn the underlying system dynamics and select appropriate actions for the task at hand. However, current state-of-the-art techniques, in which the state dynamics of the POMDP are first learned and strategies are then optimized over them, frequently fail because the model that best fits the data may not be well suited for control. The proposed optimization objective is intended to mitigate this model mis-specification. The methodology is illustrated in a failure-avoidance scenario for a multi-component system. The quality of the decision-making is evaluated using the collected reward on test data and compared against the usual approach in the previous literature.
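The action-conditioned filtering an IO-HMM performs can be sketched as follows; the two-state system, the action-dependent transition matrices, and the emission probabilities are illustrative:

```python
import numpy as np

# In an input-output HMM the hidden-state transition matrix depends on the
# observed control action (the "input").  Two hidden states, two actions,
# two observation symbols; all probabilities are illustrative.
T = {0: np.array([[0.9, 0.1], [0.3, 0.7]]),   # dynamics under action 0
     1: np.array([[0.5, 0.5], [0.1, 0.9]])}   # dynamics under action 1
E = np.array([[0.8, 0.2],                     # E[s, o]: P(observation o
              [0.2, 0.8]])                    #           in hidden state s)

def io_hmm_filter(actions, observations, prior):
    """Forward-filter the belief over hidden states along one trajectory."""
    belief = prior.copy()
    for a, o in zip(actions, observations):
        belief = belief @ T[a]        # transition selected by the input action
        belief = belief * E[:, o]     # weight by the emission likelihood
        belief /= belief.sum()        # renormalize to a distribution
    return belief

belief = io_hmm_filter([0, 1, 1], [0, 0, 1], np.array([0.5, 0.5]))
```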