
Symbolic Stochastic Focused Dynamic Programming with Decision Diagrams.pdf

Published: 2015-09-22 · approx. 23,000 words · 3 pages


Symbolic Stochastic Focused Dynamic Programming with Decision Diagrams

Florent Teichteil-Königsbuch and Patrick Fabiani
ONERA-DCSD
2 Avenue Édouard-Belin
31055 Toulouse, France
(florent.teichteil,patrick.fabiani)@cert.fr

Abstract

We present a stochastic planner based on Markov Decision Processes (MDPs) that participates in the probabilistic planning track of the 2006 International Planning Competition. The planner transforms the PPDDL problems into factored MDPs that are then solved with a structured modified value iteration algorithm based on the safest stochastic path computation from the initial states to the goal states. First, a state subspace is com…

… based on dynamic programming and includes two classes of algorithms: value iteration and policy iteration. The first is an iteration on the value function associated with each state, that is to say the expected accumulated reward if we start from this state. When the iterated value function stabilizes, the optimal value function is reached and the optimal policy follows. In the policy iteration scheme, the current policy is assessed on the infinite horizon and improved locally at each iteration. The value of a policy π is solution …
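
The value iteration scheme described in the preview can be illustrated with a minimal Python sketch. It is not the authors' planner: the paper works on factored MDPs with decision diagrams, whereas this sketch uses a plain tabular MDP. The two-state problem, the names `value_iteration`, `transitions`, and `rewards`, the discount factor `gamma`, and the stopping tolerance `tol` are all assumptions made for illustration.

```python
def value_iteration(states, actions, transitions, rewards, gamma=0.9, tol=1e-8):
    """Tabular value iteration (illustrative sketch, not the paper's symbolic algorithm).

    transitions[s][a] is a list of (probability, next_state) pairs.
    rewards[s][a] is the immediate reward for taking action a in state s.
    """
    values = {s: 0.0 for s in states}

    def q_value(s, a, v):
        # Expected accumulated reward of taking a in s, then following values v.
        return rewards[s][a] + gamma * sum(p * v[s2] for p, s2 in transitions[s][a])

    while True:
        # One backup: each state takes the value of its best action.
        new_values = {s: max(q_value(s, a, values) for a in actions) for s in states}
        delta = max(abs(new_values[s] - values[s]) for s in states)
        values = new_values
        if delta < tol:  # the iterated value function has stabilized
            break

    # The greedy policy follows from the stabilized value function.
    policy = {s: max(actions, key=lambda a: q_value(s, a, values)) for s in states}
    return values, policy


# Hypothetical two-state problem: "go" reaches the goal with probability 0.8.
states = ["s0", "goal"]
actions = ["stay", "go"]
transitions = {
    "s0": {"stay": [(1.0, "s0")], "go": [(0.8, "goal"), (0.2, "s0")]},
    "goal": {"stay": [(1.0, "goal")], "go": [(1.0, "goal")]},
}
rewards = {
    "s0": {"stay": 0.0, "go": 0.0},
    "goal": {"stay": 1.0, "go": 1.0},
}

values, policy = value_iteration(states, actions, transitions, rewards)
print(values, policy)
```

In this toy problem the goal state collects a reward of 1 at every step, so its value converges to approximately 1 / (1 - 0.9) = 10, and the greedy policy chooses "go" in `s0`.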
