文档详情

Symbolic Stochastic Focused Dynamic Programming with Decision Diagrams.pdf

发布:2015-09-22约2.3万字共3页下载文档
文本预览下载声明
Symbolic Stochastic Focused Dynamic Programming with Decision Diagrams ¨ Florent Teichteil-Konigsbuch and Patrick Fabiani ONERA-DCSD ´ 2 Avenue Edouard-Belin 31055 Toulouse, France (florent.teichteil,patrick.fabiani)@cert.fr Abstract based on dynamic programming and includes two classes of algorithms : value iteration and policy iteration. The first is We present a stochastic planner based on Markov De- an iteration on the value function associated with each state, cision Processes (MDPs) that participates to the prob- abilistic planning track of the 2006 International Plan- that is to say the expected accumulated reward if we start ning Competition. The planner transforms the PPDDL from this state. When the iterated value function stabilizes, problems into factored MDPs that are then solved with the optimal value function is reached and the optimal policy a structured modified value iteration algorithm based on follows. In the policy iteration scheme, the current policy is the safest stochastic path computation from the initial assessed on the infinite horizon and improved locally at each states to the goal states. First, a state subspace is com- iteration. The value of a policy π is solution
显示全部
相似文档