Paper title
Verification of indefinite-horizon POMDPs
Paper authors
Paper abstract
The verification problem in MDPs asks whether, for any policy resolving the nondeterminism, the probability that something bad happens is bounded by some given threshold. This verification problem is often overly pessimistic, as the policies it considers may depend on the complete system state. This paper considers the verification problem for partially observable MDPs, in which the policies make their decisions based on (the history of) the observations emitted by the system. We present an abstraction-refinement framework extending previous instantiations of the Lovejoy-approach. Our experiments show that this framework significantly improves the scalability of the approach.