Expected window mean-payoff

Bordais, Benjamin; Guha, Shibashis; Raskin, Jean-François

doi:doi/10.4230/LIPIcs.FSTTCS.2019.32

Citer

Expected window mean-payoff

par Bordais, Benjamin ;Guha, Shibashis

;Raskin, Jean-François

Référence Leibniz international proceedings in informatics, 150, 32
Publication Publié, 2019-12-01

Article révisé par les pairs

Résumé :

We study the expected value of the window mean-payoff measure in Markov decision processes (MDPs) and Markov chains (MCs). The window mean-payoff measure strengthens the classical mean-payoff measure by measuring the mean-payoff over a window of bounded length that slides along an infinite path. This measure ensures better stability properties than the classical mean-payoff. Window mean-payoff has been introduced previously for two-player zero-sum games. As in the case of games, we study several variants of this definition: the measure can be defined to be prefix-independent or not, and for a fixed window length or for a window length that is left parametric. For fixed window length, we provide polynomial time algorithms for the prefix-independent version for both MDPs and MCs. When the length is left parametric, the problem of computing the expected value on MDPs is as hard as computing the mean-payoff value in two-player zero-sum games, a problem for which it is not known if it can be solved in polynomial time. For the prefix-dependent version, surprisingly, the expected window mean-payoff value cannot be computed in polynomial time unless P=PSpace. For the parametric case and the prefix-dependent case, we manage to obtain algorithms with better complexities for MCs.

Référencement	Visibilité	Pérennité	Facilité
Les publications encodées constituent la bibliographie académique de l'Université.	Les documents déposés sont indexés par les moteurs de recherche (Google Scholar,…).	Les documents déposés en open-access sont archivés au sein du réseau de préservation SAFE-PLN (www.safepln.org).	Les listes de publications sont compatibles avec le CV-ULB, le FNRS et accessibles sur le web.

Expected window mean-payoff

Documents en relation

DI-fusion