An Experimental Study of Advice in Sequential Decision-Making under Uncertainty

Florian Benavent; Bruno Zanuttini

Communication Dans Un Congrès Année : 2018

An Experimental Study of Advice in Sequential Decision-Making under Uncertainty

(1) , (1)

Florian Benavent

Fonction : Auteur

Equipe MAD - Laboratoire GREYC - UMR6072

Bruno Zanuttini

Fonction : Auteur
PersonId : 952903

Equipe MAD - Laboratoire GREYC - UMR6072

Résumé

We consider sequential decision making problems under uncertainty , in which a user has a general idea of the task to achieve, and gives advice to an agent in charge of computing an optimal policy. Many different notions of advice have been proposed in somewhat different settings, especially in the field of inverse reinforcement learning and for resolution of Markov Decision Problems with Imprecise Rewards. Two key questions are whether the advice required by a specific method is natural for the user to give, and how much advice is needed for the agent to compute a good policy, as evaluated by the user. We give a unified view of a number of proposals made in the literature, and propose a new notion of advice, which corresponds to a user telling why she would take a given action in a given state. For all these notions, we discuss their naturalness for a user and the integration of advice. We then report on an experimental study of the amount of advice needed for the agent to compute a good policy. Our study shows in particular that continual interaction between the user and the agent is worthwhile, and sheds light on the pros and cons of each type of advice.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

Florian-Benavent-pour-HAL.pdf (454.82 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Bruno Zanuttini : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01646200

Soumis le : mercredi 6 décembre 2017-00:21:26

Dernière modification le : mercredi 20 mars 2024-16:20:04

Dates et versions

hal-01646200 , version 1 (06-12-2017)

Identifiants

HAL Id : hal-01646200 , version 1

Citer

Florian Benavent, Bruno Zanuttini. An Experimental Study of Advice in Sequential Decision-Making under Uncertainty. 32nd AAAI Conference on Artificial Intelligence, Feb 2018, New Orleans, United States. ⟨hal-01646200⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS GREYC GREYC-MAD COMUE-NORMANDIE ENSICAEN UNICAEN

232 Consultations

295 Téléchargements

An Experimental Study of Advice in Sequential Decision-Making under Uncertainty

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager