A Unified Decision-Theoretic Model for Information Gathering and Communication Planning

Renoux, Jennifer; Veiga, Tiago Santos; Lima, Pedro; Spaan, Matthijs

Renoux, Jennifer; Veiga, Tiago Santos; Lima, Pedro; Spaan, Matthijs

Chapter

Accepted version

Åpne

Renoux (559.5Kb)

Permanent lenke

https://hdl.handle.net/11250/2783703

Utgivelsesdato

2020

Sammendrag

We consider the problem of communication planning for human-machine cooperation in stochastic and partially observable environments. Partially Observable Markov Decision Processes with Information Rewards (POMDPs-IR) form a powerful framework for information-gathering tasks in such environments. We propose an extension of the POMDP-IR model, called a Communicating POMDP-IR (com-POMDP-IR), that allows an agent to proactively plan its communication actions by using an approximation of the human's beliefs. We experimentally demonstrate the capability of our com-POMDPIR agent to limit its communication to relevant information and its robustness to lost messages.

Utgiver

Institute of Electrical and Electronics Engineers (IEEE)

Opphavsrett

© IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.