Abstract: This paper considers the problem of providing advice to an autonomous agent when neither the behavioural policy nor the goals of that agent are known to the advisor. We present an approach based on building a model of “commonsense” behaviour in the domain by aggregating the behaviour of different users performing various tasks (modelled as MDPs) in the same domain. From this model, we estimate the normalcy of the trajectory followed by a new agent in the domain, and provide behavioural advice based on an approximation of the trade-off in utility between the potential benefits to the exploring agent and the costs incurred in giving this advice. This model is evaluated on a maze world domain by providing advice to different types of agents, and we show that this leads to a considerable improvement in the task completion rate across all agent types.
Citation: B. Rosman, S. Ramamoorthy (2014). Giving advice to agents with hidden goals. In Proc. IEEE International Conference on Robotics and Automation (ICRA), 2014.