行业报告详情 - 行业报告数据库

行业分类

找到报告 1 篇当前为第 1 页共 1 页

教练机器人：人主观反馈的上网学习行为

Coaching Robots: Online Behavior Learning from Human Subjective Feedback

作者：Masakazu HirkoawaKenji Suzuki 加工时间：2014-09-27 信息来源：科技报告（Other）

关键词：机器人；手臂系统；模拟
摘要：This chapter describes a novel methodology for behavior learning of an agent, called Coaching. The proposed method is an interactive and iterative learning method which allows a human trainer to give a subjective evaluation to the robotic agent in real time, and the agent can update the reward function dynamically based on this evaluation simultaneously. We demonstrated that the agent is capable of learning the desired behavior by receiving simple and subjective instructions such as positive and negative. The proposed approach is also effective when it is difficult to determine a suitable reward function for the learning situation in advance. We have conducted several experiments with a simulated and a real robot arm system, and the advantage of the proposed method is verified throughout those experiments.

行业分类

友情链接

联系我们

QQ咨询

电话咨询

微信公众号

感谢访问