Coordinated Behavior for Sequential Task using Two-Stage Reward Assignment in Multi-agent Systems
Recently, multi-agent deep reinforcement learning (MADRL) has been studied as a way to learn actions that accomplish complicated tasks and to generate coordination structures among agents. Reward assignment in MADRL is a crucial factor in guiding agents toward both the behaviors needed for their own tasks and coordinated behaviors. However, the effect of reward assignment on the learned coordinated behavior has not been sufficiently clarified. To address this issue, we use the coordinated delivery and execution problem with sequential tasks to analyze the effect of varying the ratio of the reward given to an agent for the task it is responsible for.