Sökning: "Siavash Paidar"
Hittade 1 uppsats innehållade orden Siavash Paidar.
1. Elaborate Operational Requirements to Address Reward Hacking in Reinforcement Learning Agents
Kandidat-uppsats, Göteborgs universitet/Institutionen för data- och informationsteknikSammanfattning : Autonomous agents, in recent times have been used to address several problems, but these agents in their course of achieving their task also emit side effects to the environment in which they operate. Paramount of these side effects is reward hacking. In this report, we try to address reward hacking using elaborate operational requirements. LÄS MER
Resultatsidor:
1