Sökning: "A3C lambda"

Hittade 1 uppsats innehållade orden A3C lambda.

  1. 1. Asynchronous Advantage Actor-Critic and Flappy Bird

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Marcus Wibrink; Markus Fredriksson; [2021]
    Nyckelord :reinforcement learning; A3C; entropy; A3C lambda ; Cart-Pole; Flappy Bird; sparse rewards;

    Sammanfattning : Games provide ideal environments for assessingreinforcement learning algorithms because of their simple dynamicsand their inexpensive testing, compared to real-worldenvironments. Asynchronous Advantage Actor-Critic (A3C), developedby DeepMind, has shown significant improvements inperformance over other state-of-the-art algorithms on Atarigames. LÄS MER