Sökning: "Cart-Pole"

Hittade 5 uppsatser innehållade ordet Cart-Pole.

  1. 1. Risk-Sensitive Decision-Making for Autonomous-Driving

    Master-uppsats, Uppsala universitet/Institutionen för informationsteknologi

    Författare :Hardy Hasan; [2022]
    Nyckelord :;

    Sammanfattning : A natural aspect of the real world is that one can face uncertain situations on a daily basis. Depending on one's experience, we humans behave and respond differently to uncertainty. However, when designing intelligent agents, one needs to pay attention to the uncertainty inlearning tasks to design risk-sensitive algorithms. LÄS MER

  2. 2. Asynchronous Advantage Actor-Critic and Flappy Bird

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Marcus Wibrink; Markus Fredriksson; [2021]
    Nyckelord :reinforcement learning; A3C; entropy; A3C lambda ; Cart-Pole; Flappy Bird; sparse rewards;

    Sammanfattning : Games provide ideal environments for assessingreinforcement learning algorithms because of their simple dynamicsand their inexpensive testing, compared to real-worldenvironments. Asynchronous Advantage Actor-Critic (A3C), developedby DeepMind, has shown significant improvements inperformance over other state-of-the-art algorithms on Atarigames. LÄS MER

  3. 3. Deep Reinforcement Learning in Cart Pole and Pong

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Dennis Kuurne Uussilta; Viktor Olsson; [2020]
    Nyckelord :Artificial Intelligence; Machine Learning; Rein-forcement Learning; Deep Q-learning Network; CartPole; Pong;

    Sammanfattning : In this project, we aim to reproduce previous resultsachieved with Deep Reinforcement Learning. We present theMarkov Decision Process model as well as the algorithms Q-learning and Deep Q-learning Network (DQN). We implement aDQN agent, first in an environment called CartPole, and later inthe game Pong. LÄS MER

  4. 4. Extracting Behaviour Trees from Deep Q-Networks : Using learning from demostration to transfer knowledge between models.

    Master-uppsats, Linköpings universitet/Artificiell intelligens och integrerade datorsystem

    Författare :Zacharias Nordström; [2020]
    Nyckelord :Behaviour tree; deep q-network; extraction;

    Sammanfattning : In recent years the advancement in machine learning have solved more and more complex problems. But still these techniques are not commonly used in the industry. One problem is that many of the techniques are black boxes, it is hard to analyse them to make sure that their behaviour is safe. LÄS MER

  5. 5. Federated Averaging Deep Q-NetworkA Distributed Deep Reinforcement Learning Algorithm

    Master-uppsats, Umeå universitet/Institutionen för datavetenskap

    Författare :Sebastian Backstad; [2018]
    Nyckelord :;

    Sammanfattning : In the telecom sector, there is a huge amount of rich data generated every day. This trend will increase with the launch of 5G networks. Telco companies are interested in analyzing their data to shape and improve their core businesses. LÄS MER