Sökning: "Q-learning"

Visar resultat 1 - 5 av 55 uppsatser innehållade ordet Q-learning.

  1. 1. MIXED MEMORY Q-LEARNER An adaptive reinforcement learning algorithm for the Iterated Prisoner’s Dilemma

    Kandidat-uppsats, Institutionen för tillämpad informationsteknologi

    Författare :Anna Dollbo; [2021-09-21]
    Nyckelord :Machine learning; reinforcement learning; game theory; iterated prisoner’s dilemma; state representation; Q-learning;

    Sammanfattning : The success of future societies is likely to depend on cooperative interactionsbetween humans and artificial agents. As such, it is important to investigate howmachines can learn to cooperate. LÄS MER

  2. 2. Application of Deep Q-learning for Vision Control on Atari Environments

    Master-uppsats, Lunds universitet/Beräkningsbiologi och biologisk fysik

    Författare :Jim Öhman; [2021]
    Nyckelord :Reinforcement learning; Atari 2600; Deep Q-learning; Myopic Agents; Vision Control; Physics and Astronomy;

    Sammanfattning : The success of Reinforcement Learning (RL) has mostly been in artificial domains, with only some successful real-world applications. One of the reasons being that most real-world domains fail to satisfy a set of assumptions of RL theory. LÄS MER

  3. 3. Implementing an OpenAI Gym for Machine Learning of Microgrid Electricity Trading

    Kandidat-uppsats, Mittuniversitetet/Institutionen för informationssystem och –teknologi

    Författare :André Lundholm; [2021]
    Nyckelord :Q-learning; Microgrids; Python; Neural networks; OpenAI Gym; Q-learning; Mikronät; Python; Neurala nätverk; OpenAI Gym;

    Sammanfattning : Samhället går idag bort från centraliserad energi mot decentraliserade system. Istället för att köpa från stora företag som skapar el från fossila bränslen har många förnybara alternativ kommit. Eftersom konsumenter kan generera solenergi med solpaneler kan de också bli producenter. LÄS MER

  4. 4. A Comparison Between Deep Q-learning and Deep Deterministic Policy Gradient for an Autonomous Drone in a Simulated Environment

    Kandidat-uppsats, Mälardalens högskola/Akademin för innovation, design och teknik

    Författare :Dennis Tagesson; [2021]
    Nyckelord :;

    Sammanfattning : This thesis investigates how the performance between Deep Q-Network (DQN) with a continuous and discrete state- and action space, respectively, and Deep Deterministic Policy Gradient (DDPG) with a continuous state- and action space compare when trained in an environment with a continuous state- and action space. The environment was a simulation where the task for the algorithms was to control a drone from the start position to the location of the goal. LÄS MER

  5. 5. Not All Goals Are Created Equal : Evaluating Hockey Players in the NHL Using Q-Learning with a Contextual Reward Function

    Master-uppsats, Linköpings universitet/Databas och informationsteknik

    Författare :Jon Vik; [2021]
    Nyckelord :Sports Analytics; Markov Game; Machine Learning; Reinforcement Learning; Q-Learning; Data Mining; National Hockey League; Ice Hockey; Reward Function; Player Evaluation;

    Sammanfattning : Not all goals in the game of ice hockey are created equal: some goals increase the chances of winning more than others. This thesis investigates the result of constructing and using a reward function that takes this fact into consideration, instead of the common binary reward function. LÄS MER