Sökning: "Markov decision processes"

Visar resultat 1 - 5 av 12 uppsatser innehållade orden Markov decision processes.

  1. 1. A Bandit Approach to Indirect Inference

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Erik Ildring; Felix Steinberger Eriksson; [2023]
    Nyckelord :;

    Sammanfattning : We present a novel approach to the family of parameter estimation methods known asindirect inference (II), using results from bandit optimization, a sub-field of reinforcementlearning concerned with stateless Markov decision processes (MDPs). First, we present theproblem of indirect inference and show how it may be cast into the general framework ofMDPs. LÄS MER

  2. 2. Deep Reinforcement Learning and Simulation for the Optimization of Production Systems

    Master-uppsats, Uppsala universitet/Institutionen för informationsteknologi

    Författare :Siyuan Chen; [2022]
    Nyckelord :;

    Sammanfattning : The main objective of this master thesis project is to use the deep reinforcement learning (DRL) and simulation method for optimization of production systems. In this project, the Deep Q-learning Networks (DQN) algorithm is first used to optimize seven decision variables in Averill Law’s production system to find the best profit, with 99. LÄS MER

  3. 3. Autonomous UAV Path Planning using RSS signals in Search and Rescue Operations

    Master-uppsats, Linköpings universitet/Reglerteknik

    Författare :Axel Anhammer; Hugo Lundeberg; [2022]
    Nyckelord :UAV; DQN; Deep Q Network; particle filter; point mass filter; MDP; POMDP; Markov decision process; partially observable Markov decision process;

    Sammanfattning : Unmanned aerial vehicles (UAVs) have emerged as a promising technology in search and rescue operations (SAR). UAVs have the ability to provide more timely localization, thus decreasing the crucial duration of SAR operations. LÄS MER

  4. 4. Policy-based Reinforcement learning control for window opening and closing in an office building

    Master-uppsats, Högskolan Dalarna/Mikrodataanalys

    Författare :Gokul Kaisaravalli Bhojraj; Yeswanth Surya Achyut Markonda; [2020]
    Nyckelord :Markov decision processes; Policy-based Reinforcement learning; Value-based Reinforcement learning; Q-learning; REINFORCE; policy gradient; window control; indoor comfort level;

    Sammanfattning : The level of indoor comfort can highly be influenced by window opening and closing behavior of the occupant in an office building. It will not only affect the comfort level but also affects the energy consumption, if not properly managed. This occupant behavior is not easy to predict and control in conventional way. LÄS MER

  5. 5. Constructing a Context-aware Recommender System with Web Sessions

    H-uppsats,

    Författare :Albin Bramstång; Yanling Jin; [2019-07-03]
    Nyckelord :Informations- och kommunikationsteknik; Data- och informationsvetenskap; Information Communication Technology; Computer and Information Science;

    Sammanfattning : During the last decade, the importance of recommender systems has been increasing to the point that the success of many well-known service providers depends on these technologies. Recommender systems can assist people in their decision making process by anticipating preferences. LÄS MER