Sökning: "sample-efficient reinforcement learning"

Hittade 3 uppsatser innehållade orden sample-efficient reinforcement learning.

  1. 1. Improving sample-efficiency of model-free reinforcement learning algorithms on image inputs with representation learning

    Master-uppsats, Göteborgs universitet/Institutionen för data- och informationsteknik

    Författare :Marko Guberina; Betelhem Dejene Desta; [2022-10-14]
    Nyckelord :sample-efficient reinforcement learning; state representation learning; unsupervised learning; autoencoder;

    Sammanfattning : Reinforcement learning struggles to solve control tasks on directly on images. Performance on identical tasks with access to the underlying states is much better. LÄS MER

  2. 2. Model-based Residual Policy Learning for Sample Efficient Mobile Network Optimization

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Viktor Eriksson Möllerstedt; [2022]
    Nyckelord :Reinforcement Learning; Sample Efficiency; Model-based; Expert Policy; Remote Electrical Tilt; Telecommunication; Förstärkande inlärning; dataeffektivitet; modell-baserad; expert-policy; fjärrstyrning av antenners nedåtlutning; telekommunikation;

    Sammanfattning : Reinforcement learning is a powerful tool which enables an agent to learn how to control complex systems. However, during the early phases of training, the performance is often poor. LÄS MER

  3. 3. Model Based Reinforcement Learning for Automatic Tuning of Cavity Filters

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Doumitrou Daniil Nimara; [2021]
    Nyckelord :Reinforcement Learning; Cavity Filter Tuning; Sample Complexity; Background Planning; Förstärkande inlärning; inställning av kavitetsfilter; provkomplexitet; bakgrundsplanering;

    Sammanfattning : As telecommunication continues developing, the demand for mass production of well calibrated Base Transceiver Stations (BTS) components increases. Cavity Filters are an essential piece of every BTS; however, manufacturing tolerances often lead to detuned filters which require costly post-production fine tuning. LÄS MER