Sökning: "Behavior Cloning"

Visar resultat 1 - 5 av 6 uppsatser innehållade orden Behavior Cloning.

  1. 1. Deep Reinforcement Learning Applied to an Image-Based Sensor Control Task

    Master-uppsats, Linköpings universitet/Informationskodning

    Författare :Rickard Eriksson; [2021]
    Nyckelord :Reinforcement Learning; Deep Learning; Proximal Policy Optimization; PPO; Sensor Control;

    Sammanfattning : An intelligent sensor system has the potential of providing its operator with relevant information, lowering the risk of human errors, and easing the operator's workload. One way of creating such a system is by using reinforcement learning, and this thesis studies how reinforcement learning can be applied to a simple sensor control task within a detailed 3D rendered environment. LÄS MER

  2. 2. On the Efficiency of Transfer Learning in a Fighter Pilot Behavior Modelling Context

    Master-uppsats, KTH/Matematik (Inst.)

    Författare :Viktor Sandström; [2021]
    Nyckelord :Imitation Learning; Transfer Learning; Applied Mathematics; Behavior Cloning; DAgger; FOI; Fighter Pilot; Mathematics; Deep Learning; Machine Learning; Imitationsinlärning; Överföringsinlärning; Maskininlärning; Tillämpad Matematik;

    Sammanfattning : Creating realistic models of human fighter pilot behavior is made possible with recent deep learning techniques. However, these techniques are often highly dependent on large datasets, often unavailable in many settings, or expensive to produce. LÄS MER

  3. 3. Imitation Learning using Reward-Guided DAgger

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Nora Al-Naami; [2020]
    Nyckelord :;

    Sammanfattning : End-to-end autonomous driving can be approached by finding a policy function that maps observation (e.g. driving view of the road) to driving action. This is done by imitating an expert driver. LÄS MER

  4. 4. Continual imitation learning: Enhancing safe data set aggregation with elastic weight consolidation

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Andreas Elers; [2019]
    Nyckelord :Elasticweight consolidation; SafeDAGGER; DAGGER; Rehearsal buffer; Self-driving vehicle; Continual learning; Elastisk viktkonsolidering; SafeDAGGER; DAGGER; Repeteringsbuffert; Självkörande fordon; Stegvis inlärning;

    Sammanfattning : The field of machine learning currently draws massive attention due to ad- vancements and successful applications announced in the last few years. One of these applications is self-driving vehicles. A machine learning model can learn to drive through behavior cloning. Behavior cloning uses an expert’s behavioral traces as training data. LÄS MER

  5. 5. Reinforcement Learning for Dexterity Transfer Between Manipulators

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Carlo Rapisarda; [2019]
    Nyckelord :;

    Sammanfattning : Learning complex manipulation skills with robotic arms is a challenging problem in Reinforcement Learning. Training policies from scratch is often timeconsuming and normally infeasible when using real robots. LÄS MER