Search: "markov decision process"
Showing results 1 - 5 of 18 theses containing the words markov decision process.
- Master's thesis, Linköpings universitet/Statistics and Machine Learning
Abstract: In the US, breast cancer is one of the most common forms of cancer and the most lethal. There are many decisions that must be made by the doctor and/or the patient when dealing with a potential breast cancer.
- Master's thesis, Luleå tekniska universitet/Computer Science
Abstract: In this work, we develop the transfer learning (TL) of reinforcement learning (RL) for the robotic skill of throwing a ball into a basket, from a computer-simulated environment to a real-world implementation. Whereas learning of the same skill has been previously explored by using a Programming by Demonstration approach directly on the real-world robot, for our work, the model-based RL algorithm PILCO was employed as an alternative as it provides the robot with no previous knowledge or hints, i. …
- Bachelor's thesis, Lunds universitet/Mathematics LTH
Abstract: Real-time bidding is getting increasingly popular for buying and selling online display advertisement. This has spurred a research interest into how to design optimal bidding algorithms, with advances during the last two to three years focusing heavily on reinforcement learning.
- Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)
Abstract: This degree project, conducted at ABB, aims to analyze and solve different situations that a crew on board a vessel might face by controlling its propulsion system. The propulsion system is viewed as static, transition-deterministic, as well as stochastic when measuring data.
- Bachelor's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)
Abstract: We consider the problem of automatic control strategy synthesis for discrete models of robotic systems, where the goal is to travel from some region to another while obeying a given set of safety rules in an environment with uncertain properties. This is a probabilistic extension of the work by Jana Tumová et al.