Search: "Martin Christiansson"
Found 1 thesis containing the words Martin Christiansson.
1. Reinforcement Learning – Intelligent Weighting of Monte Carlo and Temporal Differences
Thesis for a professional degree at advanced level, Lund University/Department of Automatic Control
Abstract: In reinforcement learning, the updating of the value functions determines the information spreading across the state/state-action space, which condenses the value-based control policy. It is important to have information propagate across the value domain in an effective manner.