Sökning: "policy gradient"
Visar resultat 6 - 10 av 45 uppsatser innehållade orden policy gradient.
6. Effects of fuel and weather conditions on forest fire behaviour in Southern Sweden in oak dominated forests
Master-uppsats, SLU/Southern Swedish Forest Research CentreSammanfattning : Fire activity is influenced by weather and climate, fuels, ignition agents, and human activities. Fire suppression policy in Sweden resulted in a gradual decrease in the annually burnt areas, changes in fuel loads and a decline in species diversity. LÄS MER
7. Reinforcement Learning for the Optimization of Explicit Runge-Kutta Method Parameters
Kandidat-uppsats, Lunds universitet/Matematik LTH; Lunds universitet/Matematik (naturvetenskapliga fakulteten); Lunds universitet/MatematikcentrumSammanfattning : Reinforcement learning is one of the three main paradigms in machine learning, which is increasingly used as a method to approach scientific problems. In this thesis, we introduce and use reinforcement learning to find the optimal parameters of a numerical solver. LÄS MER
8. Predicting Workforce in Healthcare : Using Machine Learning Algorithms, Statistical Methods and Swedish Healthcare Data
Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : Denna studie undersöker användningen av maskininlärningsmodeller för att predicera arbetskraftstrender inom hälso- och sjukvården i Sverige. Med hjälp av en linjär regressionmodell, en Gradient Boosting Regressor-modell och en Exponential Smoothing-modell syftar forskningen för detta arbete till att ge viktiga insikter för underlaget till makroekonomiska överväganden och att ge en djupare förståelse av Beveridge-kurvan i ett sammanhang relaterat till hälso- och sjukvårdssektorn. LÄS MER
9. Fine-tuning Bot Play Styles From Demonstration
Master-uppsats, Uppsala universitet/Institutionen för informationsteknologiSammanfattning : In recent years, Reinforcement Learning (RL) has successfully been used to train agents for games. Nonetheless, in the game industry there is still a necessity for bots not only to succeed in the environments but also to act human-like while playing the game. LÄS MER
10. Reinforcement Learning for Hydrobatic AUVs
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : This master thesis focuses on developing a Reinforcement Learning (RL) controller to perform hydrobatic maneuvers on an Autonomous Underwater Vehicle (AUV) successfully. This work also aims to analyze the robustness of the RL controller, as well as provide a comparison between RL algorithms and Proportional Integral Derivative (PID) control. LÄS MER