Search: "policy gradient"

Showing results 6-10 of 45 theses containing the words policy gradient.

  6. Effects of fuel and weather conditions on forest fire behaviour in Southern Sweden in oak dominated forests

    Master's thesis, SLU/Southern Swedish Forest Research Centre

    Author: Olga Teresa Wepryk; [2023]
    Keywords: forest fire; forest fuel; fire weather; ignition experiments; prescribed burning;

    Abstract: Fire activity is influenced by weather and climate, fuels, ignition agents, and human activities. Fire suppression policy in Sweden resulted in a gradual decrease in the annually burnt areas, changes in fuel loads and a decline in species diversity.

  7. Reinforcement Learning for the Optimization of Explicit Runge-Kutta Method Parameters

    Bachelor's thesis, Lunds universitet/Matematik LTH; Lunds universitet/Matematik (naturvetenskapliga fakulteten); Lunds universitet/Matematikcentrum

    Author: Mélanie Fournier; [2023]
    Keywords: reinforcement learning; numerical analysis; Runge-Kutta; policy gradient; REINFORCE; Mathematics and Statistics;

    Abstract: Reinforcement learning is one of the three main paradigms in machine learning, which is increasingly used as a method to approach scientific problems. In this thesis, we introduce and use reinforcement learning to find the optimal parameters of a numerical solver.

  8. Predicting Workforce in Healthcare: Using Machine Learning Algorithms, Statistical Methods and Swedish Healthcare Data

    Bachelor's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Authors: Gabriel Diskay; Carl Joelsson; [2023]
    Keywords: Machine Learning (ML); Linear Regression Model (LRM); Gradient Boosting Regressor (GBR); Exponential Smoothing Model (ESM); Workforce Prediction (WP); Healthcare Sector (HS); Labor Policy (LP); Beveridge Curve (BC); Economic Forecasting (EF); Recursive Feature Elimination (RFE); Human Resource Management (HRM);

    Abstract: This study examines the use of machine learning models to predict workforce trends in healthcare in Sweden. Using a linear regression model, a Gradient Boosting Regressor model, and an Exponential Smoothing model, the research aims to provide important insights to inform macroeconomic considerations and to give a deeper understanding of the Beveridge curve in the context of the healthcare sector.

  9. Fine-tuning Bot Play Styles From Demonstration

    Master's thesis, Uppsala universitet/Institutionen för informationsteknologi

    Author: Felicia Fredriksson; [2023]

    Abstract: In recent years, Reinforcement Learning (RL) has successfully been used to train agents for games. Nonetheless, in the game industry there is still a necessity for bots not only to succeed in the environments but also to act human-like while playing the game.

  10. Reinforcement Learning for Hydrobatic AUVs

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Grzegorz Woźniak; [2022]
    Keywords: Deep Reinforcement learning; Deep learning; Optimal control; Hydrobatics;

    Abstract: This master thesis focuses on developing a Reinforcement Learning (RL) controller to perform hydrobatic maneuvers on an Autonomous Underwater Vehicle (AUV) successfully. This work also aims to analyze the robustness of the RL controller, as well as provide a comparison between RL algorithms and Proportional Integral Derivative (PID) control.