Sökning: "Stochastic Agents"

Visar resultat 1 - 5 av 15 uppsatser innehållade orden Stochastic Agents.

  1. 1. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space

    Master-uppsats, KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Andreu Matoses Gimenez; [2023]
    Nyckelord :Control theory; Multi-agent systems; Distributed systems; Formation control; Collision avoidance; Reinforcement learning; Teoria de control; Sistemes multiagent; Sistemes distribuïts; Control de formació; Prevenció de col·lisions; Reinforcement Learning; Reglerteknik; Multi-agent system; Distribuerade system; formationskontroll; Kollisionsundvikande; Reinforcement learning; Teoría de control; Sistemas multiagente; Sistemas distribuidos; Control de formación; Prevención de colisiones; Reinforcement Learning;

    Sammanfattning : In the last decades, significant theoretical advances have been made on the field of distributed mulit-agent control theory. One of the most common systems that can be modelled as multi-agent systems are the so called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation. LÄS MER

  2. 2. Interaction Aware Decision Making for Automated Vehicles Based on Reinforcement Learning

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Ning Wang; [2022]
    Nyckelord :Reinforcement Learning; Automated Vehicles; Decision-making; Rollout; Driver Behaviour Modeling; Trajectory Prediction; Förstärkningsinlärning; automatiserade fordon; beslutsfattande; utrullning; modellering av förarbeteende; banförutsägelse;

    Sammanfattning : Decision-making is one of the key challenges blocking full autonomy of automated vehicles. In highway scenarios, automated vehicles are expected to be aware of their surroundings and make decisions by interacting with other road participants to drive safely and efficiently. LÄS MER

  3. 3. Do we have a feasible case for an economy-wide UBI policy that is a Pareto Improvement over the status-quo?

    D-uppsats, Handelshögskolan i Stockholm/Institutionen för finansiell ekonomi; Handelshögskolan i Stockholm/Institutionen för nationalekonomi

    Författare :Pratanu Mitra; Mattias Windahl; [2021]
    Nyckelord :Universal Basic Income; Social Welfare; Financial Inter-mediation; Household Finance; Normative implications of UBI;

    Sammanfattning : The question, merits and normative underpinnings of a Universal Basic Income policy have a long-standing genealogy in the various schools of thought that straddle economic reasoning. The demand for an exercise in dynamic general equilibrium macroeconomics, with microeconomic foundations, has been expressed by Ghatak and Maniquet (2019), Banerjee et al. LÄS MER

  4. 4. Domain independent enhancements to Monte Carlo tree search for eurogames

    Kandidat-uppsats, Mittuniversitetet/Institutionen för data- och systemvetenskap

    Författare :Peter Bergh; [2020]
    Nyckelord :Monte Carlo tree search · Domain independence · Stochasticity · Eurogames · Carcassonne;

    Sammanfattning : The Monte Carlo tree search-algorithm (MCTS) has been proven successful when applied to combinatorial games, a term applied to sequential games with perfect information. As the focus for MCTS has tended to lean towards combinatorial games, general MCTS-strategies for other types of board games are hard to find. LÄS MER

  5. 5. Stochastic Model Identification and Model Metrics with Deep Learning Applications

    Master-uppsats, Mälardalens högskola/Akademin för utbildning, kultur och kommunikation

    Författare :Berk Alp Yilmaz; [2020]
    Nyckelord :Event study; Abnormal Returns; ARIMA; Kalman Filter; Deep Learning;

    Sammanfattning : Each economic event has a dynamic effect on the market due to decision complexity for agents valuating rare events. The more complex is the event, the more it takes for the markets to incorporate the new information. The goal of this thesis is to explore the dynamic effects of economic events on equity returns and statistical indicators. LÄS MER