Search: "Reward Function"
Showing results 6 - 10 of 85 theses containing the words Reward Function.
6. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space
Master's thesis, KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)
Abstract: In recent decades, significant theoretical advances have been made in the field of distributed multi-agent control theory. Among the most common problems that can be modelled as multi-agent systems are the so-called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation.
7. Random Edge is not faster than Random Facet on Linear Programs
Master's thesis, KTH/Matematik (Avd.)
Abstract: A Linear Program is a problem where the goal is to maximize a linear function subject to a set of linear inequalities. Geometrically, this can be rephrased as finding the highest point on a polyhedron. The Simplex method is a commonly used algorithm to solve Linear Programs.
8. Multi-Agent Information Gathering Using Stackelberg Games
Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Abstract: Multi-agent information gathering (MA-IG) enables autonomous robots to cooperatively collect information in an unfamiliar area. In some scenarios, the focus is on recovering the true map of a physical quantity such as temperature or magnetic field.
9. The influence of favouritism as non financial incentives on employee performance
Master's thesis (one-year), Umeå universitet/Handelshögskolan vid Umeå universitet (USBE)
Abstract: In the business sector, favouritism is a frequent and typically disapproved behaviour. However, when used as a reward for excellent employee performance, favouritism can incentivize increased employee productivity and performance.
10. Intelligent autoscaling in Kubernetes : the impact of container performance indicators in model-free DRL methods
Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Abstract: A key challenge in the field of cloud computing is to automatically scale software containers in a way that accurately matches the demand for the services they run. To manage such components, container orchestrator tools such as Kubernetes are employed, and in the past few years, researchers have attempted to optimise its autoscaling mechanism with different approaches.