Sökning: "Förstärkande inlärning"

Visar resultat 6 - 10 av 35 uppsatser innehållade orden Förstärkande inlärning.

6. Improving Co-existence of URLLC and Distributed AI using RL
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Författare :Wei Shi; [2023]
Nyckelord :5G; URLLC; RL; HRL; Optimization; 5G; URLLC; RL; HRL; Optimering;

Sammanfattning : In 5G, Ultra-reliable and low-Latency communications (URLLC) service is envisioned to enable use cases with strict reliability and latency requirements on wireless communication. For the upcoming 6G network, machine learning (ML) also stands an important role that introduces intelligence and further enhances the system performance. LÄS MER
7. Playstyle Generation with Multimodal Generative Adversarial Imitation Learning : Style-reward from Human Demonstration for Playtesting Agents
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Författare :William Ahlberg; [2023]
Nyckelord :Imitation Learning; Reinforcement Learning; Game-testing; Imitationsinlärning; Förstärkande inlärning; Speltestning;

Sammanfattning : Playtesting plays a crucial role in video game production. The presence of gameplay issues and faulty design choices can be of great detriment to the overall player experience. LÄS MER
8. Fine-tuning a LLM using Reinforcement Learning from Human Feedback for a Therapy Chatbot Application
Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Författare :Desirée Bill; Theodor Eriksson; [2023]
Nyckelord :Ethics; Fine-tuning; Large Language Models; Machine learning; Psychology; Reinforcement Learning from Human Feedback;

Sammanfattning : The field of AI and machine learning has seen exponential growth in the last decade and even more so in the recent year with the considerable public interest in Large Language models (LLMs) such as chat-GPT. LLMs can be used for several purposes, but one possible application would be fine-tuning a model to perform a particular function in a specific field. LÄS MER
9. Optimal Path Planning for Aerial Swarm in Area Exploration
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Författare :Johanna Norén; [2022]
Nyckelord :Optimization; Path planning; Dynamic programming; Area exploration; Aerial swarm; Multi-agent system; Optimering; Ruttplanering; Dynamisk programmering; Områdesutforskning; Drönarsvärm; Fler-agentsfall;

Sammanfattning : This thesis presents an approach to solve an optimal path planning problem for a swarm of drones. We optimize and improve information retrieval in area exploration within applications such a ‘Search and Rescue’-missions or reconnaissance missions. For this, dynamic programming has been used as a solving approach for a optimization problem. LÄS MER
10. Model-based Residual Policy Learning for Sample Efficient Mobile Network Optimization
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Författare :Viktor Eriksson Möllerstedt; [2022]
Nyckelord :Reinforcement Learning; Sample Efficiency; Model-based; Expert Policy; Remote Electrical Tilt; Telecommunication; Förstärkande inlärning; dataeffektivitet; modell-baserad; expert-policy; fjärrstyrning av antenners nedåtlutning; telekommunikation;

Sammanfattning : Reinforcement learning is a powerful tool which enables an agent to learn how to control complex systems. However, during the early phases of training, the performance is often poor. LÄS MER

Tidigare 1 2 3 4 5 6 Nästa

Sökning: "Förstärkande inlärning"

6. Improving Co-existence of URLLC and Distributed AI using RL

7. Playstyle Generation with Multimodal Generative Adversarial Imitation Learning : Style-reward from Human Demonstration for Playtesting Agents

8. Fine-tuning a LLM using Reinforcement Learning from Human Feedback for a Therapy Chatbot Application

9. Optimal Path Planning for Aerial Swarm in Area Exploration

10. Model-based Residual Policy Learning for Sample Efficient Mobile Network Optimization

Sökningar just nu

Populära sökningar

Uppsatser med många visningar igår (2024-04-27)