Sökning: "Förstärkande inlärning"
Visar resultat 6 - 10 av 35 uppsatser innehållade orden Förstärkande inlärning.
6. Improving Co-existence of URLLC and Distributed AI using RL
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : In 5G, Ultra-reliable and low-Latency communications (URLLC) service is envisioned to enable use cases with strict reliability and latency requirements on wireless communication. For the upcoming 6G network, machine learning (ML) also stands an important role that introduces intelligence and further enhances the system performance. LÄS MER
7. Playstyle Generation with Multimodal Generative Adversarial Imitation Learning : Style-reward from Human Demonstration for Playtesting Agents
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : Playtesting plays a crucial role in video game production. The presence of gameplay issues and faulty design choices can be of great detriment to the overall player experience. LÄS MER
8. Fine-tuning a LLM using Reinforcement Learning from Human Feedback for a Therapy Chatbot Application
Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : The field of AI and machine learning has seen exponential growth in the last decade and even more so in the recent year with the considerable public interest in Large Language models (LLMs) such as chat-GPT. LLMs can be used for several purposes, but one possible application would be fine-tuning a model to perform a particular function in a specific field. LÄS MER
9. Optimal Path Planning for Aerial Swarm in Area Exploration
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : This thesis presents an approach to solve an optimal path planning problem for a swarm of drones. We optimize and improve information retrieval in area exploration within applications such a ‘Search and Rescue’-missions or reconnaissance missions. For this, dynamic programming has been used as a solving approach for a optimization problem. LÄS MER
10. Model-based Residual Policy Learning for Sample Efficient Mobile Network Optimization
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : Reinforcement learning is a powerful tool which enables an agent to learn how to control complex systems. However, during the early phases of training, the performance is often poor. LÄS MER