Sökning: "Q-Learning"
Visar resultat 21 - 25 av 95 uppsatser innehållade ordet Q-Learning.
21. Energy Efficient Communication Scheduling for IoT-based Waterbirds Monitoring: Decentralized Strategies
Master-uppsats, Luleå tekniska universitet/Institutionen för system- och rymdteknikSammanfattning : Monitoring waterbirds have several benefits, including analyzing the number of endangered species, giving a reliable indication of public health, etc. Monitoring waterbirds in their habitat is a challenging task since the location is distant, and the collection of monitoring data requires large bandwidth. LÄS MER
22. Deep Reinforcement Learning and Simulation for the Optimization of Production Systems
Master-uppsats, Uppsala universitet/Institutionen för informationsteknologiSammanfattning : The main objective of this master thesis project is to use the deep reinforcement learning (DRL) and simulation method for optimization of production systems. In this project, the Deep Q-learning Networks (DQN) algorithm is first used to optimize seven decision variables in Averill Law’s production system to find the best profit, with 99. LÄS MER
23. Learning medical triage by using a reinforcement learning approach
Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Institutionen för informationsteknologiSammanfattning : Many emergency departments are today suffering from a overcrowding of people seeking care. The first stage in seeking care is being prioritised in different orders depending on symptoms by a doctor or nurse called medical triage. This is a cumbersome process that could be subject of automatisation. LÄS MER
24. Reinforcement Learning for Market Making
Master-uppsats, KTH/Matematisk statistikSammanfattning : Market making – the process of simultaneously and continuously providing buy and sell prices in a financial asset – is rather complicated to optimize. Applying reinforcement learning (RL) to infer optimal market making strategies is a relatively uncharted and novel research area. LÄS MER
25. Graph Bandits : Multi-Armed Bandits with Locality Constraints
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in a wealth of domains, including recommendation systems, dynamic pricing, and investment management. On the one hand, the current MAB literature largely seems to focus on the setting where each arm is available to play at each time step, and ignores how agents move between the arms. LÄS MER