Markov Decision Processes
biblio
Submitted by aekwall on Fri, 01/13/2023 - 3:51pm
biblio
Submitted by grigby1 on Thu, 06/07/2018 - 3:09pm
biblio
Submitted by grigby1 on Tue, 02/27/2018 - 2:36pm
video
Submitted by pstone on Mon, 02/15/2016 - 3:03pm
file
Abstract:
The use of robots in society could be expanded by using reinforcement learning (RL) to allow robots to learn and adapt to new situations on-line. RL is a paradigm for learning sequential decision making tasks, usually formulated as a Markov Decision Process (MDP). For an RL algorithm to be practical for robotic control tasks, it must learn in very few samples, while continually taking actions in real-time. In addition, the algorithm must learn efficiently in the face of noise, sensor/actuator delays, and continuous state features.