Blar i Institutt for teknisk kybernetikk på forfatter "Sawant, Shambhuraj Vijaysinh"
-
Bridging the gap between QP-based and MPC-based Reinforcement Learning
Sawant, Shambhuraj Vijaysinh; Gros, Sebastien Nicolas (Journal article; Peer reviewed, 2022) -
A Painless Deterministic Policy Gradient Method for Learning-based MPC
Sadanandan Anand, Akhil; Sawant, Shambhuraj Vijaysinh; Gros, Sebastien Nicolas; Gravdahl, Jan Tommy (Chapter, 2023)The combination of Reinforcement Learning (RL) and Model Predictive Control (MPC) has gained a lot of interest in the recent literature as a way of computing the optimal policies from MPC schemes based on inaccurate models. ...