Solving Adaptive Optimal Control Problems with Dynamic Programming

Fuglstad, Hilde

dc.contributor.advisor	Foss, Bjarne Anton
dc.contributor.advisor	Heirung, Tor Aksel N.
dc.contributor.author	Fuglstad, Hilde
dc.date.accessioned	2017-08-10T14:00:52Z
dc.date.available	2017-08-10T14:00:52Z
dc.date.created	2017-06-12
dc.date.issued	2017
dc.identifier	ntnudaim:16539
dc.identifier.uri	http://hdl.handle.net/11250/2450460
dc.description.abstract	In optimal control of uncertain systems, lack of crucial information about the system can lead to unacceptable performance like the violation of constraints. In these, or similar situations where it is important to reduce uncertainty quickly, excitation can be used for learning purposes. The optimal balance between learning and control is achieved with dual control. This concept was introduced over seventy years ago and is still relevant. It has been shown that dynamic programming (DP) can be used to solve these problems, along with a number of approximate methods. Analytical solution of the problems are in most cases impossible and it is therefore necessary to solve them numerically. The purpose of this thesis is to provide an overview of adaptive optimal control problems (AOCP) and the use of DP for solving them. The method is explored through several illustrating examples and the dual control algorithm is evaluated through computer simulations. The main examples considered are a simple integrator problem with unknown gains, and a minimum-time problem with an unknown breaking coefficient. The unknown parameters and noise in the systems are modelled as stochastic variables with known statistical distributions that are utilized by the dual controller. It is shown how the different AOCP can be formulated, and the DP algorithms can be implemented. Different noise model assumptions are evaluated to see how this can affect the problem. Numerical experiments assess the capabilities of typical hardware configurations and parallelization options explore the possibility of reduced runtime. Results from simulations certainly demonstrate how the dual controller manage to both control the process and learn about it simultaneously. The controller is also compared to a certainty equivalent (CE) and cautious controller to further emphasize the advantages it has to these heuristic, adaptive controllers. Despite the well-known problems related to the curse of dimensionality, it is shown that it is possible to solve the given AOCP using DP with a desired accuracy, within reasonable time.
dc.language	eng
dc.publisher	NTNU
dc.subject	Kybernetikk og robotikk, Autonome systemer
dc.title	Solving Adaptive Optimal Control Problems with Dynamic Programming
dc.type	Master thesis

Tilhørende fil(er)

Filnavn:: 16539_FULLTEXT.pdf
Størrelse:: 3.873Mb
Format:: PDF

Åpne

Filnavn:: 16539_COVER.pdf
Størrelse:: 1.556Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Institutt for teknisk kybernetikk [3663]

Vis enkel innførsel