Deep Reinforcement Learning Attitude Control of Fixed Wing UAVs Using Proximal Policy Optimization

Bøhn, Eivind Eigil; Coates, Erlend Magnus Lervik; Moe, Signe; Johansen, Tor Arne

dc.contributor.author	Bøhn, Eivind Eigil
dc.contributor.author	Coates, Erlend Magnus Lervik
dc.contributor.author	Moe, Signe
dc.contributor.author	Johansen, Tor Arne
dc.date.accessioned	2019-11-25T11:24:48Z
dc.date.available	2019-11-25T11:24:48Z
dc.date.created	2019-11-23T12:16:03Z
dc.date.issued	2019
dc.identifier.isbn	978-1-7281-0333-4
dc.identifier.uri	http://hdl.handle.net/11250/2630237
dc.description.abstract	Contemporary autopilot systems for unmanned aerial vehicles (UAVs) are far more limited in their flight envelope as compared to experienced human pilots, thereby restricting the conditions UAVs can operate in and the types of missions they can accomplish autonomously. This paper proposes a deep reinforcement learning (DRL) controller to handle the nonlinear attitude control problem, enabling extended flight envelopes for fixed-wing UAVs. A proof-of-concept controller using the proximal policy optimization (PPO) algorithm is developed, and is shown to be capable of stabilizing a fixed-wing UAV from a large set of initial conditions to reference roll, pitch and airspeed values. The training process is outlined and key factors for its progression rate are considered, with the most important factor found to be limiting the number of variables in the observation vector, and including values for several previous time steps for these variables. The trained reinforcement learning (RL) controller is compared to a proportional-integral-derivative (PID) controller, and is found to converge in more cases than the PID controller, with comparable performance. Furthermore, the RL controller is shown to generalize well to unseen disturbances in the form of wind and turbulence, even in severe disturbance conditions.	nb_NO
dc.language.iso	eng	nb_NO
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	nb_NO
dc.relation.ispartof	2019 International Conference on Unmanned Aircraft Systems (ICUAS)
dc.title	Deep Reinforcement Learning Attitude Control of Fixed Wing UAVs Using Proximal Policy Optimization	nb_NO
dc.type	Chapter	nb_NO
dc.description.version	acceptedVersion	nb_NO
dc.identifier.doi	10.1109/ICUAS.2019.8798254
dc.identifier.cristin	1751318
dc.relation.project	Norges forskningsråd: 261791	nb_NO
dc.relation.project	Norges forskningsråd: 272402	nb_NO
dc.relation.project	Norges forskningsråd: 223254	nb_NO
dc.description.localcode	© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	nb_NO
cristin.unitcode	194,63,25,0
cristin.unitname	Institutt for teknisk kybernetikk
cristin.ispublished	true
cristin.fulltext	postprint
cristin.qualitycode	1

Files in this item

Name:: eeb_ICUAS_Paper.pdf
Size:: 2.324Mb
Format:: PDF
Description:: Bøhn

View/Open

This item appears in the following Collection(s)

Institutt for teknisk kybernetikk [3663]
Publikasjoner fra CRIStin - NTNU [37215]

Show simple item record