Deep learning as optimal control problems: models and numerical methods

Benning, Martin; Celledoni, Elena; Ehrhardt, Matthias J.; Owren, Brynjulf; Schönlieb, Carola-Bibiane


Journal article, Peer reviewed

Published version

Open

Benning (Locked)

Permanent link

http://hdl.handle.net/11250/2637641

Publication date

2019

Abstract

We consider recent work of [11] and [6], where deep learning neural networks have been interpreted as discretisations of an optimal control problem subject to an ordinary differential equation constraint. We review the first order conditions for optimality, and the conditions ensuring optimality after discretisation. This leads to a class of algorithms for solving the discrete optimal control problem which guarantee that the corresponding discrete necessary conditions for optimality are fulfilled. We discuss two different deep learning algorithms and make a preliminary analysis of the ability of the algorithms to generalise.
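To illustrate the interpretation described in the abstract, the following is a minimal sketch, not the authors' implementation, of a residual network viewed as a forward Euler discretisation of an ODE of the form x'(t) = tanh(K(t) x(t) + b(t)). The function name `resnet_forward`, the choice of tanh as activation, the step size `h`, and the random parameters are all assumptions made for illustration only; the record itself does not specify them.

```python
import numpy as np

def resnet_forward(x0, weights, biases, h):
    """Propagate x0 through residual layers x_{k+1} = x_k + h * tanh(K_k x_k + b_k).

    Each layer is one forward Euler step of x'(t) = tanh(K(t) x(t) + b(t)).
    The activation and parametrisation are illustrative assumptions.
    """
    x = np.asarray(x0, dtype=float)
    for K, b in zip(weights, biases):
        x = x + h * np.tanh(K @ x + b)
    return x

# Hypothetical example data: 10 layers acting on 3-dimensional features.
rng = np.random.default_rng(0)
d, n_layers = 3, 10
weights = [rng.standard_normal((d, d)) for _ in range(n_layers)]
biases = [rng.standard_normal(d) for _ in range(n_layers)]

x0 = np.ones(d)
xT = resnet_forward(x0, weights, biases, h=1.0 / n_layers)
```

In this reading, the weights and biases play the role of the control variables, and training the network corresponds to choosing them to minimise a cost on the final state `xT`.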


Publisher

American Institute of Mathematical Sciences (AIMS)

Journal

Journal of Computational Dynamics