Deep learning as optimal control problems: models and numerical methods
Benning, Martin; Celledoni, Elena; Ehrhardt, Matthias J.; Owren, Brynjulf; Schönlieb, Carola-Bibiane
Journal article, Peer reviewed
Published version
View/ Open
Date
2019Metadata
Show full item recordCollections
- Institutt for matematiske fag [2582]
- Publikasjoner fra CRIStin - NTNU [39196]
Original version
10.3934/jcd.2019009Abstract
We consider recent work of [11] and [6], where deep learning neuralnetworks have been interpreted as discretisations of an optimal control problemsubject to an ordinary differential equation constraint. We review the first orderconditions for optimality, and the conditions ensuring optimality after discretiza-tion. This leads to a class of algorithms for solving the discrete optimal controlproblem which guarantee that the corresponding discrete necessary conditions foroptimality are fulfilled. We discuss two different deep learning algorithms and makea preliminary analysis of the ability of the algorithms to generalise. Deep learning as optimal control problems: models and numerical methods