Vis enkel innførsel

dc.contributor.advisorHetland, Magnus Lie
dc.contributor.authorBaklid, Martin
dc.contributor.authorBarlaug, Nils
dc.date.accessioned2018-11-09T15:00:31Z
dc.date.available2018-11-09T15:00:31Z
dc.date.created2018-07-02
dc.date.issued2018
dc.identifierntnudaim:18253
dc.identifier.urihttp://hdl.handle.net/11250/2571885
dc.description.abstractDeep learning has, in relatively few years, improved significantly the performance of many machine learning applications. Even though its popularity has surged, it's not always easy to apply deep learning to a real-world problem. Developing a good deep learning model is a process that most likely will include several iterations of data collection, training and hyperparameter tuning. One big obstacle in this process is the hunger for data and compute power. Supervised learning often requires a massive amount of annotated examples, and training usually extends over hours or days. This makes the process of developing deep learning models very time and resource consuming. In this thesis we investigate if recent advances in few-shot learning can be used to speed up this process, and look specifically at object detection as an example. Such methods could potentially decrease both the necessary number of examples and the training time. MAML is a promising few-shot learning method based on meta-learning that optimizes the initial parameters of a model to be best possibly suited for fine-tuning. It's model-agnostic by nature and can in principle be applied to most deep learning models. But through extensive exploration we show that it's far from trivial to apply MAML to object detection on natural images. However, we are able to use a simpler method inspired by MAML, Reptile. We show that a model pretrained using Reptile can be fine-tuned considerably faster than a model pretrained normally on object detection, but surprisingly it does not enable using fewer examples. In addition, we show that Reptile is able to speed up the development of deep learning models in practice. This is done by building a proof of concept tool and use this to test some example use cases.
dc.languageeng
dc.publisherNTNU
dc.subjectDatateknologi, Kunstig intelligens
dc.titleTowards Faster Development of Deep Learning Models Using Meta-Learning
dc.typeMaster thesis


Tilhørende fil(er)

Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel