A combined informative and representative active learning approach for plankton taxa labeling

Haug, Martin Lund; Saad, Aya; Stahl, Annette

dc.contributor.author	Haug, Martin Lund
dc.contributor.author	Saad, Aya
dc.contributor.author	Stahl, Annette
dc.date.accessioned	2022-10-19T07:19:35Z
dc.date.available	2022-10-19T07:19:35Z
dc.date.created	2021-04-28T14:15:33Z
dc.date.issued	2021
dc.identifier.citation	Proceedings of SPIE, the International Society for Optical Engineering. 2021, 11878 .	en_US
dc.identifier.issn	0277-786X
dc.identifier.uri	https://hdl.handle.net/11250/3026896
dc.description.abstract	With an ever-increasing amount of image data, the manual labeling process has become the bottleneck in many machine learning applications. Plankton taxa labeling is especially a challenge due to its complex nature, and the manual labeling effort places a large burden on the domain experts. The Active Learning (AL) paradigm is a promising research direction adopted in the literature to minimize the manual labeling effort exerted by domain experts. Many approaches for AL have been proposed over the recent years to improve the labeling task by supporting the construction of large data sets suitable to train machine learning models while minimizing human involvement in the process. Our empirical study suggests that many modern active learning methods fail to incorporate both the samples that represent the statistical pattern of the data and the samples in which the machine learning model is not confident about. Inspired by these limitations, we propose an algorithm that combines these two types of sampling in order to capture the data distribution of the whole feature space, prevent redundant sampling from correlated uncertainty queries and fine-tune the inter-class decision boundary. Our experiments show that the proposed method outperforms each of the methods separately Further, it also proves to be efficient on both the CIFAR-10 data set and the more complex Kaggle plankton dataset.	en_US
dc.language.iso	eng	en_US
dc.publisher	SPIE	en_US
dc.subject	Semisupervised deep learning	en_US
dc.subject	Semisupervised deep learning	en_US
dc.subject	Bildebehandling	en_US
dc.subject	Image processing	en_US
dc.subject	Datasyn	en_US
dc.subject	Computer Vision	en_US
dc.subject	Maskinlæring	en_US
dc.subject	Machine learning	en_US
dc.title	A combined informative and representative active learning approach for plankton taxa labeling	en_US
dc.type	Peer reviewed	en_US
dc.type	Journal article	en_US
dc.description.version	publishedVersion	en_US
dc.rights.holder	© Society of Photo Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.	en_US
dc.subject.nsi	VDP::Teknologi: 500	en_US
dc.subject.nsi	VDP::Technology: 500	en_US
dc.source.pagenumber	9	en_US
dc.source.volume	11878	en_US
dc.source.journal	Proceedings of SPIE, the International Society for Optical Engineering	en_US
dc.identifier.doi	10.1117/12.2601096
dc.identifier.cristin	1906993
dc.relation.project	Norges forskningsråd: 223254	en_US
dc.relation.project	Norges forskningsråd: 262741	en_US
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: Haug2021AcombinedInformativeAn ...
Størrelse:: 1.161Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Institutt for teknisk kybernetikk [3789]
Publikasjoner fra CRIStin - NTNU [38688]

Vis enkel innførsel