Adaptive context encoding module for semantic segmentation

Wang, Congcong; Alaya Cheikh, Faouzi; Beghdadi, Azeddine; Elle, Ole Jacob

dc.contributor.author	Wang, Congcong
dc.contributor.author	Alaya Cheikh, Faouzi
dc.contributor.author	Beghdadi, Azeddine
dc.contributor.author	Elle, Ole Jacob
dc.date.accessioned	2021-01-18T14:19:28Z
dc.date.available	2021-01-18T14:19:28Z
dc.date.created	2020-11-19T12:57:35Z
dc.date.issued	2020
dc.identifier.citation	IS&T International Symposium on Electronic Imaging Science and Technology. 2020, 2020 (10), 1-7.	en_US
dc.identifier.issn	2470-1173
dc.identifier.uri	https://hdl.handle.net/11250/2723535
dc.description.abstract	The object sizes in images are diverse, therefore, capturing multiple scale context information is essential for semantic segmentation. Existing context aggregation methods such as pyramid pooling module (PPM) and atrous spatial pyramid pooling (ASPP) employ different pooling size or atrous rate, such that multiple scale information is captured. However, the pooling sizes and atrous rates are chosen empirically. Rethinking of ASPP leads to our observation that learnable sampling locations of the convolution operation can endow the network learnable fieldof-view, thus the ability of capturing object context information adaptively. Following this observation, in this paper, we propose an adaptive context encoding (ACE) module based on deformable convolution operation where sampling locations of the convolution operation are learnable. Our ACE module can be embedded into other Convolutional Neural Networks (CNNs) easily for context aggregation. The effectiveness of the proposed module is demonstrated on Pascal-Context and ADE20K datasets. Although our proposed ACE only consists of three deformable convolution blocks, it outperforms PPM and ASPP in terms of mean Intersection of Union (mIoU) on both datasets. All the experimental studies confirm that our proposed module is effective compared to the state-of-the-art methods.	en_US
dc.language.iso	eng	en_US
dc.publisher	Society for Imaging Science and Technology	en_US
dc.title	Adaptive context encoding module for semantic segmentation	en_US
dc.type	Peer reviewed	en_US
dc.type	Journal article	en_US
dc.description.version	publishedVersion	en_US
dc.source.pagenumber	1-7	en_US
dc.source.volume	2020	en_US
dc.source.journal	IS&T International Symposium on Electronic Imaging Science and Technology	en_US
dc.source.issue	10	en_US
dc.identifier.doi	10.2352/ISSN.2470-1173.2020.10.IPAS-027
dc.identifier.cristin	1849860
dc.description.localcode	This article will not be available due to copyright restrictions (c) 2020 by Society for Imaging Science and Technology	en_US
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: 1907.06082.pdf
Størrelse:: 651.4Kb
Format:: PDF
Beskrivelse:: Wang

Låst

Denne innførselen finnes i følgende samling(er)

Institutt for datateknologi og informatikk [6551]
Publikasjoner fra CRIStin - NTNU [37219]

Vis enkel innførsel