Vis enkel innførsel

dc.contributor.authorWang, Congcong
dc.contributor.authorAlaya Cheikh, Faouzi
dc.contributor.authorBeghdadi, Azeddine
dc.contributor.authorElle, Ole Jacob
dc.date.accessioned2021-01-18T14:19:28Z
dc.date.available2021-01-18T14:19:28Z
dc.date.created2020-11-19T12:57:35Z
dc.date.issued2020
dc.identifier.citationIS&T International Symposium on Electronic Imaging Science and Technology. 2020, 2020 (10), 1-7.en_US
dc.identifier.issn2470-1173
dc.identifier.urihttps://hdl.handle.net/11250/2723535
dc.description.abstractThe object sizes in images are diverse, therefore, capturing multiple scale context information is essential for semantic segmentation. Existing context aggregation methods such as pyramid pooling module (PPM) and atrous spatial pyramid pooling (ASPP) employ different pooling size or atrous rate, such that multiple scale information is captured. However, the pooling sizes and atrous rates are chosen empirically. Rethinking of ASPP leads to our observation that learnable sampling locations of the convolution operation can endow the network learnable fieldof-view, thus the ability of capturing object context information adaptively. Following this observation, in this paper, we propose an adaptive context encoding (ACE) module based on deformable convolution operation where sampling locations of the convolution operation are learnable. Our ACE module can be embedded into other Convolutional Neural Networks (CNNs) easily for context aggregation. The effectiveness of the proposed module is demonstrated on Pascal-Context and ADE20K datasets. Although our proposed ACE only consists of three deformable convolution blocks, it outperforms PPM and ASPP in terms of mean Intersection of Union (mIoU) on both datasets. All the experimental studies confirm that our proposed module is effective compared to the state-of-the-art methods.en_US
dc.language.isoengen_US
dc.publisherSociety for Imaging Science and Technologyen_US
dc.titleAdaptive context encoding module for semantic segmentationen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber1-7en_US
dc.source.volume2020en_US
dc.source.journalIS&T International Symposium on Electronic Imaging Science and Technologyen_US
dc.source.issue10en_US
dc.identifier.doi10.2352/ISSN.2470-1173.2020.10.IPAS-027
dc.identifier.cristin1849860
dc.description.localcodeThis article will not be available due to copyright restrictions (c) 2020 by Society for Imaging Science and Technologyen_US
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel