Show simple item record

dc.contributor.authorUllah, Mohib
dc.contributor.authorMohammed, Ahmed Kedir
dc.contributor.authorAlaya Cheikh, Faouzi
dc.date.accessioned2019-02-22T09:48:38Z
dc.date.available2019-02-22T09:48:38Z
dc.date.created2018-09-26T10:23:57Z
dc.date.issued2018
dc.identifier.citationJournal of Imaging. 2018, 4 (9), .nb_NO
dc.identifier.issn2313-433X
dc.identifier.urihttp://hdl.handle.net/11250/2586966
dc.description.abstractArticulation modeling, feature extraction, and classification are the important components of pedestrian segmentation. Usually, these components are modeled independently from each other and then combined in a sequential way. However, this approach is prone to poor segmentation if any individual component is weakly designed. To cope with this problem, we proposed a spatio-temporal convolutional neural network named PedNet which exploits temporal information for spatial segmentation. The backbone of the PedNet consists of an encoder–decoder network for downsampling and upsampling the feature maps, respectively. The input to the network is a set of three frames and the output is a binary mask of the segmented regions in the middle frame. Irrespective of classical deep models where the convolution layers are followed by a fully connected layer for classification, PedNet is a Fully Convolutional Network (FCN). It is trained end-to-end and the segmentation is achieved without the need of any pre- or post-processing. The main characteristic of PedNet is its unique design where it performs segmentation on a frame-by-frame basis but it uses the temporal information from the previous and the future frame for segmenting the pedestrian in the current frame. Moreover, to combine the low-level features with the high-level semantic information learned by the deeper layers, we used long-skip connections from the encoder to decoder network and concatenate the output of low-level layers with the higher level layers. This approach helps to get segmentation map with sharp boundaries. To show the potential benefits of temporal information, we also visualized different layers of the network. The visualization showed that the network learned different information from the consecutive frames and then combined the information optimally to segment the middle frame. We evaluated our approach on eight challenging datasets where humans are involved in different activities with severe articulation (football, road crossing, surveillance). The most common CamVid dataset which is used for calculating the performance of the segmentation algorithm is evaluated against seven state-of-the-art methods. The performance is shown on precision/recall, F1, F2, and mIoU. The qualitative and quantitative results show that PedNet achieves promising results against state-of-the-art methods with substantial improvement in terms of all the performance metrics.nb_NO
dc.language.isoengnb_NO
dc.publisherMDPInb_NO
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titlePedNet: A Spatio-Temporal Deep Convolutional Neural Network for Pedestrian Segmentationnb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionpublishedVersionnb_NO
dc.subject.nsiVDP::Informasjons- og kommunikasjonsteknologi: 550nb_NO
dc.subject.nsiVDP::Information and communication technology: 550nb_NO
dc.source.pagenumber18nb_NO
dc.source.volume4nb_NO
dc.source.journalJournal of Imagingnb_NO
dc.source.issue9nb_NO
dc.identifier.doi10.3390/jimaging4090107
dc.identifier.cristin1613698
dc.relation.projectNorges forskningsråd: 247689nb_NO
dc.relation.projectNorges forskningsråd: 68123734nb_NO
dc.description.localcode(C) 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).nb_NO
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.fulltextpostprint
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Navngivelse 4.0 Internasjonal
Except where otherwise noted, this item's license is described as Navngivelse 4.0 Internasjonal