Vis enkel innførsel

dc.contributor.authorBavirisetti, Durga Prasad
dc.contributor.authorMartinsen, Herman Ryen
dc.contributor.authorKiss, Gabriel Hanssen
dc.contributor.authorLindseth, Frank
dc.date.accessioned2024-03-05T11:13:46Z
dc.date.available2024-03-05T11:13:46Z
dc.date.created2024-01-02T12:51:05Z
dc.date.issued2023
dc.identifier.citationIEEE Open Journal of Intelligent Transportation Systems. 2023, 4, 909-928.en_US
dc.identifier.issn2687-7813
dc.identifier.urihttps://hdl.handle.net/11250/3121067
dc.description.abstractIn this paper, we investigate the use of Vision Transformers for processing and understanding visual data in an autonomous driving setting. Specifically, we explore the use of Vision Transformers for semantic segmentation and monocular depth estimation using only a single image as input. We present state-of-the-art Vision Transformers for these tasks and combine them into a multitask model. Through multiple experiments on four different street image datasets, we demonstrate that the multitask approach significantly reduces inference time while maintaining high accuracy for both tasks. Additionally, we show that changing the size of the Transformer-based backbone can be used as a trade-off between inference speed and accuracy. Furthermore, we investigate the use of synthetic data for pre-training and show that it effectively increases the accuracy of the model when real-world data is limited.en_US
dc.language.isoengen_US
dc.publisherIEEEen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleA Multi-Task Vision Transformer for Segmentation and Monocular Depth Estimation for Autonomous Vehiclesen_US
dc.title.alternativeA Multi-Task Vision Transformer for Segmentation and Monocular Depth Estimation for Autonomous Vehiclesen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber909-928en_US
dc.source.volume4en_US
dc.source.journalIEEE Open Journal of Intelligent Transportation Systemsen_US
dc.identifier.doi10.1109/OJITS.2023.3335648
dc.identifier.cristin2218908
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal