Vis enkel innførsel

dc.contributor.authorStefanov, Kalin
dc.contributor.authorBeskow, Jonas
dc.contributor.authorSalvi, Giampiero
dc.date.accessioned2021-02-04T13:41:58Z
dc.date.available2021-02-04T13:41:58Z
dc.date.created2020-06-30T09:08:06Z
dc.date.issued2020
dc.identifier.citationIEEE Transactions on Cognitive and Developmental Systems. 2020, 12 (2), 250-259.en_US
dc.identifier.issn2379-8920
dc.identifier.urihttps://hdl.handle.net/11250/2726214
dc.description.abstractThis paper presents a self-supervised method for visual detection of the active speaker in a multiperson spoken interaction scenario. Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings. The proposed method is intended to complement the acoustic detection of the active speaker, thus improving the system robustness in noisy conditions. The method can detect an arbitrary number of possibly overlapping active speakers based exclusively on visual information about their face. Furthermore, the method does not rely on external annotations, thus complying with cognitive development. Instead, the method uses information from the auditory modality to support learning in the visual domain. This paper reports an extensive evaluation of the proposed method using a large multiperson face-to-face interaction data set. The results show good performance in a speaker dependent setting. However, in a speaker independent setting the proposed method yields a significantly lower performance. We believe that the proposed method represents an essential component of any artificial cognitive system or robotic platform engaging in social interactions.en_US
dc.language.isoengen_US
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_US
dc.relation.urihttps://ieeexplore.ieee.org/document/8758947
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleSelf-supervised vision-based detection of the active speaker as support for socially aware language acquisitionen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber250-259en_US
dc.source.volume12en_US
dc.source.journalIEEE Transactions on Cognitive and Developmental Systemsen_US
dc.source.issue2en_US
dc.identifier.doi10.1109/TCDS.2019.2927941
dc.identifier.cristin1817709
dc.description.localcodeOpen Access CC-BY 4.0en_US
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal