Show simple item record

dc.contributor.authorStefanov, Kalin
dc.contributor.authorBeskow, Jonas
dc.contributor.authorSalvi, Giampiero
dc.date.accessioned2019-11-29T13:19:40Z
dc.date.available2019-11-29T13:19:40Z
dc.date.created2019-09-25T16:24:45Z
dc.date.issued2019
dc.identifier.issn2379-8920
dc.identifier.urihttp://hdl.handle.net/11250/2631110
dc.description.abstractThis paper presents a self-supervised method for visual detection of the active speaker in a multi-person spoken interaction scenario. Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings. The proposed method is intended to complement the acoustic detection of the active speaker, thus improving the system robustness in noisy conditions. The method can detect an arbitrary number of possibly overlapping active speakers based exclusively on visual information about their face. Furthermore, the method does not rely on external annotations, thus complying with cognitive development. Instead, the method uses information from the auditory modality to support learning in the visual domain. This paper reports an extensive evaluation of the proposed method using a large multi-person face-to-face interaction dataset. The results show good performance in a speaker dependent setting. However, in a speaker independent setting the proposed method yields a significantly lower performance. We believe that the proposed method represents an essential component of any artificial cognitive system or robotic platform engaging in social interactionsnb_NO
dc.language.isoengnb_NO
dc.publisherIEEEnb_NO
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/deed.no*
dc.titleSelf-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisitionnb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionpublishedVersionnb_NO
dc.source.journalIEEE Transactions on Cognitive and Developmental Systemsnb_NO
dc.identifier.doi10.1109/TCDS.2019.2927941
dc.identifier.cristin1729099
dc.description.localcode© 2019 IEEE. To access the final edited and published work see DOI 10.1109/TCDS.2019.2927941 This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/nb_NO
cristin.unitcode194,63,35,0
cristin.unitnameInstitutt for elektroniske systemer
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal