Blar i NTNU Open på forfatter "Stefanov, Kalin"
-
Self-supervised vision-based detection of the active speaker as support for socially aware language acquisition
Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero (Peer reviewed; Journal article, 2020)This paper presents a self-supervised method for visual detection of the active speaker in a multiperson spoken interaction scenario. Active speaker detection is a fundamental prerequisite for any artificial cognitive ... -
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition
Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero (Journal article; Peer reviewed, 2019)This paper presents a self-supervised method for visual detection of the active speaker in a multi-person spoken interaction scenario. Active speaker detection is a fundamental prerequisite for any artificial cognitive ... -
Spatial Bias in Vision-Based Voice Activity Detection
Stefanov, Kalin; Adiban, Mohammad; Salvi, Giampiero (Peer reviewed; Journal article, 2021)We develop and evaluate models for automatic vision-based voice activity detection (VAD) in multiparty human-human interactions that are aimed at complementing acoustic VAD methods. We provide evidence that this type of ...