Blar i Institutt for psykologi på forfatter "Garg, Saurabh"
-
Mouth2Audio: intelligible audio synthesis from videos with distinctive vowel articulation
Garg, Saurabh; Ruan, Haoyao; Hamarneh, Ghassan; Behne, Dawn Marie; Jongman, Allard; Sereno, Joan; Wang, Yue (Journal article, 2023)Humans use both auditory and facial cues to perceive speech, especially when auditory input is degraded, indicating a direct association between visual articulatory and acoustic speech information. This study investigates ...