Distribution and gradient constrained embedding model for zero-shot learning with fewer seen samples
Peer reviewed, Journal article
Submitted version
Permanent link: https://hdl.handle.net/11250/3039567
Date of issue: 2022
Original version: https://doi.org/10.1016/j.knosys.2022.109218

Abstract
Zero-Shot Learning (ZSL), which aims to recognize unseen classes with no training data, has made great progress in recent years. However, established ZSL methods implicitly assume that sufficient labeled samples exist for each seen class, which is idealistic in practice: collecting sufficient labeled samples is labor-intensive and may even be inherently impractical for rare, low-probability events. Accordingly, we investigate how to perform ZSL with fewer seen samples. Specifically, we propose a Distribution and Gradient constrained Embedding Model (DGEM), which predicts the visual prototypes (class means) for the given semantic vectors of seen classes. We summarize the main challenges posed by limited seen samples as the representation-bias problem and the over-fitting problem, and propose two regularizers to address them: (1) a prototype refinement loss that uses the relative distribution of class semantics to constrain that of the predicted visual prototypes; and (2) a projection smoothing constraint that prevents the model from forming sharp decision boundaries. We validate the effectiveness of DGEM on five ZSL datasets and compare it with several representative ZSL methods. Experimental results show that DGEM outperforms the other established methods when each seen class has only one or five samples.
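The abstract's two regularizers can be sketched in code. The snippet below is a minimal illustration, not the paper's implementation: the linear embedding, the use of a row-wise softmax over pairwise cosine similarities as the "relative distribution", the squared-weight penalty as a stand-in for the projection smoothing constraint, and the weights `lam`/`mu` are all assumptions made for illustration; the paper's exact formulations may differ.

```python
import numpy as np

rng = np.random.default_rng(0)

def predict_prototypes(S, W):
    """Linear embedding: map class semantics S (C x ds) to visual prototypes (C x dv)."""
    return S @ W

def relative_distribution(X):
    """One plausible notion of a class set's 'relative distribution':
    a row-wise softmax over pairwise cosine similarities (illustrative choice)."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    sim = Xn @ Xn.T
    np.fill_diagonal(sim, -np.inf)            # exclude self-similarity
    e = np.exp(sim - sim.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def dgem_loss(S, V, W, lam=0.1, mu=0.01):
    """Illustrative total loss: prototype regression plus the two regularizers."""
    P = predict_prototypes(S, W)
    regression = np.mean((P - V) ** 2)        # fit predicted prototypes to class means
    refine = np.mean((relative_distribution(S)
                      - relative_distribution(P)) ** 2)  # distribution constraint
    smooth = np.sum(W ** 2)                   # crude proxy for a smooth projection
    return regression + lam * refine + mu * smooth

# Toy data: C seen classes, semantic dim ds, visual dim dv (all hypothetical sizes).
C, ds, dv = 5, 8, 6
S = rng.normal(size=(C, ds))                  # class semantic vectors
V = rng.normal(size=(C, dv))                  # visual prototypes (class means)
W = rng.normal(size=(ds, dv)) * 0.1           # embedding weights
print(dgem_loss(S, V, W))
```

The refinement term pulls the geometry of the predicted prototypes toward that of the semantic space, which is the stated remedy for representation bias under few samples; the smoothing term discourages sharp decision boundaries by keeping the projection small.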