TY - GEN
T1 - Bi-directional Distribution Alignment for Transductive Zero-Shot Learning
AU - Wang, Zhicai
AU - Hao, Yanbin
AU - Mu, Tingting
AU - Li, Ouxiang
AU - Wang, Shuo
AU - He, Xiangnan
PY - 2023/2/27
Y1 - 2023/2/27
N2 - Zero-shot learning (ZSL) suffers intensely from the domain shift issue, i.e., the mismatch (or misalignment) between the true and learned data distributions for classes without training data (unseen classes). By learning additionally from unlabelled data collected for the unseen classes, transductive ZSL (TZSL) could reduce the shift but only to a certain extent. To improve TZSL, we propose a novel approach Bi-VAEGAN which strengthens the distribution alignment between the visual space and an auxiliary space. As a result, it can reduce largely the domain shift. The proposed key designs include (1) a bi-directional distribution alignment, (2) a simple but effective L2-norm based feature normalization approach, and (3) a more sophisticated unseen class prior estimation. Evaluated by four benchmark datasets, Bi-VAEGAN1 achieves the new state of the art under both the standard and generalized TZSL settings.
AB - Zero-shot learning (ZSL) suffers intensely from the domain shift issue, i.e., the mismatch (or misalignment) between the true and learned data distributions for classes without training data (unseen classes). By learning additionally from unlabelled data collected for the unseen classes, transductive ZSL (TZSL) could reduce the shift but only to a certain extent. To improve TZSL, we propose a novel approach Bi-VAEGAN which strengthens the distribution alignment between the visual space and an auxiliary space. As a result, it can reduce largely the domain shift. The proposed key designs include (1) a bi-directional distribution alignment, (2) a simple but effective L2-norm based feature normalization approach, and (3) a more sophisticated unseen class prior estimation. Evaluated by four benchmark datasets, Bi-VAEGAN1 achieves the new state of the art under both the standard and generalized TZSL settings.
UR - https://arxiv.org/pdf/2303.08698.pdf
M3 - Conference contribution
BT - IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR)
ER -