TY - JOUR
T1 - Dual autoencoder based zero shot learning in special domain
AU - Li, Qiong
AU - Rigall, Eric
AU - Sun, Xin
AU - Lam, Kin Man
AU - Dong, Junyu
N1 - Funding Information:
This work was supported by the National Natural Science Foundation of China (61971388, U1706218), and a research grant from the Key-Area Research and Development Program of Guangdong Province 2020 under Project 76.
Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2022
Y1 - 2022
N2 - Zero-shot learning aims to learn a visual classifier for a category which has no training samples leveraging its semantic information and its relationship to other categories. It is common, yet vital, in practical visual scenarios, and particularly prominent in the uncharted ocean field. Phytoplankton plays an important part in the marine ecological environment. It is common to encounter the zero-shot recognition problem during the in situ observation. Therefore, we propose a dual autoencoder model, which contains two similar encoder–decoder structures, to tackle the zero-shot recognition problem. The first one is used for the projection from the visual feature space to a latent space, then to the semantic space. Inversely, the second one projects from the semantic space to another latent space, then back to the visual feature space. This structure guarantees the projection from the visual feature space to the semantic space to be more effective, through the stable mutual mapping. Experimental results on four benchmarks demonstrate that the proposed dual autoencoder model achieves competitive performance compared with six recent state-of-the-art methods. Furthermore, we apply our algorithm to phytoplankton classification. We manually annotated phytoplankton attributes to develop a practical dataset for this real and special domain application, i.e., Zero-shot learning dataset for PHYtoplankton (ZeroPHY). Experiment results show that our method achieves the best performance on this real-world application.
AB - Zero-shot learning aims to learn a visual classifier for a category which has no training samples leveraging its semantic information and its relationship to other categories. It is common, yet vital, in practical visual scenarios, and particularly prominent in the uncharted ocean field. Phytoplankton plays an important part in the marine ecological environment. It is common to encounter the zero-shot recognition problem during the in situ observation. Therefore, we propose a dual autoencoder model, which contains two similar encoder–decoder structures, to tackle the zero-shot recognition problem. The first one is used for the projection from the visual feature space to a latent space, then to the semantic space. Inversely, the second one projects from the semantic space to another latent space, then back to the visual feature space. This structure guarantees the projection from the visual feature space to the semantic space to be more effective, through the stable mutual mapping. Experimental results on four benchmarks demonstrate that the proposed dual autoencoder model achieves competitive performance compared with six recent state-of-the-art methods. Furthermore, we apply our algorithm to phytoplankton classification. We manually annotated phytoplankton attributes to develop a practical dataset for this real and special domain application, i.e., Zero-shot learning dataset for PHYtoplankton (ZeroPHY). Experiment results show that our method achieves the best performance on this real-world application.
KW - Autoencoder
KW - Classification
KW - Phytoplankton
KW - Zero shot learning
UR - http://www.scopus.com/inward/record.url?scp=85139494818&partnerID=8YFLogxK
U2 - 10.1007/s10044-022-01109-9
DO - 10.1007/s10044-022-01109-9
M3 - Journal article
AN - SCOPUS:85139494818
SN - 1433-7541
JO - Pattern Analysis and Applications
JF - Pattern Analysis and Applications
ER -