TY - JOUR
T1 - Fine-grained 3d-attention prototypes for few-shot learning
AU - Hu, Xin
AU - Liu, Jun
AU - Ma, Jie
AU - Pan, Yudai
AU - Zhang, Lingling
N1 - Publisher Copyright:
© 2020 Massachusetts Institute of Technology.
PY - 2020/9/1
Y1 - 2020/9/1
N2 - In the real world, a limited number of labeled finely grained images per class can hardly represent the class distribution effectively. Due to the more subtle visual differences in fine-grained images than simple images with obvious objects, that is, there exist smaller interclass and larger intraclass variations. To solve these issues, we propose an end-toend attention-based model for fine-grained few-shot image classification (AFG) with the recent episode training strategy. It is composed mainly of a feature learning module, an image reconstruction module, and a label distribution module. The feature learning module mainly devises a 3D-Attention mechanism, which considers both the spatial positions and different channel attentions of the image features, in order to learn more discriminative local features to better represent the class distribution. The image reconstruction module calculates the mappings between local features and the original images. It is constrained by a designed loss function as auxiliary supervised information, so that the learning of each local feature does not need extra annotations. The label distributionmodule is used to predict the label distribution of a given unlabeled sample, and we use the local features to represent the image features for classification. By conducting comprehensive experiments on Mini-ImageNet and three fine-grained data sets, we demonstrate that the proposedmodel achieves superior performance over the competitors.
AB - In the real world, a limited number of labeled finely grained images per class can hardly represent the class distribution effectively. Due to the more subtle visual differences in fine-grained images than simple images with obvious objects, that is, there exist smaller interclass and larger intraclass variations. To solve these issues, we propose an end-toend attention-based model for fine-grained few-shot image classification (AFG) with the recent episode training strategy. It is composed mainly of a feature learning module, an image reconstruction module, and a label distribution module. The feature learning module mainly devises a 3D-Attention mechanism, which considers both the spatial positions and different channel attentions of the image features, in order to learn more discriminative local features to better represent the class distribution. The image reconstruction module calculates the mappings between local features and the original images. It is constrained by a designed loss function as auxiliary supervised information, so that the learning of each local feature does not need extra annotations. The label distributionmodule is used to predict the label distribution of a given unlabeled sample, and we use the local features to represent the image features for classification. By conducting comprehensive experiments on Mini-ImageNet and three fine-grained data sets, we demonstrate that the proposedmodel achieves superior performance over the competitors.
UR - https://www.scopus.com/pages/publications/85089407222
U2 - 10.1162/neco_a_01302
DO - 10.1162/neco_a_01302
M3 - 文章
C2 - 32687772
AN - SCOPUS:85089407222
SN - 0899-7667
VL - 32
SP - 1664
EP - 1684
JO - Neural Computation
JF - Neural Computation
IS - 9
ER -