Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping. Academic Article uri icon

abstract

  • Grouping has been commonly used in deep metric learning for computing diverse features. To improve the performance and interpretability, we propose an improved and interpretable grouping method to be integrated flexibly with any metric learning framework. Specifically, our method is based on the attention mechanism with a learnable query for each group. The query is fully trainable and can capture group-specific information when combined with the diversity loss. An appealing property of our method is that it naturally lends itself interpretability. The attention scores between the learnable query and each spatial position can be interpreted as the importance of that position. We formally show that our proposed grouping method is invariant to spatial permutations of features. When used as a module in convolutional neural networks, our method leads to translational invariance. We conduct comprehensive experiments to evaluate our method. Our quantitative results indicate that the proposed method outperforms prior methods consistently and significantly across different datasets, evaluation metrics, base models, and loss functions. For the first time to the best of our knowledge, our interpretation results clearly demonstrate that the proposed method enables the learning of diverse and stable semantic features across groups.

published proceedings

  • IEEE Trans Pattern Anal Mach Intell

altmetric score

  • 0.25

author list (cited authors)

  • Xu, X., Wang, Z., Deng, C., Yuan, H., & Ji, S.

citation count

  • 2

complete list of authors

  • Xu, Xinyi||Wang, Zhengyang||Deng, Cheng||Yuan, Hao||Ji, Shuiwang

publication date

  • January 2023