Intra-class low-rank regularization for supervised and semi-supervised cross-modal retrieval

Peipei Kang, Zehang Lin, Zhenguo Yang, Xiaozhao Fang, Alexander M. Bronstein, Qing Li, Wenyin Liu

Research output: Journal article publication › Journal article › Academic research › peer-review

7 Citations (Scopus)

Abstract

Cross-modal retrieval aims to retrieve related items across different modalities, for example, using an image query to retrieve related text. Existing deep methods ignore both the intra-modal and inter-modal intra-class low-rank structures when fusing modalities, which degrades retrieval performance. In this paper, two deep models based on intra-class low-rank regularization, denoted ILCMR and Semi-ILCMR, are proposed for supervised and semi-supervised cross-modal retrieval, respectively. Specifically, ILCMR integrates the image network and the text network into a unified framework that learns a common feature space by imposing three regularization terms to fuse the cross-modal data. First, to align the data in the label space, we use semantic consistency regularization to convert the data representations into probability distributions over the classes. Second, we introduce an intra-modal low-rank regularization, which encourages intra-class samples that originate from the same space to be more relevant in the common feature space. Third, an inter-modal low-rank regularization is applied to reduce the cross-modal discrepancy. To enable the low-rank regularization to be optimized with automatic gradients during network back-propagation, we propose a rank-r approximation and specify the explicit gradients for theoretical completeness. For the semi-supervised regime, we propose Semi-ILCMR, which adds to ILCMR's three label-dependent regularization terms a low-rank constraint applied before projecting the general representations into the common feature space. Extensive experiments on four public cross-modal datasets demonstrate the superiority of ILCMR and Semi-ILCMR over other state-of-the-art methods.
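
The abstract does not spell out the rank-r approximation, so the following is only a minimal sketch of what an intra-class low-rank penalty could look like, not the authors' formulation. It assumes one common surrogate: for the feature matrix of each class, sum the singular values beyond the r largest, which vanishes when that matrix has rank at most r. The function names, the toy data, and the choice to stack both modalities for the inter-modal term are all illustrative assumptions.

```python
import numpy as np


def tail_singular_value_penalty(features, r):
    """Sum of the singular values beyond the r largest.

    Assumed surrogate: it is zero exactly when rank(features) <= r.
    """
    singular_values = np.linalg.svd(features, compute_uv=False)  # descending
    return float(singular_values[r:].sum())


def intra_class_low_rank_loss(features, labels, r=1):
    """Add up the penalty over the feature matrix of each class."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    return sum(
        tail_singular_value_penalty(features[labels == c], r)
        for c in np.unique(labels)
    )


def inter_modal_low_rank_loss(image_feats, text_feats, labels, r=1):
    """Same penalty on each class after stacking both modalities (assumed)."""
    stacked = np.vstack([image_feats, text_feats])
    stacked_labels = np.concatenate([labels, labels])
    return intra_class_low_rank_loss(stacked, stacked_labels, r)


# Toy data: every class lies along a single direction,
# so each per-class matrix has rank 1.
rng = np.random.default_rng(0)
labels = np.array([0, 0, 0, 1, 1, 1])
directions = rng.normal(size=(2, 8))
clean = rng.normal(size=(6, 1)) * directions[labels]
noisy = clean + 0.1 * rng.normal(size=clean.shape)

print(intra_class_low_rank_loss(clean, labels))  # numerically close to zero
print(intra_class_low_rank_loss(noisy, labels))  # clearly positive
```

In a deep model the same quantity would be computed on network outputs inside an automatic-differentiation framework; NumPy is used here only to keep the sketch self-contained.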

Original language: English
Pages (from-to): 33-54
Number of pages: 22
Journal: Applied Intelligence
Volume: 52
Issue number: 1
Publication status: Published - Jan 2022

Keywords

  • Cross-modal retrieval
  • Deep neural networks
  • Intra-class low-rank
  • Semi-supervised learning
  • Supervised learning

ASJC Scopus subject areas

  • Artificial Intelligence
