benty-fields - Search paper

Fine-grained object retrieval aims to learn discriminative representation to retrieve visually similar objects. However, existing top-performing works usually impose pairwise similarities on the semantic embedding spaces or design a localization sub-network to continually fine-tune the entire model in limited data scenarios, thus resulting in convergence to suboptimal solutions. In this paper, we develop Fine-grained Retrieval Prompt Tuning (FRPT), which steers a frozen pre-trained model to perform the fine-grained retrieval task from the perspectives of sample prompting and feature adaptation. Specifically, FRPT only needs to learn fewer parameters in the prompt and adaptation instead of fine-tuning the entire model, thus solving the issue of convergence to suboptimal solutions caused by fine-tuning the entire model. Technically, a discriminative perturbation prompt (DPP) is introduced and deemed as a sample prompting process, which amplifies and even exaggerates some discriminative elements contributing to category prediction via a content-aware inhomogeneous sampling operation. In this way, DPP can make the fine-grained retrieval task aided by the perturbation prompts close to the solved task during the original pre-training. Thereby, it preserves the generalization and discrimination of representation extracted from input samples. Besides, a category-specific awareness head is proposed and regarded as feature adaptation, which removes the species discrepancies in features extracted by the pre-trained model using category-guided instance normalization. And thus, it makes the optimized features only include the discrepancies among subcategories. Extensive experiments demonstrate that our FRPT with fewer learnable parameters achieves the state-of-the-art performance on three widely-used fine-grained datasets.
Authors' comments: Accepted by AAAI 2023

Vote

Add to Library

Recommend

3320. Character-focused Video Thumbnail Retrieval

Shervin Ardeshir, Nagendra Kamath, Hossein Taghavi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2204.06563v1

Vote

Add to Library

Recommend

Benty-search

3301. On the Phase Retrievable Sequences

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.14150v1

3302. Multi-event Video-Text Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.11551v1

3303. IncDSI: Incrementally Updatable Document Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2307.10323v2

3304. Revisiting Neural Retrieval on Accelerators

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2306.04039v1

3305. The Information Retrieval Experiment Platform

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.18932v1

3306. Multiview Identifiers Enhanced Generative Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.16675v1

3307. Recommender Systems with Generative Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.05065v3

3308. Sketch-based Medical Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2303.03633v1

3309. Retrieval-Augmented Multimodal Language Modeling

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2211.12561v2

3310. Task-aware Retrieval with Instructions

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2211.09260v2

3311. Knowledge Retrieval for Robotic Cooking

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2211.04524v2

3312. Suffix Retrieval-Augmented Language Modeling

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2211.03053v2

3313. Explainable Information Retrieval: A Survey

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2211.02405v1

3314. Clarinet: A Music Retrieval System

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.12648v2

3315. Nonparametric Decoding for Generative Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.02068v3

3316. Retrieval Based Time Series Forecasting

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2209.13525v1

3317. MuMUR : Multilingual Multimodal Universal Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2208.11553v7

3318. Retrieval-based Controllable Molecule Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2208.11126v3

3319. Fine-grained Retrieval Prompt Tuning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2207.14465v3

3320. Character-focused Video Thumbnail Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2204.06563v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.14150v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.11551v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2307.10323v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2306.04039v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.18932v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.16675v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.05065v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2303.03633v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2211.12561v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2211.09260v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2211.04524v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2211.03053v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2211.02405v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.12648v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.02068v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2209.13525v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2208.11553v7

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2208.11126v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2207.14465v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2204.06563v1