benty-fields - Search paper

Recently, the general-to-customized paradigm has emerged as the dominant approach for Cross-Modal Retrieval (CMR); it reconciles the distribution shift between the source domain and the target domain. However, existing general-to-customized CMR methods typically assume that the entire target-domain data is available, an assumption that is easily violated in real-world scenarios, and thus they inevitably suffer from the query shift (QS) problem. Specifically, query shift has two characteristics that pose new challenges to CMR. i) Online Shift: real-world queries always arrive in an online manner, making it impractical for customization approaches to access the entire query set beforehand; ii) Diverse Shift: even with domain customization, CMR models struggle to satisfy queries from diverse users or scenarios, leaving an urgent need to accommodate diverse queries. In this paper, we observe that QS not only undermines the well-structured common space inherited from the source model, but also steers the model toward forgetting the general knowledge indispensable for CMR. Inspired by these observations, we propose a novel method for achieving online and harmonious adaptation against QS, dubbed Robust adaptation with quEry ShifT (REST). To deal with online shift, REST first refines the retrieval results to formulate the query predictions and then defines a QS-robust objective function on these predictions to preserve the well-established common space in an online manner. To tackle the more challenging diverse shift, REST employs a gradient decoupling module that manipulates the gradients during adaptation, preventing the CMR model from forgetting the general knowledge. Extensive experiments on 20 benchmarks across three CMR tasks verify the effectiveness of our method against QS.
Authors' comments: 19 pages, 6 figures

4919. PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval

Chun Chet Ng, Jia Yu Lim, Wei Zeng Low

arXiv:2511.14130v1

4920. DLMMPR: Deep Learning-based Measurement Matrix for Phase Retrieval

Jing Liu, Bing Guo, Ren Zhu

arXiv:2511.12556v1

Benty-search

4901. UISearch: Graph-Based Embeddings for Multimodal Enterprise UI Screenshots Retrieval

arXiv:2511.19380v1

4902. Phase retrieval via overparametrized nonconvex optimization: nonsmooth amplitude loss landscapes

arXiv:2511.19045v1

4903. HyperbolicRAG: Enhancing Retrieval-Augmented Generation with Hyperbolic Representations

arXiv:2511.18808v1

4904. CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

arXiv:2511.18659v1

4905. Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters

arXiv:2511.17044v1

4906. A Benchmark for Procedural Memory Retrieval in Language Agents

arXiv:2511.21730v1

4907. Edge-ANN: Storage-Efficient Edge-Based Remote Sensing Feature Retrieval

arXiv:2511.16938v1

4908. Mesh RAG: Retrieval Augmentation for Autoregressive Mesh Generation

arXiv:2511.16807v1

4909. ARK: Answer-Centric Retriever Tuning via KG-augmented Curriculum Learning

arXiv:2511.16326v1

4910. MuISQA: Multi-Intent Retrieval-Augmented Generation for Scientific Question Answering

arXiv:2511.16283v1

4911. Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval

arXiv:2511.16150v1

4912. HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation

arXiv:2511.15435v1

4913. A Compliance-Preserving Retrieval System for Aircraft MRO Task Search

arXiv:2511.15383v1

4914. ItemRAG: Item-Based Retrieval-Augmented Generation for LLM-Based Recommendation

arXiv:2511.15141v1

4915. Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models

arXiv:2512.08943v1

4916. Streamlining Industrial Contract Management with Retrieval-Augmented LLMs

arXiv:2511.14671v1

4917. DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval

arXiv:2511.14449v1

4918. Toward Robust and Harmonious Adaptation for Cross-modal Retrieval

arXiv:2511.14416v1

4919. PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval

arXiv:2511.14130v1

4920. DLMMPR: Deep Learning-based Measurement Matrix for Phase Retrieval

arXiv:2511.12556v1
