benty-fields - Search paper

4701. TimberAgent: Gram-Guided Retrieval for Executable Music Effect Control

Shihao He, Yihan Xia, Fang Liu, Taotao Wang, Shengli Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.09332v1

Vote

Add to Library

Recommend

4702. Evoking User Memory: Personalizing LLM via Recollection-Familiarity Adaptive Retrieval

Yingyi Zhang, Junyi Li, Wenlin Zhang, Penyue Jia, Xianneng Li, Yichao Wang, Derong Xu, Yi Wen et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.09250v1

Vote

Add to Library

Recommend

4703. DEO: Training-Free Direct Embedding Optimization for Negation-Aware Retrieval

Taegyeong Lee, Jiwon Park, Seunghyun Hwang, JooYoung Jang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.09185v1

Vote

Add to Library

Recommend

4704. SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation

Yagiz Can Akay, Muhammed Yusuf Kartal, Esra Alparslan, Faruk Ortakoyluoglu, Arda Akpinar

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.08329v1

Vote

Add to Library

Recommend

4705. Structure-Preserving Graph Contrastive Learning for Mathematical Information Retrieval

Chun-Hsi Ku, Hung-Hsuan Chen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.08012v1

Vote

Add to Library

Recommend

4706. Retrieval-Augmented Generation for Predicting Cellular Responses to Gene Perturbation

Andrea Giuseppe Di Francesco, Andrea Rubbi, Pietro Liò

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.07233v1

Predicting how cells respond to genetic perturbations is fundamental to understanding gene function, disease mechanisms, and therapeutic development. While recent deep learning approaches have shown promise in modeling single-cell perturbation responses, they struggle to generalize across cell types and perturbation contexts due to limited contextual information during generation. We introduce PT-RAG (Perturbation-aware Two-stage Retrieval-Augmented Generation), a novel framework that extends Retrieval-Augmented Generation beyond traditional language-model applications to cellular biology. Unlike standard RAG systems designed for text retrieval with pre-trained LLMs, perturbation retrieval lacks established similarity metrics and requires learning what constitutes relevant context, making differentiable retrieval essential. PT-RAG addresses this through a two-stage pipeline: first, retrieving candidate perturbations $K$ using GenePT embeddings, then adaptively refining the selection through Gumbel-Softmax discrete sampling conditioned on both the cell state and the input perturbation. This cell-type-aware differentiable retrieval enables end-to-end optimization of the retrieval objective jointly with generation. On the Replogle-Nadig single-gene perturbation dataset, we demonstrate that PT-RAG outperforms both STATE and vanilla RAG under identical experimental conditions, with the strongest gains in distributional similarity metrics ($W_1$, $W_2$). Notably, vanilla RAG's dramatic failure is itself a key finding: it demonstrates that differentiable, cell-type-aware retrieval is essential in this domain, and that naive retrieval can actively harm performance. Our results establish retrieval-augmented generation as a promising paradigm for modelling cellular responses to gene perturbation. The code to reproduce our experiments is available at https://github.com/difra100/PT-RAG_ICLR.
Authors' comments: Accepted at ICLR 2026 Workshop: Generative AI in Genomics. 25 pages, 9 figures

Vote

Add to Library

Recommend

4707. Fine-Grained Table Retrieval Through the Lens of Complex Queries

Wojciech Kosiuk, Xingyu Ji, Yeounoh Chung, Fatma Özcan, Madelon Hulsebos

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.07146v1

Vote

Add to Library

Recommend

4708. Configurable Runtime Orchestration for Dynamic Data Retrieval in Distributed Systems

Abhiram Kandiraju

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.06980v1

Vote

Add to Library

Recommend

4709. Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion

Pengcheng Jiang, Judith Yue Li, Moonkyung Ryu, R. Lily Hu, Kun Su, Zhong Yi Wan, Liam Hebert, Hao Peng et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.06397v1

Vote

Add to Library

Recommend

4710. InfoGatherer: Principled Information Seeking via Evidence Retrieval and Strategic Questioning

Maksym Taranukhin, Shuyue Stella Li, Evangelos Milios, Geoff Pleiss, Yulia Tsvetkov, Vered Shwartz

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.05909v1

Vote

Add to Library

Recommend

4711. Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval

Artem Vazhentsev, Maria Marina, Daniil Moskovskiy, Sergey Pletenev, Mikhail Seleznyov, Mikhail Salnikov, Elena Tutubalina, Vasily Konovalov et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.05471v1

Vote

Add to Library

Recommend

4712. SGR3 Model: Scene Graph Retrieval-Reasoning Model in 3D

Zirui Wang, Ruiping Liu, Yufan Chen, Junwei Zheng, Weijia Fan, Kunyu Peng, Di Wen, Jiale Wei et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.04614v1

Vote

Add to Library

Recommend

4713. AgentIR: Reasoning-Aware Retrival for Deep Research Agents

Zijian Chen, Xueguang Ma, Shengyao Zhuang, Jimmy Lin, Akari Asai, Victor Zhong

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.04384v1

Vote

Add to Library

Recommend

4714. GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning

Mingleyang Li, Yuran Wang, Yue Chen, Tianxing Chen, Jiaqi Liang, Zishun Shen, Haoran Lu, Ruihai Wu et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.04158v1

Vote

Add to Library

Recommend

4715. Behind the Prompt: The Agent-User Problem in Information Retrieval

Saber Zerhoudi, Michael Granitzer, Dang Hai Dang, Jelena Mitrovic, Florian Lemmerich, Annette Hautli-Janisz, Stefan Katzenbeisser, Kanishka Ghosh Dastidar

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03630v1

Vote

Add to Library

Recommend

4716. RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation

Hao Li, Yuhao Wang, Wenning Hao, Pingping Zhang, Dong Wang, Huchuan Lu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03617v1

Vote

Add to Library

Recommend

4717. eTFCE: Exact Threshold-Free Cluster Enhancement via Fast Cluster Retrieval

Xu Chen, Wouter Weeda, Thomas E. Nichols, Jelle J. Goeman

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03004v1

Vote

Add to Library

Recommend

4718. MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

Runze Li, Kedi Chen, Guwei Feng, Mo Yu, Jun Wang, Wei Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.22289v1

Vote

Add to Library

Recommend

4719. Model Editing for New Document Integration in Generative Information Retrieval

Zhen Zhang, Zihan Wang, Xinyu Ma, Shuaiqiang Wang, Dawei Yin, Xin Xin, Pengjie Ren, Maarten de Rijke et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.02773v1

Generative retrieval (GR) reformulates the Information Retrieval (IR) task as the generation of document identifiers (docIDs). Despite its promise, existing GR models exhibit poor generalization to newly added documents, often failing to generate the correct docIDs. While incremental training offers a straightforward remedy, it is computationally expensive, resource-intensive, and prone to catastrophic forgetting, thereby limiting the scalability and practicality of GR. In this paper, we identify the core bottleneck as the decoder's ability to map hidden states to the correct docIDs of newly added documents. Model editing, which enables targeted parameter modifications for docID mapping, represents a promising solution. However, applying model editing to current GR models is not trivial, which is severely hindered by indistinguishable edit vectors across queries, due to the high overlap of shared docIDs in retrieval results. To address this, we propose DOME (docID-oriented model editing), a novel method that effectively and efficiently adapts GR models to unseen documents. DOME comprises three stages: (1) identification of critical layers, (2) optimization of edit vectors, and (3) construction and application of updates. At its core, DOME employs a hybrid-label adaptive training strategy that learns discriminative edit vectors by combining soft labels, which preserve query-specific semantics for distinguishable updates, with hard labels that enforce precise mapping modifications. Experiments on widely used benchmarks, including NQ and MS MARCO, show that our method significantly improves retrieval performance on new documents while maintaining effectiveness on the original collection. Moreover, DOME achieves this with only about 60% of the training time required by incremental training, considerably reducing computational cost and enabling efficient, frequent model updates.
Authors' comments: Accepted to The Web Conference (WWW) 2026

Vote

Add to Library

Recommend

4720. MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

Jiejun Tan, Zhicheng Dou, Liancheng Zhang, Yuyang Hu, Yiruo Cheng, Ji-Rong Wen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03379v1

As Large Language Models (LLMs) are increasingly used for long-duration tasks, maintaining effective long-term memory has become a critical challenge. Current methods often face a trade-off between cost and accuracy. Simple storage methods often fail to retrieve relevant information, while complex indexing methods (such as memory graphs) require heavy computation and can cause information loss. Furthermore, relying on the working LLM to process all memories is computationally expensive and slow. To address these limitations, we propose MemSifter, a novel framework that offloads the memory retrieval process to a small-scale proxy model. Instead of increasing the burden on the primary working LLM, MemSifter uses a smaller model to reason about the task before retrieving the necessary information. This approach requires no heavy computation during the indexing phase and adds minimal overhead during inference. To optimize the proxy model, we introduce a memory-specific Reinforcement Learning (RL) training paradigm. We design a task-outcome-oriented reward based on the working LLM's actual performance in completing the task. The reward measures the actual contribution of retrieved memories by mutiple interactions with the working LLM, and discriminates retrieved rankings by stepped decreasing contributions. Additionally, we employ training techniques such as Curriculum Learning and Model Merging to improve performance. We evaluated MemSifter on eight LLM memory benchmarks, including Deep Research tasks. The results demonstrate that our method meets or exceeds the performance of existing state-of-the-art approaches in both retrieval accuracy and final task completion. MemSifter offers an efficient and scalable solution for long-term LLM memory. We have open-sourced the model weights, code, and training data to support further research.
Authors' comments: Code and datasets are available at https://github.com/plageon/MemSifter

Vote

Add to Library

Recommend

Benty-search

4701. TimberAgent: Gram-Guided Retrieval for Executable Music Effect Control

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.09332v1

4702. Evoking User Memory: Personalizing LLM via Recollection-Familiarity Adaptive Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.09250v1

4703. DEO: Training-Free Direct Embedding Optimization for Negation-Aware Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.09185v1

4704. SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.08329v1

4705. Structure-Preserving Graph Contrastive Learning for Mathematical Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.08012v1

4706. Retrieval-Augmented Generation for Predicting Cellular Responses to Gene Perturbation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.07233v1

4707. Fine-Grained Table Retrieval Through the Lens of Complex Queries

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.07146v1

4708. Configurable Runtime Orchestration for Dynamic Data Retrieval in Distributed Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.06980v1

4709. Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.06397v1

4710. InfoGatherer: Principled Information Seeking via Evidence Retrieval and Strategic Questioning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.05909v1

4711. Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.05471v1

4712. SGR3 Model: Scene Graph Retrieval-Reasoning Model in 3D

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.04614v1

4713. AgentIR: Reasoning-Aware Retrival for Deep Research Agents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.04384v1

4714. GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.04158v1

4715. Behind the Prompt: The Agent-User Problem in Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.03630v1

4716. RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.03617v1

4717. eTFCE: Exact Threshold-Free Cluster Enhancement via Fast Cluster Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.03004v1

4718. MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.22289v1

4719. Model Editing for New Document Integration in Generative Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.02773v1

4720. MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.03379v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.09332v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.09250v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.09185v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.08329v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.08012v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.07233v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.07146v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.06980v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.06397v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.05909v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.05471v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.04614v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.04384v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.04158v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03630v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03617v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03004v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.22289v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.02773v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.03379v1