benty-fields - Search paper

4841. GlyRAG: Context-Aware Retrieval-Augmented Framework for Blood Glucose Forecasting

Shovito Barua Soumma, Hassan Ghasemzadeh

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.05353v1

Vote

Add to Library

Recommend

4842. Multivector Reranking in the Era of Strong First-Stage Retrievers

Silvio Martinico, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.05200v1

Learned multivector representations power modern search systems with strong retrieval effectiveness, but their real-world use is limited by the high cost of exhaustive token-level retrieval. Therefore, most systems adopt a \emph{gather-and-refine} strategy, where a lightweight gather phase selects candidates for full scoring. However, this approach requires expensive searches over large token-level indexes and often misses the documents that would rank highest under full similarity. In this paper, we reproduce several state-of-the-art multivector retrieval methods on two publicly available datasets, providing a clear picture of the current multivector retrieval field and observing the inefficiency of token-level gathering. Building on top of that, we show that replacing the token-level gather phase with a single-vector document retriever -- specifically, a learned sparse retriever (LSR) -- produces a smaller and more semantically coherent candidate set. This recasts the gather-and-refine pipeline into the well-established two-stage retrieval architecture. As retrieval latency decreases, query encoding with two neural encoders becomes the dominant computational bottleneck. To mitigate this, we integrate recent inference-free LSR methods, demonstrating that they preserve the retrieval effectiveness of the dual-encoder pipeline while substantially reducing query encoding time. Finally, we investigate multiple reranking configurations that balance efficiency, memory, and effectiveness, and we introduce two optimization techniques that prune low-quality candidates early. Empirical results show that these techniques improve retrieval efficiency by up to 1.8$\times$ with no loss in quality. Overall, our two-stage approach achieves over $24\times$ speedup over the state-of-the-art multivector retrieval systems, while maintaining comparable or superior retrieval quality.
Authors' comments: 17 pages, 2 figures, ECIR 2026

Vote

Add to Library

Recommend

4843. RAAR: Retrieval Augmented Agentic Reasoning for Cross-Domain Misinformation Detection

Zhiwei Liu, Runteng Guo, Baojie Qu, Yuechen Jiang, Min Peng, Qianqian Xie, Sophia Ananiadou

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04853v1

Vote

Add to Library

Recommend

4844. Orion-RAG: Path-Aligned Hybrid Retrieval for Graphless Data

Zhen Chen, Weihao Xie, Peilin Chen, Shiqi Wang, Jianping Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04764v1

Vote

Add to Library

Recommend

4845. Enhancing Multimodal Retrieval via Complementary Information Extraction and Alignment

Delong Zeng, Yuexiang Xie, Yaliang Li, Ying Shen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04571v1

Vote

Add to Library

Recommend

4846. SoK: Privacy Risks and Mitigations in Retrieval-Augmented Generation Systems

Andreea-Elena Bodea, Stephen Meisenbacher, Alexandra Klymenko, Florian Matthes

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.03979v1

The continued promise of Large Language Models (LLMs), particularly in their natural language understanding and generation capabilities, has driven a rapidly increasing interest in identifying and developing LLM use cases. In an effort to complement the ingrained "knowledge" of LLMs, Retrieval-Augmented Generation (RAG) techniques have become widely popular. At its core, RAG involves the coupling of LLMs with domain-specific knowledge bases, whereby the generation of a response to a user question is augmented with contextual and up-to-date information. The proliferation of RAG has sparked concerns about data privacy, particularly with the inherent risks that arise when leveraging databases with potentially sensitive information. Numerous recent works have explored various aspects of privacy risks in RAG systems, from adversarial attacks to proposed mitigations. With the goal of surveying and unifying these works, we ask one simple question: What are the privacy risks in RAG, and how can they be measured and mitigated? To answer this question, we conduct a systematic literature review of RAG works addressing privacy, and we systematize our findings into a comprehensive set of privacy risks, mitigation techniques, and evaluation strategies. We supplement these findings with two primary artifacts: a Taxonomy of RAG Privacy Risks and a RAG Privacy Process Diagram. Our work contributes to the study of privacy in RAG not only by conducting the first systematization of risks and mitigations, but also by uncovering important considerations when mitigating privacy risks in RAG systems and assessing the current maturity of proposed mitigations.
Authors' comments: 17 pages, 3 figures, 5 tables. This work has been accepted for publication at the IEEE Conference on Secure and Trustworthy Machine Learning (SaTML 2026). The final version will be available on IEEE Xplore

Vote

Add to Library

Recommend

4847. Improving Scientific Document Retrieval with Academic Concept Index

Jeyun Lee, Junhyoung Lee, Wonbin Kweon, Bowen Jin, Yu Zhang, Susik Yoon, Dongha Lee, Hwanjo Yu et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.00567v1

Vote

Add to Library

Recommend

4848. MACA: A Framework for Distilling Trustworthy LLMs into Efficient Retrievers

Satya Swaroop Gudipudi, Sahil Girhepuje, Ponnurangam Kumaraguru, Kristine Ma

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.00926v1

Vote

Add to Library

Recommend

4849. Nonlinear determination and phase retrieval under unimodular constraints

Lukas Liehr, Tomasz Szczepanski

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.00403v1

Vote

Add to Library

Recommend

4850. R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory

Maoyuan Li, Zhongsheng Wang, Haoyuan Li, Jiamou Liu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.24684v1

Vote

Add to Library

Recommend

4851. SPARK: Search Personalization via Agent-Driven Retrieval and Knowledge-sharing

Gaurab Chhetri, Subasish Das, Tausif Islam Chowdhury

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.24008v1

Personalized search demands the ability to model users' evolving, multi-dimensional information needs; a challenge for systems constrained by static profiles or monolithic retrieval pipelines. We present SPARK (Search Personalization via Agent-Driven Retrieval and Knowledge-sharing), a framework in which coordinated persona-based large language model (LLM) agents deliver task-specific retrieval and emergent personalization. SPARK formalizes a persona space defined by role, expertise, task context, and domain, and introduces a Persona Coordinator that dynamically interprets incoming queries to activate the most relevant specialized agents. Each agent executes an independent retrieval-augmented generation process, supported by dedicated long- and short-term memory stores and context-aware reasoning modules. Inter-agent collaboration is facilitated through structured communication protocols, including shared memory repositories, iterative debate, and relay-style knowledge transfer. Drawing on principles from cognitive architectures, multi-agent coordination theory, and information retrieval, SPARK models how emergent personalization properties arise from distributed agent behaviors governed by minimal coordination rules. The framework yields testable predictions regarding coordination efficiency, personalization quality, and cognitive load distribution, while incorporating adaptive learning mechanisms for continuous persona refinement. By integrating fine-grained agent specialization with cooperative retrieval, SPARK provides insights for next-generation search systems capable of capturing the complexity, fluidity, and context sensitivity of human information-seeking behavior.
Authors' comments: This is the author's preprint. Accepted to WEB&GRAPH 2026 (co-located with WSDM 2026), Boise, Idaho, USA, Feb 26, 2026. Final version will appear in WSDM 2026 Companion Proceedings. Conf: https://wsdm-conference.org/2026/ Workshop: https://aiimlab.org/events/WSDM_2026_WEB_and_GRAPH_2026_Workshop_on_Web_and_Graphs_Responsible_Intelligence_and_Social_Media.html

Vote

Add to Library

Recommend

4852. Retrieval Augmented Question Answering: When Should LLMs Admit Ignorance?

Dingmin Wang, Ji Ma, Shankar Kumar

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.23836v1

Vote

Add to Library

Recommend

4853. Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval

Dao Sy Duy Minh, Huynh Trung Kiet, Nguyen Lam Phu Quy, Phu-Hoa Pham, Tran Chi Nguyen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.21221v1

Vote

Add to Library

Recommend

4854. How important is Recall for Measuring Retrieval Quality?

Shelly Schwartz, Oleg Vasilyev, Randy Sawaya

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20854v1

Vote

Add to Library

Recommend

4855. Storage and retrieval of optical skyrmions with topological characteristics

Jinwen Wang, Xin Yang, Yun Chen, Zhujun Ye, Xinji Zeng, Yongkun Zhou, Shuya Zhang, Claire Marie Cisowski et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20378v1

Vote

Add to Library

Recommend

4856. MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents

Xingbo Du, Loka Li, Duzhen Zhang, Le Song

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20237v1

Vote

Add to Library

Recommend

4857. Retrieval-augmented Prompt Learning for Pre-trained Foundation Models

Xiang Chen, Yixin Ou, Quan Feng, Lei Li, Piji Li, Haibo Ye, Sheng-Jun Huang, Shuofei Qiao et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20145v1

Vote

Add to Library

Recommend

4858. Beyond Vision: Contextually Enriched Image Captioning with Multi-Modal Retrieva

Nguyen Lam Phu Quy, Pham Phu Hoa, Tran Chi Nguyen, Dao Sy Duy Minh, Nguyen Hoang Minh Ngoc, Huynh Trung Kiet

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20042v1

Vote

Add to Library

Recommend

4859. Auto-Prompting with Retrieval Guidance for Frame Detection in Logistics

Do Minh Duc, Quan Xuan Truong, Nguyen Tat Dat, Nguyen Van Vinh

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.19247v1

Vote

Add to Library

Recommend

4860. Reason to Contrast: A Cascaded Multimodal Retrieval Framework

Xuanming Cui, Hong-You Chen, Hao Yu, Hao Yuan, Zihao Wang, Shlok Kumar Mishra, Hanchao Yu, Yonghuan Yang et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.23369v1

Vote

Add to Library

Recommend

Benty-search

4841. GlyRAG: Context-Aware Retrieval-Augmented Framework for Blood Glucose Forecasting

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.05353v1

4842. Multivector Reranking in the Era of Strong First-Stage Retrievers

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.05200v1

4843. RAAR: Retrieval Augmented Agentic Reasoning for Cross-Domain Misinformation Detection

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04853v1

4844. Orion-RAG: Path-Aligned Hybrid Retrieval for Graphless Data

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04764v1

4845. Enhancing Multimodal Retrieval via Complementary Information Extraction and Alignment

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04571v1

4846. SoK: Privacy Risks and Mitigations in Retrieval-Augmented Generation Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.03979v1

4847. Improving Scientific Document Retrieval with Academic Concept Index

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.00567v1

4848. MACA: A Framework for Distilling Trustworthy LLMs into Efficient Retrievers

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.00926v1

4849. Nonlinear determination and phase retrieval under unimodular constraints

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.00403v1

4850. R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.24684v1

4851. SPARK: Search Personalization via Agent-Driven Retrieval and Knowledge-sharing

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.24008v1

4852. Retrieval Augmented Question Answering: When Should LLMs Admit Ignorance?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.23836v1

4853. Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.21221v1

4854. How important is Recall for Measuring Retrieval Quality?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.20854v1

4855. Storage and retrieval of optical skyrmions with topological characteristics

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.20378v1

4856. MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.20237v1

4857. Retrieval-augmented Prompt Learning for Pre-trained Foundation Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.20145v1

4858. Beyond Vision: Contextually Enriched Image Captioning with Multi-Modal Retrieva

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.20042v1

4859. Auto-Prompting with Retrieval Guidance for Frame Detection in Logistics

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2512.19247v1

4860. Reason to Contrast: A Cascaded Multimodal Retrieval Framework

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.23369v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.05353v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.05200v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04853v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04764v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04571v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.03979v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.00567v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.00926v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.00403v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.24684v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.24008v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.23836v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.21221v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20854v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20378v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20237v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20145v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.20042v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2512.19247v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.23369v1