benty-fields - Search paper

Retrieval-augmented generation (RAG) is a widely used framework for reducing hallucinations in large language models (LLMs) on domain-specific tasks by retrieving relevant documents from a database to support accurate responses. However, when the database contains sensitive corpora, such as medical records or legal documents, RAG poses serious privacy risks by potentially exposing private information through its outputs. Prior work has demonstrated that one can practically craft adversarial prompts that force an LLM to regurgitate the augmented contexts. A promising direction is to integrate differential privacy (DP), a privacy notion that offers strong formal guarantees, into RAG systems. However, naively applying DP mechanisms into existing systems often leads to significant utility degradation. Particularly for RAG systems, DP can reduce the usefulness of the augmented contexts leading to increase risk of hallucination from the LLMs. Motivated by these challenges, we present DP-KSA, a novel privacy-preserving RAG algorithm that integrates DP using the propose-test-release paradigm. DP-KSA follows from a key observation that most question-answering (QA) queries can be sufficiently answered with a few keywords. Hence, DP-KSA first obtains an ensemble of relevant contexts, each of which will be used to generate a response from an LLM. We utilize these responses to obtain the most frequent keywords in a differentially private manner. Lastly, the keywords are augmented into the prompt for the final output. This approach effectively compresses the semantic space while preserving both utility and privacy. We formally show that DP-KSA provides formal DP guarantees on the generated output with respect to the RAG database. We evaluate DP-KSA on two QA benchmarks using three instruction-tuned LLMs, and our empirical results demonstrate that DP-KSA achieves a strong privacy-utility tradeoff.

Vote

Add to Library

Recommend

3247. LEMUR: Learned Multi-Vector Retrieval

Elias Jääsaari, Ville Hyvönen, Teemu Roos

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.21853v1

Vote

Add to Library

Recommend

3248. Legal Retrieval for Public Defenders

Dominik Stammbach, Kylie Zhang, Patty Liu, Nimra Nadeem, Lucia Zheng, Peter Henderson

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.14348v1

Vote

Add to Library

Recommend

3249. Retrieval Quality at Context Limit

Max McKinnon

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.05850v1

Vote

Add to Library

Recommend

3250. RzenEmbed: Towards Comprehensive Multimodal Retrieval

Weijian Jian, Yajun Zhang, Dawei Liang, Chunyu Xie, Yixiao He, Dawei Leng, Yuhui Yin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.27350v1

Vote

Add to Library

Recommend

3251. Instance-Level Composed Image Retrieval

Bill Psomas, George Retsinas, Nikos Efthymiadis, Panagiotis Filntisis, Yannis Avrithis, Petros Maragos, Ondrej Chum, Giorgos Tolias

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.25387v1

Vote

Add to Library

Recommend

3252. Retrieval-Augmented Multimodal Depression Detection

Ruibo Hou, Shiyu Teng, Jiaqing Liu, Shurong Chai, Yinhao Li, Lanfen Lin, Yen-Wei Chen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.01892v1

Vote

Add to Library

Recommend

3253. Instance optimality in phase retrieval

Yu Xia, Zhiqiang Xu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.22578v1

Vote

Add to Library

Recommend

3254. Concept Retrieval -- What and How?

Ori nizan, Oren Shrout, Ayellet Tal

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.07058v1

Vote

Add to Library

Recommend

3255. Hierarchical Semantic Retrieval with Cobweb

Anant Gupta, Karthik Singaravadivelan, Zekun Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02539v1

Vote

Add to Library

Recommend

3256. Private Information Retrieval over Graphs

Gennian Ge, Hao Wang, Zixiang Xu, Yijun Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26512v1

The problem of PIR in graph-based replication systems has received significant attention in recent years. A systematic study was conducted by Sadeh, Gu, and Tamo, where each file is replicated across two servers and the storage topology is modeled by a graph. The PIR capacity of a graph $G$, denoted by $\mathcal{C}(G)$, is defined as the supremum of retrieval rates achievable by schemes that preserve user privacy, with the rate measured as the ratio between the file size and the total number of bits downloaded. This paper makes the following key contributions. (1) The complete graph $K_N$ has emerged as a central benchmark in the study of PIR over graphs. The asymptotic gap between the upper and lower bounds for $\mathcal{C}(K_N)$ was previously 2 and was only recently reduced to $5/3$. We shrink this gap to $1.0444$, bringing it close to resolution. More precisely, (i) Sadeh, Gu, and Tamo proved that $\mathcal{C}(K_N)\le 2/(N+1)$ and conjectured this bound to be tight. We refute this conjecture by establishing the strictly stronger bound $\mathcal{C}(K_N) \le \frac{1.3922}{N}.$ We also improve the upper bound for the balanced complete bipartite graph $\mathcal{C}(K_{N/2,N/2})$. (ii) The first lower bound on $\mathcal{C}(K_N)$ was $(1+o(1))/N$, which was recently sharpened to $(6/5+o(1))/N$. We provide explicit, systematic constructions that further improve this bound, proving $\mathcal{C}(K_N)\ge(4/3-o(1))/N,$ which in particular implies $\mathcal{C}(G) \ge (4/3-o(1))/|G|$ for every graph $G$. (2) We establish a conceptual bridge between deterministic and probabilistic PIR schemes on graphs. This connection has significant implications for reducing the required subpacketization in practical implementations and is of independent interest. We also design a general probabilistic PIR scheme that performs particularly well on sparse graphs.
Authors' comments: 72 pages

Vote

Add to Library

Recommend

3257. Semantic Search for Information Retrieval

Kayla Farivar

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.17694v1

Vote

Add to Library

Recommend

3258. Efficient Direct-Access Ranked Retrieval

Mohsen Dehghankar, Raghav Mittal, Suraj Shetiya, Abolfazl Asudeh, Gautam Das

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.01108v1

Vote

Add to Library

Recommend

3259. Provably Secure Retrieval-Augmented Generation

Pengcheng Zhou, Yinglun Feng, Zhongliang Yang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.01084v1

Vote

Add to Library

Recommend

3260. PDF Retrieval Augmented Question Answering

Thi Thu Uyen Hoang, Viet Anh Nguyen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2506.18027v1

Vote

Add to Library

Recommend

Benty-search

3241. Thermodynamics of Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 0903.2792v3

3242. Approximate textual retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 0705.0751v1

3243. Intelligent Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | astro-ph/0510862v1

3244. ROSE: Retrieval-Oriented Segmentation Enhancement

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.14147v1

3245. Retrieval-Enhanced Real Estate Appraisal

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.12986v1

3246. Differentially Private Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.14374v1

3247. LEMUR: Learned Multi-Vector Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.21853v1

3248. Legal Retrieval for Public Defenders

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.14348v1

3249. Retrieval Quality at Context Limit

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.05850v1

3250. RzenEmbed: Towards Comprehensive Multimodal Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.27350v1

3251. Instance-Level Composed Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.25387v1

3252. Retrieval-Augmented Multimodal Depression Detection

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.01892v1

3253. Instance optimality in phase retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.22578v1

3254. Concept Retrieval -- What and How?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.07058v1

3255. Hierarchical Semantic Retrieval with Cobweb

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.02539v1

3256. Private Information Retrieval over Graphs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.26512v1

3257. Semantic Search for Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2508.17694v1

3258. Efficient Direct-Access Ranked Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2508.01108v1

3259. Provably Secure Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2508.01084v1

3260. PDF Retrieval Augmented Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2506.18027v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 0903.2792v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 0705.0751v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | astro-ph/0510862v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14147v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.12986v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.14374v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.21853v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.14348v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.05850v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.27350v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.25387v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.01892v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.22578v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.07058v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02539v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26512v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.17694v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.01108v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.01084v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2506.18027v1