benty-fields - Search paper

Dense retrievers and rerankers are central to retrieval-augmented generation (RAG) pipelines, where accurately retrieving factual information is crucial for maintaining system trustworthiness and defending against RAG poisoning. However, little is known about how much factual competence these components inherit or lose from the large language models (LLMs) they are based on. We pair 12 publicly released embedding checkpoints with their original base LLMs and evaluate both sets on a factuality benchmark. Across every model evaluated, the embedding variants achieve markedly lower accuracy than their bases, with absolute drops ranging from 12 to 43 percentage points (median 28 pts) and typical retriever accuracies collapsing into the 25-35 % band versus the 60-70 % attained by the generative models. This degradation intensifies under a more demanding condition: when the candidate pool per question is expanded from four options to one thousand, the strongest retriever's top-1 accuracy falls from 33 % to 26 %, revealing acute sensitivity to distractor volume. Statistical tests further show that, for every embedding model, cosine-similarity scores between queries and correct completions are significantly higher than those for incorrect ones (p < 0.01), indicating decisions driven largely by surface-level semantic proximity rather than factual reasoning. To probe this weakness, we employed GPT-4.1 to paraphrase each correct completion, creating a rewritten test set that preserved factual truth while masking lexical cues, and observed that over two-thirds of previously correct predictions flipped to wrong, reducing overall accuracy to roughly one-third of its original level. Taken together, these findings reveal a systematic trade-off introduced by contrastive learning for retrievers: gains in semantic retrieval are paid for with losses in parametric factual knowledge...
Authors' comments: Proceedings of the 34th ACM International Conference on Information and Knowledge Management
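The abstract above describes scoring candidate completions by cosine similarity to the query and reporting top-1 accuracy. A minimal sketch of that kind of evaluation follows, assuming embeddings are already available as plain lists of floats. The function names and the toy vectors are hypothetical and are not taken from the paper, whose actual benchmark and models are not reproduced here.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    # Zero vectors have no direction; return 0.0 by convention (an assumption).
    return dot / norm if norm else 0.0

def top1_accuracy(items):
    """items: list of (query_vec, candidate_vecs, index_of_correct_candidate)."""
    hits = 0
    for query, candidates, gold in items:
        scores = [cosine(query, c) for c in candidates]
        # The retriever's "answer" is the candidate with the highest similarity.
        predicted = max(range(len(scores)), key=scores.__getitem__)
        hits += predicted == gold
    return hits / len(items)

# Toy data for illustration only: two questions, two candidates each.
toy = [
    ([1.0, 0.0], [[0.9, 0.1], [0.0, 1.0]], 0),  # nearest candidate is correct
    ([0.0, 1.0], [[0.1, 0.9], [1.0, 0.0]], 1),  # nearest candidate is a distractor
]
print(top1_accuracy(toy))
```

In the second toy question the candidate closest to the query is not the correct one, which is the kind of failure the abstract attributes to decisions driven by semantic proximity.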

5130. MPFormer: Adaptive Framework for Industrial Multi-Task Personalized Sequential Retriever

Yijia Sun, Shanshan Huang, Linxiao Che, Haitao Lu, Qiang Luo, Kun Gai, Guorui Zhou

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.20400v1

5131. Selective Retrieval-Augmentation for Long-Tail Legal Text Classification

Boheng Mao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.19997v1

5132. Ontology-Based Concept Distillation for Radiology Report Retrieval and Labeling

Felix Nützel, Mischa Dombrowski, Bernhard Kainz

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.19915v1

Retrieval-augmented learning based on radiology reports has emerged as a promising direction to improve performance on long-tail medical imaging tasks, such as rare disease detection in chest X-rays. Most existing methods rely on comparing high-dimensional text embeddings from models like CLIP or CXR-BERT, which are often difficult to interpret, computationally expensive, and not well-aligned with the structured nature of medical knowledge. We propose a novel, ontology-driven alternative for comparing radiology report texts based on clinically grounded concepts from the Unified Medical Language System (UMLS). Our method extracts standardised medical entities from free-text reports using an enhanced pipeline built on RadGraph-XL and SapBERT. These entities are linked to UMLS concepts (CUIs), enabling a transparent, interpretable set-based representation of each report. We then define a task-adaptive similarity measure based on a modified and weighted version of the Tversky Index that accounts for synonymy, negation, and hierarchical relationships between medical entities. This allows efficient and semantically meaningful similarity comparisons between reports. We demonstrate that our approach outperforms state-of-the-art embedding-based retrieval methods in a radiograph classification task on MIMIC-CXR, particularly in long-tail settings. Additionally, we use our pipeline to generate ontology-backed disease labels for MIMIC-CXR, offering a valuable new resource for downstream learning tasks. Our work provides more explainable, reliable, and task-specific retrieval strategies in clinical AI systems, especially when interpretability and domain knowledge integration are essential. Our code is available at https://github.com/Felix-012/ontology-concept-distillation
Authors' comments: 10 pages, 3 figures, Preprint (submitted version, de-anonymized). Accepted at MLMI (MICCAI Workshop) 2025. Version of Record to appear in Springer LNCS; This preprint has not undergone peer review or any post-submission improvements or corrections
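The abstract above compares reports as sets of UMLS concepts using a modified, weighted Tversky Index. As a rough illustration, here is a sketch of the standard, unmodified Tversky index over two sets. It does not include the paper's handling of synonymy, negation, or hierarchy, and the function name, default weights, and concept identifiers are hypothetical.

```python
def tversky_index(a, b, alpha=0.5, beta=0.5):
    """Standard Tversky index: |A & B| / (|A & B| + alpha*|A - B| + beta*|B - A|)."""
    a, b = set(a), set(b)
    common = len(a & b)
    only_a = len(a - b)
    only_b = len(b - a)
    denom = common + alpha * only_a + beta * only_b
    # Two empty sets are treated as identical by convention (an assumption).
    return 1.0 if denom == 0 else common / denom

# Hypothetical CUI-like identifiers, for illustration only.
report_a = {"C0000001", "C0000002", "C0000003"}
report_b = {"C0000002", "C0000003", "C0000004"}

print(tversky_index(report_a, report_b))            # alpha = beta = 0.5 gives the Dice coefficient
print(tversky_index(report_a, report_b, 1.0, 1.0))  # alpha = beta = 1 gives the Jaccard index
```

Choosing alpha different from beta makes the measure asymmetric, which is one way a set-based similarity can be adapted to a task, for example by penalising concepts missing from the query report more than extra concepts in the retrieved one.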

5133. Enhancing Document VQA Models via Retrieval-Augmented Generation

Eric López, Artemis Llabrés, Ernest Valveny

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.18984v1

5134. Uniqueness of the Short-Time Linear Canonical Transform Phase Retrieval

Yali Dong, Rui Liu, Heying Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.18973v1

5135. UniC-RAG: Universal Knowledge Corruption Attacks to Retrieval-Augmented Generation

Runpeng Geng, Yanting Wang, Ying Chen, Jinyuan Jia

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.18652v1

5136. ArgRAG: Explainable Retrieval Augmented Generation using Quantitative Bipolar Argumentation

Yuqicheng Zhu, Nico Potyka, Daniel Hernández, Yuan He, Zifeng Ding, Bo Xiong, Dongzhuoran Zhou, Evgeny Kharlamov et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.20131v1

5137. HyST: LLM-Powered Hybrid Retrieval over Semi-Structured Tabular Data

Jiyoon Myung, Jihyeon Park, Joohyung Han

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.18048v1

5138. How Do LLM-Generated Texts Impact Term-Based Retrieval Models?

Wei Huang, Keping Bi, Yinqiong Cai, Wei Chen, Jiafeng Guo, Xueqi Cheng

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.17715v1

5139. Retrieval Capabilities of Large Language Models Scale with Pretraining FLOPs

Jacob Portes, Connor Jennings, Erica Ji Yuen, Sasha Doubov, Michael Carbin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.17400v1

5140. SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation

Xiaqiang Tang, Yi Wang, Keyu Hu, Rui Xu, Chuang Li, Weigao Sun, Jian Li, Sihong Xie

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2508.17225v1

5121. Do Retrieval Augmented Language Models Know When They Don't Know?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.01476v1

5122. Enhancing Partially Relevant Video Retrieval with Robust Alignment Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.01383v1

5123. MARS: Modality-Aligned Retrieval for Sequence Augmented CTR Prediction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.01184v1

5124. Privacy-Preserving Reasoning with Knowledge-Distilled Parametric Retrieval Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.01088v1

5125. Identifying Origins of Place Names via Retrieval Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.01030v1

5126. Secure and Scalable Face Retrieval via Cancelable Product Quantization

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.00781v1

5127. MultiFluxAI Enhancing Platform Engineering with Advanced Agent-Orchestrated Retrieval Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2508.21307v1

5128. On the Theoretical Limitations of Embedding-Based Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2508.21038v1

5129. Fact or Facsimile? Evaluating the Factual Robustness of Modern Retrievers

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2508.20408v1
