benty-fields - Search paper

9061. TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking

Ching Nam Hang, Pei-Duo Yu, Chee Wei Tan

2505.07891v1

9062. POISONCRAFT: Practical Poisoning of Retrieval-Augmented Generation for Large Language Models

Yangguang Shao, Xinjie Lin, Haozheng Luo, Chengshang Hou, Gang Xiong, Jiahao Yu, Junzheng Shi

2505.06579v1

Large language models (LLMs) have achieved remarkable success in various domains, primarily due to their strong capabilities in reasoning and generating human-like text. Despite their impressive performance, LLMs are susceptible to hallucinations, which can lead to incorrect or misleading outputs. This is primarily due to a lack of up-to-date knowledge or domain-specific information. Retrieval-augmented generation (RAG) is a promising approach to mitigating hallucinations by leveraging external knowledge sources. However, the security of RAG systems has not been thoroughly studied. In this paper, we study a poisoning attack on RAG systems named POISONCRAFT, which can mislead the model into referring to fraudulent websites. Compared to existing poisoning attacks on RAG systems, our attack is more practical because it requires neither access to information about the target user's query nor the ability to edit that query. It ensures not only that the injected texts can be retrieved by the model, but also that the LLM will be misled into referring to the injected texts in its response. We demonstrate the effectiveness of POISONCRAFT across different datasets, retrievers, and language models in RAG pipelines, and show that it remains effective when transferred across retrievers, including black-box systems. Moreover, we present a case study revealing how the attack influences both the retrieval behavior and the step-by-step reasoning trace within the generation model, and further evaluate the robustness of POISONCRAFT under multiple defense mechanisms. These results validate the practicality of our threat model and highlight a critical security risk for RAG systems deployed in real-world applications. We release our code at https://github.com/AndyShaw01/PoisonCraft to support future research on the security and robustness of RAG systems in real-world settings.
Authors' comments: 12 pages, 7 tables and 3 figures

9063. ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding

Shuai Wang, Ivona Najdenkoska, Hongyi Zhu, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

2505.06020v1

9064. Stealthy LLM-Driven Data Poisoning Attacks Against Embedding-Based Retrieval-Augmented Recommender Systems

Fatemeh Nazary, Yashar Deldjoo, Tommaso Di Noia, Eugenio Di Sciascio

2505.05196v1

9065. LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration

Yingyi Zhang, Pengyue Jia, Xianneng Li, Derong Xu, Maolin Wang, Yichao Wang, Zhaocheng Du, Huifeng Guo et al.

2505.05031v1

9066. An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education

Ramteja Sajja, Yusuf Sermet, Ibrahim Demir

2505.04916v1

9067. QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public

Mingruo Yuan, Ben Kao, Tien-Hsuan Wu

2505.04883v1

9068. Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval

Alexander Most, Joseph Winjum, Ayan Biswas, Shawn Jones, Nishath Rajiv Ranasinghe, Dan O'Malley, Manish Bhattarai

2505.05666v1

9069. Fine-Tuning Video-Text Contrastive Model for Primate Behavior Retrieval from Unlabeled Raw Videos

Giulio Cesare Mastrocinque Santo, Patrícia Izar, Irene Delval, Victor de Napole Gregolin, Nina S. T. Hirata

2505.05681v1

9070. CDE-Mapper: Using Retrieval-Augmented Language Models for Linking Clinical Data Elements to Controlled Vocabularies

Komal Gilani, Marlo Verket, Christof Peters, Michel Dumontier, Hans-Peter Brunner-La Rocca, Visara Urovi

2505.04365v1

9071. DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation

Naphat Nithisopa, Teerapong Panboonyuen

2505.04175v1

9072. Fine-Tuning Large Language Models and Evaluating Retrieval Methods for Improved Question Answering on Building Codes

Mohammad Aqib, Mohd Hamza, Qipei Mei, Ying Hei Chui

2505.04666v1

9073. An Analysis of Hyper-Parameter Optimization Methods for Retrieval Augmented Generation

Matan Orbach, Ohad Eytan, Benjamin Sznajder, Ariel Gera, Odellia Boni, Yoav Kantor, Gal Bloch, Omri Levy et al.

2505.03452v1

9074. Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation

Mohammad Shoaib Ansari, Mohd Sohail Ali Khan, Shubham Revankar, Aditya Varma, Anil S. Mokhade

2505.03406v1

9075. RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation

Tiantian Gan, Qiyao Sun

2505.03275v1

9076. SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation

Yu-Ren Guo, Wen-Kai Tai

2505.03244v2

9077. Nonconvex landscapes in phase retrieval and semidefinite low-rank matrix sensing with overparametrization

Andrew D. McRae

2505.02636v1

9078. Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality

Xueguang Ma, Luyu Gao, Shengyao Zhuang, Jiaqi Samantha Zhan, Jamie Callan, Jimmy Lin

2505.02466v1

9079. TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment

Zhichuan Wang, Yang Zhou, Jinhai Xiang, Yulong Wang, Xinwei He

2505.02325v1

9080. Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use

Justin Ho, Alexandra Colby, William Fisher

2505.02164v1
