benty-fields - Search paper

8281. Breaking It Down: Domain-Aware Semantic Segmentation for Retrieval Augmented Generation

Aparajitha Allamraju, Maitreya Prafulla Chitale, Hiranmai Sri Adibhatla, Rahul Mishra, Manish Shrivastava

arXiv:2512.00367v1

8282. Reasoning in Action: MCTS-Driven Knowledge Retrieval for Large Language Models

Shuqi Liu, Bowei He, Chen Ma, Linqi Song

arXiv:2601.00003v1

8283. Retrieval-Augmented Few-Shot Prompting Versus Fine-Tuning for Code Vulnerability Detection

Fouad Trad, Ali Chehab

arXiv:2512.04106v1

Few-shot prompting has emerged as a practical alternative to fine-tuning for leveraging the capabilities of large language models (LLMs) in specialized tasks. However, its effectiveness depends heavily on the selection and quality of in-context examples, particularly in complex domains. In this work, we examine retrieval-augmented prompting as a strategy to improve few-shot performance in code vulnerability detection, where the goal is to identify one or more security-relevant weaknesses present in a given code snippet from a predefined set of vulnerability categories. We perform a systematic evaluation using the Gemini-1.5-Flash model across three approaches: (1) standard few-shot prompting with randomly selected examples, (2) retrieval-augmented prompting using semantically similar examples, and (3) retrieval-based labeling, which assigns labels based on retrieved examples without model inference. Our results show that retrieval-augmented prompting consistently outperforms the other prompting strategies. At 20 shots, it achieves an F1 score of 74.05% and a partial match accuracy of 83.90%. We further compare this approach against zero-shot prompting and several fine-tuned models, including Gemini-1.5-Flash and smaller open-source models such as DistilBERT, DistilGPT2, and CodeBERT. Retrieval-augmented prompting outperforms both zero-shot (F1 score: 36.35%, partial match accuracy: 20.30%) and fine-tuned Gemini (F1 score: 59.31%, partial match accuracy: 53.10%), while avoiding the training time and cost associated with model fine-tuning. On the other hand, fine-tuning CodeBERT yields higher performance (F1 score: 91.22%, partial match accuracy: 91.30%) but requires additional training, maintenance effort, and resources.
Authors' comments: Accepted in the 3rd International Conference on Foundation and Large Language Models (FLLM2025)
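
The retrieval-augmented prompting and retrieval-based labeling approaches described in the abstract above can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration, not the authors' implementation: the bag-of-tokens `embed` function stands in for a real semantic embedding model, the example pool and label names are hypothetical, and the majority-vote rule in `retrieval_label` is one plausible way to assign labels from retrieved neighbours without calling a model.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-tokens vector; a stand-in for a real semantic embedding model."""
    return Counter(re.findall(r"[A-Za-z_]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, pool, k):
    """Return the k labeled examples most similar to the query snippet."""
    q = embed(query)
    return sorted(pool, key=lambda ex: cosine(q, embed(ex["code"])), reverse=True)[:k]


def build_prompt(query, shots):
    """Assemble a few-shot prompt from retrieved examples plus the query."""
    parts = [
        f"Code:\n{ex['code']}\nVulnerabilities: {', '.join(ex['labels'])}\n"
        for ex in shots
    ]
    parts.append(f"Code:\n{query}\nVulnerabilities:")
    return "\n".join(parts)


def retrieval_label(query, pool, k):
    """Label from retrieved neighbours only (no model call).

    Assumed rule: keep labels carried by more than half of the k neighbours.
    """
    votes = Counter(label for ex in retrieve(query, pool, k) for label in ex["labels"])
    return sorted(label for label, n in votes.items() if n > k / 2)


# Hypothetical labeled pool; snippets and category names are illustrative only.
POOL = [
    {"code": "strcpy(buf, user_input);", "labels": ["buffer-overflow"]},
    {"code": "strcat(buf, user_input);", "labels": ["buffer-overflow"]},
    {"code": 'query = "SELECT * FROM users WHERE id=" + user_id', "labels": ["sql-injection"]},
    {"code": 'os.system("ping " + host)', "labels": ["command-injection"]},
]

query = "strcpy(dest, user_input);"
shots = retrieve(query, POOL, k=2)
prompt = build_prompt(query, shots)
predicted = retrieval_label(query, POOL, k=2)
```

In this sketch, `prompt` would be sent to the language model for the retrieval-augmented prompting setting, while `predicted` corresponds to the retrieval-only baseline. Replacing `retrieve` with random sampling from the pool gives the standard few-shot baseline.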

8284. Autonomous QA Agent: A Retrieval-Augmented Framework for Reliable Selenium Script Generation

Dudekula Kasim Vali

arXiv:2601.06034v1

8285. ExoJAX Retrievals of VLT/CRIRES Spectra of Luhman 16AB: C/O Ratios and Systematic Uncertainties

Hibiki Yama, Kento Masuda, Yui Kawashima, Hajime Kawahara

arXiv:2511.23018v1

8286. STELLAR: Structure-guided LLM Assertion Retrieval and Generation for Formal Verification

Saeid Rajabi, Chengmo Yang, Satwik Patnaik

arXiv:2601.19903v1

8287. Hybrid, Unified and Iterative: A Novel Framework for Text-based Person Anomaly Retrieval

Tien-Huy Nguyen, Huu-Loc Tran, Huu-Phong Phan-Nguyen, Quang-Vinh Dinh

arXiv:2511.22470v1

8288. UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries

Hoang-Bao Le, Allie Tran, Binh T. Nguyen, Liting Zhou, Cathal Gurrin

arXiv:2511.22253v1

8289. Enhancing information retrieval in quantum-optical critical systems via quantum measurement backaction

Cheng Zhang, Mauro Cirio, Xin-Qi Li, Pengfei Liang

arXiv:2511.22248v1

8290. FIGROTD: A Friendly-to-Handle Dataset for Image Guided Retrieval with Optional Text

Hoang-Bao Le, Allie Tran, Binh T. Nguyen, Liting Zhou, Cathal Gurrin

arXiv:2511.22247v1

8291. The Spheres Dataset: Multitrack Orchestral Recordings for Music Source Separation and Information Retrieval

Jaime Garcia-Martinez, David Diaz-Guerra, John Anderson, Ricardo Falcon-Perez, Pablo Cabañas-Molero, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas

arXiv:2511.21247v1

8292. Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval

Anup Roy, Rishabh Gyanendra Upadhyay, Animesh Rameshbhai Panara, Robin Mills

arXiv:2511.21121v1

8293. 5G Network Automation Using Local Large Language Models and Retrieval-Augmented Generation

Ahmadreza Majlesara, Ali Majlesi, Ali Mamaghani, Alireza Shokrani, Babak Hossein Khalaj

arXiv:2511.21084v1

8294. Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning

Xiaoxing You, Qiang Huang, Lingyu Li, Chi Zhang, Xiaopeng Liu, Min Zhang, Jun Yu

arXiv:2511.21002v1

8295. Mispronunciation Detection and Diagnosis Without Model Training: A Retrieval-Based Approach

Huu Tuong Tu, Ha Viet Khanh, Tran Tien Dat, Vu Huan, Thien Van Luong, Nguyen Tien Cuong, Nguyen Thi Thu Trang

arXiv:2511.20107v1

8296. M$^3$Prune: Hierarchical Communication Graph Pruning for Efficient Multi-Modal Multi-Agent Retrieval-Augmented Generation

Weizi Shao, Taolin Zhang, Zijie Zhou, Chen Chen, Chengyu Wang, Xiaofeng He

arXiv:2511.19969v1

8297. RPM-MCTS: Knowledge-Retrieval as Process Reward Model with Monte Carlo Tree Search for Code Generation

Yuanyuan Lin, Xiangyu Ouyang, Teng Zhang, Kaixin Sui

arXiv:2511.19895v1

8298. Generative Query Expansion with Multilingual LLMs for Cross-Lingual Information Retrieval

Olivia Macmillan-Scott, Roksana Goworek, Eda B. Özyiğit

arXiv:2511.19325v1

8299. What Drives Cross-lingual Ranking? Retrieval Approaches with Multilingual Language Models

Roksana Goworek, Olivia Macmillan-Scott, Eda B. Özyiğit

arXiv:2511.19324v1

8300. Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation

Yingjia Shang, Yi Liu, Huimin Wang, Furong Li, Wenfang Sun, Wu Chengyu, Yefeng Zheng

arXiv:2511.19257v1

With the rapid advancement of retrieval-augmented vision-language models, multimodal medical retrieval-augmented generation (MMed-RAG) systems are increasingly adopted in clinical decision support. These systems enhance medical applications by performing cross-modal retrieval to integrate relevant visual and textual evidence for tasks, e.g., report generation and disease diagnosis. However, their complex architecture also introduces underexplored adversarial vulnerabilities, particularly via visual input perturbations. In this paper, we propose Medusa, a novel framework for crafting cross-modal transferable adversarial attacks on MMed-RAG systems under a black-box setting. Specifically, Medusa formulates the attack as a perturbation optimization problem, leveraging a multi-positive InfoNCE loss (MPIL) to align adversarial visual embeddings with medically plausible but malicious textual targets, thereby hijacking the retrieval process. To enhance transferability, we adopt a surrogate model ensemble and design a dual-loop optimization strategy augmented with invariant risk minimization (IRM). Extensive experiments on two real-world medical tasks, including medical report generation and disease diagnosis, demonstrate that Medusa achieves over 90% average attack success rate across various generation models and retrievers under appropriate parameter configuration, while remaining robust against four mainstream defenses, outperforming state-of-the-art baselines. Our results reveal critical vulnerabilities in the MMed-RAG systems and highlight the necessity of robustness benchmarking in safety-critical medical applications. The code and data are available at https://anonymous.4open.science/r/MMed-RAG-Attack-F05A.
Authors' comments: Accepted at KDD 2026 First Cycle (full version). Authors marked with * contributed equally. Yi Liu is the lead author
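
For orientation, one common way to write a multi-positive InfoNCE objective is given below. This is a generic textbook-style form and an assumption here, not necessarily the exact MPIL definition used in the paper: $v$ denotes the adversarial visual embedding, $P$ a set of target (positive) text embeddings, $N$ a set of non-target (negative) text embeddings, $\mathrm{sim}$ a similarity function such as cosine similarity, and $\tau$ a temperature. All of these symbols are introduced for illustration only.

```latex
\mathcal{L}_{\mathrm{MP}}(v)
  = -\log
    \frac{\sum_{t^{+} \in P} \exp\!\big(\mathrm{sim}(v, t^{+}) / \tau\big)}
         {\sum_{t \in P \cup N} \exp\!\big(\mathrm{sim}(v, t) / \tau\big)}
```

Minimizing a loss of this shape pulls $v$ toward every embedding in $P$ at once while pushing it away from those in $N$, which matches the abstract's description of aligning adversarial visual embeddings with several chosen textual targets.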
