benty-fields - Search paper

Unauthorized disclosure of confidential documents demands robust, low-leakage classification. In real work environments, there is a lot of inflow and outflow of documents. To continuously update knowledge, we propose a methodology for classifying confidential documents using Retrieval Augmented Classification (RAC). To confirm this effectiveness, we compare RAC and supervised fine tuning (FT) on the WikiLeaks US Diplomacy corpus under realistic sequence-length constraints. On balanced data, RAC matches FT. On unbalanced data, RAC is more stable while delivering comparable performance--about 96% Accuracy on both the original (unbalanced) and augmented (balanced) sets, and up to 94% F1 with proper prompting--whereas FT attains 90% F1 trained on the augmented, balanced set but drops to 88% F1 trained on the original, unbalanced set. When robust augmentation is infeasible, RAC provides a practical, security-preserving path to strong classification by keeping sensitive content out of model weights and under your control, and it remains robust as real-world conditions change in class balance, data, context length, or governance requirements. Because RAC grounds decisions in an external vector store with similarity matching, it is less sensitive to label skew, reduces parameter-level leakage, and can incorporate new data immediately via reindexing--a difficult step for FT, which typically requires retraining. The contributions of this paper are threefold: first, a RAC-based classification pipeline and evaluation recipe; second, a controlled study that isolates class imbalance and context-length effects for FT versus RAC in confidential-document grading; and third, actionable guidance on RAC design patterns for governed deployments.
Authors' comments: Appears in: KSII The 17th International Conference on Internet (ICONI) 2025, Dec 2025. 7 pages (48-54)

Vote

Add to Library

Recommend

3448. Feedback Adaptation for Retrieval-Augmented Generation

Jihwan Bang, Seunghan Yang, Kyuhong Shim, Simyung Chang, Juntae Lee, Sungha Choi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.06647v1

Vote

Add to Library

Recommend

3449. THIVLVC: Retrieval Augmented Dependency Parsing for Latin

Luc Pommeret, Thibault Wagret, Jules Deret

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.05564v1

Vote

Add to Library

Recommend

3450. Spike Hijacking in Late-Interaction Retrieval

Karthik Suresh, Tushar Vatsa, Tracy King, Asim Kadav, Michael Friedrich

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.05253v1

Vote

Add to Library

Recommend

3451. Retrieval Augmented Conversational Recommendation with Reinforcement Learning

Zhenrui Yue, Honglei Zhuang, Zhen Qin, Zhankui He, Huimin Zeng, Julian McAuley, Dong Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.04457v1

Large language models (LLMs) exhibit enhanced capabilities in language understanding and generation. By utilizing their embedded knowledge, LLMs are increasingly used as conversational recommender systems (CRS), achieving improved performance across diverse scenarios. However, existing LLM-based methods rely on pretrained knowledge without external retrieval mechanisms for novel items. Additionally, the lack of a unified corpus poses challenges for integrating retrieval augmentation into CRS. Motivated by these challenges, we present RAR, a novel two-stage retrieval augmented conversational recommendation framework that aligns retrieval and generation to enhance both performance and factuality. To support this framework and provide a unified corpus, we construct a large-scale movie corpus, comprising over 300k movies with rich metadata, such as titles, casts and plot summaries. Leveraging this data, our primary contribution is RAR, the first framework to departs from standard two-stage CRS by dynamically bridging retrieval and generation. First, a retriever model generates candidate items based on user history; in the subsequent stage, an LLM refines the recommendations by incorporating conversational context with retrieved results. In addition, we introduce a novel reinforcement learning (RL) method that leverages LLM feedback to iteratively update the retriever. By creating a collaborative feedback loop that reinforces sampled candidate sets with higher ranking metrics, RAR effectively mitigates the misalignment between the retrieval and generation stages. Furthermore, grounding the LLM in factual metadata allows our RL-driven approach to capture subtle user intentions and generate context-aware recommendations with reduced hallucinations. We validate our approach through extensive experiments on multiple benchmarks, where RAR consistently outperforms state-of-the-art baseline methods.

Vote

Add to Library

Recommend

3452. DOTRAG: Retrieval-Time Reasoning Along Paths

Larnell Moore, Naihao Deng, Rada Mihalcea, Farnaz Jahanbakhsh

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.18760v1

Vote

Add to Library

Recommend

3453. Align then Train: Efficient Retrieval Adapter Learning

Seiji Maekawa, Moin Aminnaseri, Pouya Pezeshkpour, Estevam Hruschka

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.03403v1

Vote

Add to Library

Recommend

3454. Learning to Retrieve from Agent Trajectories

Yuqi Zhou, Sunhao Dai, Changle Qu, Liang Pang, Jun Xu, Ji-Rong Wen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.04949v1

Vote

Add to Library

Recommend

3455. Retrieval-Augmented Generation Based Nurse Observation Extraction

Kyomin Hwang, Nojun Kwak

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.26046v1

Vote

Add to Library

Recommend

3456. Retrieving Climate Change Disinformation by Narrative

Max Upravitelev, Veronika Solopova, Charlott Jakob, Premtim Sahitaj, Vera Schmitt

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.22015v1

Vote

Add to Library

Recommend

3457. CoVR-R:Reason-Aware Composed Video Retrieval

Omkar Thawakar, Dmitry Demidov, Vaishnav Potlapalli, Sai Prasanna Teja Reddy Bogireddy, Viswanatha Reddy Gajjala, Alaa Mostafa Lasheen, Rao Muhammad Anwer, Fahad Khan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.20190v1

Vote

Add to Library

Recommend

3458. Retrieval-Augmented LLMs for Security Incident Analysis

Xavier Cadet, Aditya Vikram Singh, Harsh Mamania, Edward Koh, Alex Fitts, Dirk Van Bruggen, Simona Boboila, Peter Chin et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.18196v1

Vote

Add to Library

Recommend

3459. Retrieving Counterfactuals Improves Visual In-Context Learning

Guangzhi Xiong, Sanchit Sinha, Zhenghao He, Aidong Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.16737v1

Vote

Add to Library

Recommend

3460. Retrieval-Augmented Sketch-Guided 3D Building Generation

Zhengyang Wang, Nuttapong Rochanavibhata, Yuxiao Ren, Xusheng Du, Ye Zhang, Haoran Xie

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.16612v1

Vote

Add to Library

Recommend

Benty-search

3441. MultiHedge: Adaptive Coordination via Retrieval-Augmented Control

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.24905v1

3442. ATIR: Towards Audio-Text Interleaved Contextual Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.20267v1

3443. TypeScript Repository Indexing for Code Agent Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.18413v1

3444. RACER: Retrieval-Augmented Contextual Rapid Speculative Decoding

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.14885v1

3445. A Sanity Check on Composed Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.12904v1

3446. Bottleneck Tokens for Unified Multimodal Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.11095v1

3447. Retrieval Augmented Classification for Confidential Documents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.08628v1

3448. Feedback Adaptation for Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.06647v1

3449. THIVLVC: Retrieval Augmented Dependency Parsing for Latin

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.05564v1

3450. Spike Hijacking in Late-Interaction Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.05253v1

3451. Retrieval Augmented Conversational Recommendation with Reinforcement Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.04457v1

3452. DOTRAG: Retrieval-Time Reasoning Along Paths

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2605.18760v1

3453. Align then Train: Efficient Retrieval Adapter Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.03403v1

3454. Learning to Retrieve from Agent Trajectories

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.04949v1

3455. Retrieval-Augmented Generation Based Nurse Observation Extraction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.26046v1

3456. Retrieving Climate Change Disinformation by Narrative

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.22015v1

3457. CoVR-R:Reason-Aware Composed Video Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.20190v1

3458. Retrieval-Augmented LLMs for Security Incident Analysis

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.18196v1

3459. Retrieving Counterfactuals Improves Visual In-Context Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.16737v1

3460. Retrieval-Augmented Sketch-Guided 3D Building Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.16612v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.24905v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.20267v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.18413v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14885v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12904v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.11095v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.08628v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.06647v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.05564v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.05253v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.04457v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.18760v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.03403v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.04949v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.26046v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.22015v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.20190v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.18196v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.16737v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.16612v1