benty-fields - Search paper

8561. TrueGradeAI: Retrieval-Augmented and Bias-Resistant AI for Transparent and Explainable Digital Assessments

Rakesh Thakur, Shivaansh Kaushik, Gauri Chopra, Harsh Rohilla

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22516v1

Vote

Add to Library

Recommend

8562. Your RAG is Unfair: Exposing Fairness Vulnerabilities in Retrieval-Augmented Generation via Backdoor Attacks

Gaurav Bagwe, Saket S. Chaturvedi, Xiaolong Ma, Xiaoyong Yuan, Kuang-Ching Wang, Lan Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22486v1

Vote

Add to Library

Recommend

8563. Can Synthetic Query Rewrites Capture User Intent Better than Humans in Retrieval-Augmented Generation?

JiaYing Zheng, HaiNan Zhang, Liang Pang, YongXin Tong, ZhiMing Zheng

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22325v1

Vote

Add to Library

Recommend

8564. GraphSearch: An Agentic Deep Searching Workflow for Graph Retrieval-Augmented Generation

Cehao Yang, Xiaojun Wu, Xueyuan Lin, Chengjin Xu, Xuhui Jiang, Yuanliang Sun, Jia Li, Hui Xiong et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22009v1

Vote

Add to Library

Recommend

8565. Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding

Seong-Woong Shim, Myunsoo Kim, Jae Hyeon Cho, Byung-Jun Lee

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21865v1

Vote

Add to Library

Recommend

8566. Frustratingly Easy Zero-Day Audio DeepFake Detection via Retrieval Augmentation and Profile Matching

Xuechen Liu, Xin Wang, Junichi Yamagishi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21728v1

Vote

Add to Library

Recommend

8567. Cross-modal RAG: Sub-dimensional Text-to-Image Retrieval-Augmented Generation

Mengdan Zhu, Senhao Cheng, Guangji Bai, Yifei Zhang, Liang Zhao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.21956v2

Vote

Add to Library

Recommend

8568. Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems

Kai Hua, Zhiyuan Feng, Chongyang Tao, Rui Yan, Lu Zhang

Proceedings of the 29th ACM International Conference on Information & Knowledge Management, 525-534 (2020)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22845v1

Vote

Add to Library

Recommend

8569. PseudoBridge: Pseudo Code as the Bridge for Better Semantic and Logic Alignment in Code Retrieval

Yixuan Li, Xinyi Liu, Weidong Yang, Ben Fei, Shuhao Li, Mingjie Zhou, Lipeng Ma

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20881v1

Vote

Add to Library

Recommend

8570. Meta-Memory: Retrieving and Integrating Semantic-Spatial Memories for Robot Spatial Reasoning

Yufan Mao, Hanjing Ye, Wenlong Dong, Chengjie Zhang, Hong Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20754v1

Vote

Add to Library

Recommend

8571. An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans

Junjie Cui, Peilong Wang, Jason Holmes, Leshan Sun, Michael L. Hinni, Barbara A. Pockaj, Sujay A. Vora, Terence T. Sio et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20707v1

Purpose: To develop a retrieval-augmented generation (RAG) system powered by LLaMA-4 109B for automated, protocol-aware, and interpretable evaluation of radiotherapy treatment plans. Methods and Materials: We curated a multi-protocol dataset of 614 radiotherapy plans across four disease sites and constructed a knowledge base containing normalized dose metrics and protocol-defined constraints. The RAG system integrates three core modules: a retrieval engine optimized across five SentenceTransformer backbones, a percentile prediction component based on cohort similarity, and a clinical constraint checker. These tools are directed by a large language model (LLM) using a multi-step prompt-driven reasoning pipeline to produce concise, grounded evaluations. Results: Retrieval hyperparameters were optimized using Gaussian Process on a scalarized loss function combining root mean squared error (RMSE), mean absolute error (MAE), and clinically motivated accuracy thresholds. The best configuration, based on all-MiniLM-L6-v2, achieved perfect nearest-neighbor accuracy within a 5-percentile-point margin and a sub-2pt MAE. When tested end-to-end, the RAG system achieved 100% agreement with the computed values by standalone retrieval and constraint-checking modules on both percentile estimates and constraint identification, confirming reliable execution of all retrieval, prediction and checking steps. Conclusion: Our findings highlight the feasibility of combining structured population-based scoring with modular tool-augmented reasoning for transparent, scalable plan evaluation in radiation therapy. The system offers traceable outputs, minimizes hallucination, and demonstrates robustness across protocols. Future directions include clinician-led validation, and improved domain-adapted retrieval models to enhance real-world integration.
Authors' comments: 16 pages, 4 figures. Submitted to npj Digital Medicine

Vote

Add to Library

Recommend

8572. RJE: A Retrieval-Judgment-Exploration Framework for Efficient Knowledge Graph Question Answering with LLMs

Can Lin, Zhengwang Jiang, Ling Zheng, Qi Zhao, Yuhang Zhang, Qi Song, Wangqiu Zhou

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.01257v1

Vote

Add to Library

Recommend

8573. Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval

Vivek Bhavsar, Joseph Ereifej, Aravanan Gurusami

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02326v1

Vote

Add to Library

Recommend

8574. X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning

Prasanna Reddy Pulakurthi, Jiamian Wang, Majid Rabbani, Sohail Dianat, Raghuveer Rao, Zhiqiang Tao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21559v1

Vote

Add to Library

Recommend

8575. MIXRAG : Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

Lihui Liu, Carl J. Yang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21391v1

Vote

Add to Library

Recommend

8576. Enhancing LLM-based Fault Localization with a Functionality-Aware Retrieval-Augmented Generation Framework

Xinyu Shi, Zhenhao Li, An Ran Chen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20552v1

Vote

Add to Library

Recommend

8577. A Knowledge Graph and a Tripartite Evaluation Framework Make Retrieval-Augmented Generation Scalable and Transparent

Olalekan K. Akindele, Bhupesh Kumar Mishra, Kenneth Y. Wertheim

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.19209v1

Large Language Models (LLMs) have significantly enhanced conversational Artificial Intelligence(AI) chatbots; however, domain-specific accuracy and the avoidance of factual inconsistencies remain pressing challenges, particularly for large datasets. Designing an effective chatbot with appropriate methods and evaluating its effectiveness is among the challenges in this domain. This study presents a Retrieval Augmented Generation (RAG) chatbot that harnesses a knowledge graph and vector search retrieval to deliver precise, context-rich responses in an exemplary use case from over high-volume engineering project-related emails, thereby minimising the need for document chunking. A central innovation of this work is the introduction of RAG Evaluation (RAG-Eval), a novel chain-of-thought LLM-based tripartite evaluation framework specifically developed to assess RAG applications. This framework operates in parallel with the chatbot, jointly assessing the user's query, the retrieved document, and the generated response, enabling a holistic evaluation across multiple quality metrics like query relevance, factual accuracy, coverage, coherence and fluency. The resulting scoring system is provided directly to users as a confidence score (1 to 100%), enabling quick identification of possible misaligned or incomplete answers. This proposed approach promotes transparency and rapid verification by incorporating metadata email IDs, timestamps into responses. Experimental comparisons against BERTScore and G-EVAL for summarisation evaluation tasks confirm its effectiveness, and empirical analysis also shows RAG-Eval reliably detects factual gaps and query mismatches, thereby fostering trust in high demand, data centric environments. These findings highlight a scalable path for developing accurate, user-verifiable chatbots that bridge the gap between high-level conversational fluency and factual accuracy.
Authors' comments: 25 Pages

Vote

Add to Library

Recommend

8578. NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment

Ajay Narayanan Sridhar, Fuli Qiao, Nelson Daniel Troncoso Aldas, Yanpei Shi, Mehrdad Mahdavi, Laurent Itti, Vijaykrishnan Narayanan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.18672v1

Vote

Add to Library

Recommend

8579. CALL: Context-Aware Low-Latency Retrieval in Disk-Based Vector Databases

Yeonwoo Jeong, Hyunji Cho, Kyuri Park, Youngjae Kim, Sungyong Park

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.18670v1

Vote

Add to Library

Recommend

8580. MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction

Zilin Xiao, Qi Ma, Mengting Gu, Chun-cheng Jason Chen, Xintao Chen, Vicente Ordonez, Vijai Mohan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.18095v1

Vote

Add to Library

Recommend

Benty-search

8561. TrueGradeAI: Retrieval-Augmented and Bias-Resistant AI for Transparent and Explainable Digital Assessments

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.22516v1

8562. Your RAG is Unfair: Exposing Fairness Vulnerabilities in Retrieval-Augmented Generation via Backdoor Attacks

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.22486v1

8563. Can Synthetic Query Rewrites Capture User Intent Better than Humans in Retrieval-Augmented Generation?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.22325v1

8564. GraphSearch: An Agentic Deep Searching Workflow for Graph Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.22009v1

8565. Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.21865v1

8566. Frustratingly Easy Zero-Day Audio DeepFake Detection via Retrieval Augmentation and Profile Matching

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.21728v1

8567. Cross-modal RAG: Sub-dimensional Text-to-Image Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.21956v2

8568. Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.22845v1

8569. PseudoBridge: Pseudo Code as the Bridge for Better Semantic and Logic Alignment in Code Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.20881v1

8570. Meta-Memory: Retrieving and Integrating Semantic-Spatial Memories for Robot Spatial Reasoning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.20754v1

8571. An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.20707v1

8572. RJE: A Retrieval-Judgment-Exploration Framework for Efficient Knowledge Graph Question Answering with LLMs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.01257v1

8573. Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.02326v1

8574. X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.21559v1

8575. MIXRAG : Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.21391v1

8576. Enhancing LLM-based Fault Localization with a Functionality-Aware Retrieval-Augmented Generation Framework

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.20552v1

8577. A Knowledge Graph and a Tripartite Evaluation Framework Make Retrieval-Augmented Generation Scalable and Transparent

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.19209v1

8578. NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.18672v1

8579. CALL: Context-Aware Low-Latency Retrieval in Disk-Based Vector Databases

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.18670v1

8580. MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.18095v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22516v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22486v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22325v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22009v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21865v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21728v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.21956v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22845v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20881v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20754v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20707v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.01257v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02326v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21559v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.21391v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.20552v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.19209v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.18672v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.18670v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.18095v1