benty-fields - Search paper

8161. MedTutor: A Retrieval-Augmented LLM System for Case-Based Medical Education

Dongsuk Jang, Ziyao Shangguan, Kyle Tegtmeyer, Anurag Gupta, Jan Czerminski, Sophie Chheang, Arman Cohan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06979v1

The learning process for medical residents presents significant challenges, demanding both the ability to interpret complex case reports and the rapid acquisition of accurate medical knowledge from reliable sources. Residents typically study case reports and engage in discussions with peers and mentors, but finding relevant educational materials and evidence to support their learning from these cases is often time-consuming and challenging. To address this, we introduce MedTutor, a novel system designed to augment resident training by automatically generating evidence-based educational content and multiple-choice questions from clinical case reports. MedTutor leverages a Retrieval-Augmented Generation (RAG) pipeline that takes clinical case reports as input and produces targeted educational materials. The system's architecture features a hybrid retrieval mechanism that synergistically queries a local knowledge base of medical textbooks and academic literature (using PubMed, Semantic Scholar APIs) for the latest related research, ensuring the generated content is both foundationally sound and current. The retrieved evidence is filtered and ordered using a state-of-the-art reranking model and then an LLM generates the final long-form output describing the main educational content regarding the case-report. We conduct a rigorous evaluation of the system. First, three radiologists assessed the quality of outputs, finding them to be of high clinical and educational value. Second, we perform a large scale evaluation using an LLM-as-a Judge to understand if LLMs can be used to evaluate the output of the system. Our analysis using correlation between LLMs outputs and human expert judgments reveals a moderate alignment and highlights the continued necessity of expert oversight.
Authors' comments: Accepted to EMNLP 2025 (System Demonstrations)

Benty-search

8161. MedTutor: A Retrieval-Augmented LLM System for Case-Based Medical Education

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06979v1

8162. UETQuintet at BioCreative IX - MedHopQA: Enhancing Biomedical QA with Selective Multi-hop Reasoning and Contextual Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06974v1

8163. Seeing through the Conflict: Transparent Knowledge Conflict Handling in Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06842v1

8164. CIRAG: Construction-Integration Retrieval and Adaptive Generation for Multi-hop Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06799v1

8165. CSR-RAG: An Efficient Retrieval System for Text-to-SQL on the Enterprise Scale

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06564v1

8166. L-RAG: Balancing Context and Retrieval with Entropy-Based Lazy Loading

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06551v1

8167. Multi-task Cross-modal Learning for Chest X-ray Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.05399v1

8168. OptiSet: Unified Optimizing Set Selection and Ranking for Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.05027v1

8169. SOVABench: A Vehicle Surveillance Action Retrieval Benchmark for Multimodal Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04824v1

8170. LANGSAE EDITING: Improving Multilingual Information Retrieval via Post-hoc Language Identity Removal

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04768v1

8171. Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04651v1

8172. Succeeding at Scale: Automated Multi-Retriever Fusion and Query-Side Adaptation for Multi-Tenant Search

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04646v1

8173. Self-MedRAG: a Self-Reflective Hybrid Retrieval-Augmented Generation Framework for Reliable Medical Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04531v1

8174. The Overlooked Role of Graded Relevance Thresholds in Multilingual Dense Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.04395v1

8175. CSMCIR: CoT-Enhanced Symmetric Alignment with Memory Bank for Composed Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.03728v1

8176. Contract2Plan: Verified Contract-Grounded Retrieval-Augmented Optimization for BOM-Aware Procurement and Multi-Echelon Inventory Planning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.06164v1

8177. Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.03052v1

8178. SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.03014v1

8179. Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.02978v1

8180. RAL2M: Retrieval Augmented Learning-To-Match Against Hallucination in Compliance-Guaranteed Service Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2601.02917v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06979v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06974v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06842v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06799v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06564v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06551v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.05399v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.05027v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04824v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04768v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04651v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04646v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04531v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.04395v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.03728v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.06164v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.03052v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.03014v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.02978v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2601.02917v1