benty-fields - Search paper

8541. Rationale-Augmented Retrieval with Constrained LLM Re-Ranking for Task Discovery

Bowen Wei

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.05131v1

Vote

Add to Library

Recommend

8542. Retrieval and Augmentation of Domain Knowledge for Text-to-SQL Semantic Parsing

Manasi Patwardhan, Ayush Agarwal, Shabbirhussain Bhaisaheb, Aseem Arora, Lovekesh Vig, Sunita Sarawagi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02394v1

Vote

Add to Library

Recommend

8543. Graph-S3: Enhancing Agentic textual Graph Retrieval with Synthetic Stepwise Supervision

Ge Chang, Jinbo Su, Jiacheng Liu, Pengfei Yang, Yuhao Shang, Huiwen Zheng, Hongli Ma, Yan Liang et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.03323v1

Vote

Add to Library

Recommend

8544. Which Programming Language and Model Work Best With LLM-as-a-Judge For Code Retrieval?

Lucas Roberts, Denisa Roberts

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.00324v1

Code search is an important information retrieval application. Benefits of better code search include faster new developer on-boarding, reduced software maintenance, and ease of understanding for large repositories. Despite improvements in search algorithms and search benchmarks, the domain of code search has lagged behind. One reason is the high cost of human annotation for code queries and answers. While humans may annotate search results in general text QA systems, code annotations require specialized knowledge of a programming language (PL), as well as domain specific software engineering knowledge. In this work we study the use of Large Language Models (LLMs) to retrieve code at the level of functions and to generate annotations for code search results. We compare the impact of the retriever representation (sparse vs. semantic), programming language, and LLM by comparing human annotations across several popular languages (C, Java, Javascript, Go, and Python). We focus on repositories that implement common data structures likely to be implemented in any PLs. For the same human annotations, we compare several LLM-as-a-Judge models to evaluate programming language and other affinities between LLMs. We find that the chosen retriever and PL exhibit affinities that can be leveraged to improve alignment of human and AI relevance determinations, with significant performance implications. We also find differences in representation (sparse vs. semantic) across PLs that impact alignment of human and AI relevance determinations. We propose using transpilers to bootstrap scalable code search benchmark datasets in other PLs and in a case study demonstrate that human-AI relevance agreement rates largely match the (worst case) human-human agreement under study. The application code used in this work is available at \href{https://github.com/rlucas7/code-searcher/}{this github repo}.
Authors' comments: Accepted as a full paper at SIGIR-AP 2025

Vote

Add to Library

Recommend

8545. Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models

Matheus Vinicius da Silva de Oliveira, Jonathan de Andrade Silva, Awdren de Lima Fontao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26584v1

Vote

Add to Library

Recommend

8546. MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval

Junjie Zhou, Ze Liu, Lei Xiong, Jin-Ge Yao, Yueze Wang, Shitao Xiao, Fenfen Lin, Miguel Hu Chen et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26378v1

Vote

Add to Library

Recommend

8547. RE$^2$: Improving Chinese Grammatical Error Correction via Retrieving Appropriate Examples with Explanation

Baoxin Wang, Yumeng Luo, Yixuan Wang, Dayong Wu, Wanxiang Che, Shijin Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26038v1

Vote

Add to Library

Recommend

8548. SETR: A Two-Stage Semantic-Enhanced Framework for Zero-Shot Composed Image Retrieval

Yuqi Xiao, Yingying Zhu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26012v1

Vote

Add to Library

Recommend

8549. Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions

Junbeom Kim, Kyuyoung Kim, Jihoon Tack, Dongha Lim, Jinwoo Shin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.25973v1

Vote

Add to Library

Recommend

8550. Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation

Haoyue Bai, Haoyu Wang, Shengyu Chen, Zhengzhang Chen, Lu-An Tang, Wei Cheng, Haifeng Chen, Yanjie Fu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02388v1

Vote

Add to Library

Recommend

8551. Thin Bridges for Drug Text Alignment: Lightweight Contrastive Learning for Target Specific Drug Retrieval

Mallikarjuna Tupakula

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.03309v1

Vote

Add to Library

Recommend

8552. TRUE: A Reproducible Framework for LLM-Driven Relevance Judgment in Information Retrieval

Mouly Dewan, Jiqun Liu, Chirag Shah

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.25602v1

Vote

Add to Library

Recommend

8553. Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models

Yuhui Wang, Changjiang Li, Guangke Chen, Jiacheng Liang, Ting Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.24156v1

Vote

Add to Library

Recommend

8554. ID-RAG: Identity Retrieval-Augmented Generation for Long-Horizon Persona Coherence in Generative Agents

Daniel Platnick, Mohamed E. Bengueddache, Marjan Alirezaie, Dava J. Newman, Alex ''Sandy'' Pentland, Hossein Rahnama

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.25299v1

Vote

Add to Library

Recommend

8555. Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs

Shreyas Singh, Kunal Singh, Pradeep Moturi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.24107v1

Vote

Add to Library

Recommend

8556. Multi-Value-Product Retrieval-Augmented Generation for Industrial Product Attribute Value Identification

Huike Zou, Haiyang Yang, Yindu Su, Liyu Chen, Chengbao Lian, Qingheng Zhang, Shuguang Han, Jufeng Chen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.23874v1

Vote

Add to Library

Recommend

8557. Transformer Tafsir at QIAS 2025 Shared Task: Hybrid Retrieval-Augmented Generation for Islamic Knowledge Question Answering

Muhammad Abu Ahmad, Mohamad Ballout, Raia Abu Ahmad, Elia Bruni

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.23793v1

Vote

Add to Library

Recommend

8558. Emission-GPT: A domain-specific language model agent for knowledge retrieval, emission inventory and data analysis

Jiashu Ye, Tong Wu, Weiwen Chen, Hao Zhang, Zeteng Lin, Xingxing Li, Shujuan Weng, Manni Zhu et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02359v1

Vote

Add to Library

Recommend

8559. From Evidence to Trajectory: Abductive Reasoning Path Synthesis for Training Retrieval-Augmented Generation Agents

Muzhi Li, Jinhu Qi, Yihong Wu, Minghao Zhao, Liheng Ma, Yifan Li, Xinyu Wang, Yingxue Zhang et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.23071v1

Vote

Add to Library

Recommend

8560. Retrieval-Augmented Guardrails for AI-Drafted Patient-Portal Messages: Error Taxonomy Construction and Large-Scale Evaluation

Wenyuan Chen, Fateme Nateghi Haredasht, Kameron C. Black, Francois Grolleau, Emily Alsentzer, Jonathan H. Chen, Stephen P. Ma

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22565v1

Vote

Add to Library

Recommend

Benty-search

8541. Rationale-Augmented Retrieval with Constrained LLM Re-Ranking for Task Discovery

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.05131v1

8542. Retrieval and Augmentation of Domain Knowledge for Text-to-SQL Semantic Parsing

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.02394v1

8543. Graph-S3: Enhancing Agentic textual Graph Retrieval with Synthetic Stepwise Supervision

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.03323v1

8544. Which Programming Language and Model Work Best With LLM-as-a-Judge For Code Retrieval?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.00324v1

8545. Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.26584v1

8546. MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.26378v1

8547. RE$^2$: Improving Chinese Grammatical Error Correction via Retrieving Appropriate Examples with Explanation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.26038v1

8548. SETR: A Two-Stage Semantic-Enhanced Framework for Zero-Shot Composed Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.26012v1

8549. Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.25973v1

8550. Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.02388v1

8551. Thin Bridges for Drug Text Alignment: Lightweight Contrastive Learning for Target Specific Drug Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.03309v1

8552. TRUE: A Reproducible Framework for LLM-Driven Relevance Judgment in Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.25602v1

8553. Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.24156v1

8554. ID-RAG: Identity Retrieval-Augmented Generation for Long-Horizon Persona Coherence in Generative Agents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.25299v1

8555. Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.24107v1

8556. Multi-Value-Product Retrieval-Augmented Generation for Industrial Product Attribute Value Identification

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.23874v1

8557. Transformer Tafsir at QIAS 2025 Shared Task: Hybrid Retrieval-Augmented Generation for Islamic Knowledge Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.23793v1

8558. Emission-GPT: A domain-specific language model agent for knowledge retrieval, emission inventory and data analysis

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2510.02359v1

8559. From Evidence to Trajectory: Abductive Reasoning Path Synthesis for Training Retrieval-Augmented Generation Agents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.23071v1

8560. Retrieval-Augmented Guardrails for AI-Drafted Patient-Portal Messages: Error Taxonomy Construction and Large-Scale Evaluation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2509.22565v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.05131v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02394v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.03323v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.00324v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26584v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26378v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26038v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.26012v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.25973v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02388v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.03309v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.25602v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.24156v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.25299v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.24107v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.23874v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.23793v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2510.02359v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.23071v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2509.22565v1