benty-fields - Search paper

7721. Don't Retrieve, Navigate: Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG

Yiqun Sun, Pengfei Wei, Lawrence B. Hsieh

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14572v1

Vote

Add to Library

Recommend

7722. A Unified Model and Document Representation for On-Device Retrieval-Augmented Generation

Julian Killingback, Ofer Meshi, Henry Li, Hamed Zamani, Maryam Karimzadehgan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14403v1

Vote

Add to Library

Recommend

7723. ASTRA: Enhancing Multi-Subject Generation with Retrieval-Augmented Pose Guidance and Disentangled Position Embedding

Tianze Xia, Zijian Ning, Zonglin Zhao, Mingjia Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13938v1

Vote

Add to Library

Recommend

7724. ToolOmni: Enabling Open-World Tool Use via Agentic learning with Proactive Retrieval and Grounded Execution

Shouzheng Huang, Meishan Zhang, Baotian Hu, Min Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13787v1

Vote

Add to Library

Recommend

7725. Hybrid Retrieval for COVID-19 Literature: Comparing Rank Fusion and Projection Fusion with Diversity Reranking

Harishkumar Kishorkumar Prajapati

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13728v1

Vote

Add to Library

Recommend

7726. SLQ: Bridging Modalities via Shared Latent Queries for Retrieval with Frozen MLLMs

Haoran Lou, Ziyan Liu, Chunxiao Fan, Yuexin Wu, Yue Ming

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13710v1

Vote

Add to Library

Recommend

7727. ToolSpec: Accelerating Tool Calling via Schema-Aware and Retrieval-Augmented Speculative Decoding

Heming Xia, Yongqi Li, Cunxiao Du, Mingbo Song, Wenjie Li

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13519v1

Vote

Add to Library

Recommend

7728. From Relevance to Authority: Authority-aware Generative Retrieval in Web Search Engines

Sunkyung Lee, Jihye Back, Donghyeon Jeon, Soonhwan Kwon, Moonkwon Kim, Inho Kang, Jongwuk Lee

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13468v1

Vote

Add to Library

Recommend

7729. MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments

Han Wang, David Wan, Hyunji Lee, Thinh Pham, Mikaela Cankosyan, Weiyuan Chen, Elias Stengel-Eskin, Tu Vu et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13418v1

Motivated by the underspecified, multi-hop nature of search queries and the multimodal, heterogeneous, and often conflicting nature of real-world web results, we introduce MERRIN (Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments), a human-annotated benchmark for evaluating search-augmented agents. MERRIN measures AI agents' ability to identify relevant modalities, retrieve multimodal evidence, and perform multi-hop reasoning over noisy web sources. It differs from prior work in three important aspects: (1) using natural language queries without explicit modality cues, (2) incorporating underexplored modalities such as video and audio, and (3) requiring the retrieval of complex, often noisy or conflicting multimodal evidence during web search. We evaluate diverse search agents powered by ten models, including strong closed-source models (e.g., GPT-5.4-mini, Gemini 3/3.1 Flash/Pro) and open-weight models (Qwen3-4B/30B/235B), across three search settings (no search, native search, and agentic search). Our results show that MERRIN is highly challenging: the average accuracy across all agents is 22.3%, with the best-performing agent reaching only 40.1%. We further observe that while stronger agents like Gemini Deep Research achieve higher performance, gains are modest due to over-exploration; they take more steps and use more tools, but are often distracted by conflicting or partially relevant web content, leading to incorrect answers. Compared to humans, these agents consume more resources yet achieve lower accuracy, largely due to inefficient source selection and an overreliance on text modalities. These findings highlight the need for search agents capable of robust search and reasoning across diverse modalities in noisy web environments, making MERRIN a valuable testbed for evaluating such capabilities.
Authors' comments: First three authors contributed equally. Project Page: https://merrin-benchmark.github.io/

Vote

Add to Library

Recommend

7730. FRESCO: Benchmarking and Optimizing Re-rankers for Evolving Semantic Conflict in Retrieval-Augmented Generation

Sohyun An, Hayeon Lee, Shuibenyang Yuan, Chun-cheng Jason Chen, Cho-Jui Hsieh, Vijai Mohan, Alexander Min

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14227v1

Vote

Add to Library

Recommend

7731. AffectAgent: Collaborative Multi-Agent Reasoning for Retrieval-Augmented Multimodal Emotion Recognition

Zeheng Wang, Zitong Yu, Yijie Zhu, Bo Zhao, Haochen Liang, Taorui Wang, Wei Xia, Jiayu Zhang et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12735v1

Vote

Add to Library

Recommend

7732. Transforming External Knowledge into Triplets for Enhanced Retrieval in RAG of LLMs

Xudong Wang, Chaoning Zhang, Qigan Sun, Zhenzhen Huang, Chang Lu, Sheng Zheng, Zeyu Ma, Caiyan Qin et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12610v1

Vote

Add to Library

Recommend

7733. Adaptive Query Routing: A Tier-Based Framework for Hybrid Retrieval Across Financial, Legal, and Medical Documents

Afshan Hashmi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14222v1

Vote

Add to Library

Recommend

7734. Beyond Factual Grounding: The Case for Opinion-Aware Retrieval-Augmented Generation

Aditya Agrawal, Alwarappan Nakkiran, Darshan Fofadiya, Alex Karlsson, Harsha Aduri

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12138v1

RAG systems have transformed how LLMs access external knowledge, but we find that current implementations exhibit a bias toward factual, objective content, as evidenced by existing benchmarks and datasets that prioritize objective retrieval. This factual bias - treating opinions and diverse perspectives as noise rather than information to be synthesized - limits RAG systems in real-world scenarios involving subjective content, from social media discussions to product reviews. Beyond technical limitations, this bias poses risks to transparent and accountable AI: echo chamber effects that amplify dominant viewpoints, systematic underrepresentation of minority voices, and potential opinion manipulation through biased information synthesis. We formalize this limitation through the lens of uncertainty: factual queries involve epistemic uncertainty reducible through evidence, while opinion queries involve aleatoric uncertainty reflecting genuine heterogeneity in human perspectives. This distinction implies that factual RAG should minimize posterior entropy, whereas opinion-aware RAG must preserve it. Building on this theoretical foundation, we present an Opinion-Aware RAG architecture featuring LLM-based opinion extraction, entity-linked opinion graphs, and opinion-enriched document indexing. We evaluate our approach on e-commerce seller forum data, comparing an Opinion-Enriched knowledge base against a traditional baseline. Experiments demonstrate substantial improvements in retrieval diversity: +26.8% sentiment diversity, +42.7% entity match rate, and +31.6% author demographic coverage on entity-matched documents. Our results provide empirical evidence that treating subjectivity as a first-class citizen yields measurably more representative retrieval-a first step toward opinion-aware RAG. Future work includes joint optimization of retrieval and generation for distributional fidelity.
Authors' comments: 13 pages, Preprint under review

Vote

Add to Library

Recommend

7735. Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval

Willy Carlos Tchuitcheu, Tan Lu, Ann Dooms

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12133v1

Vote

Add to Library

Recommend

7736. RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering

Zhuoyu Wu, Wenhui Ou, Pei-Sze Tan, Wenqi Fang, Sailaja Rajanala, Raphaël C. -W. Phan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.11229v1

Vote

Add to Library

Recommend

7737. ARHN: Answer-Centric Relabeling of Hard Negatives with Open-Source LLMs for Dense Retrieval

Hyewon Choi, Jooyoung Choi, Hansol Jang, Hyun Kim, Chulmin Yun, ChangWook Jun, Stanley Jungkyu Choi

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.11092v1

Vote

Add to Library

Recommend

7738. RAG-KT: Cross-platform Explainable Knowledge Tracing with Multi-view Fusion Retrieval Generation

Zhiyi Duan, Hongyu Yuan, Rui Liu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.10960v1

Vote

Add to Library

Recommend

7739. CMedTEB & CARE: Benchmarking and Enabling Efficient Chinese Medical Retrieval via Asymmetric Encoders

Angqing Jiang, Jianlyu Chen, Zhe Fang, Yongcan Wang, Xinpeng Li, Keyu Ding, Defu Lian

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.10937v1

Vote

Add to Library

Recommend

7740. From Query to Conscience: The Importance of Information Retrieval in Empowering Socially Responsible Consumerism

Frans van der Sluis, Leif Azzopardi, Florian Meier

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.10751v1

Millions of consumers search for products online each day, aiming to find items that meet their needs at an acceptable price. While price and quality are major factors in purchasing decisions, ethical considerations increasingly influence consumer behavior, giving rise to the socially responsible consumer. Insights from a recent survey of over 600 consumers reveal that many barriers to ethical shopping stem from information-seeking challenges, often leading to decisions made under uncertainty. These challenges contribute to the intention-behaviour gap, where consumers' desire to make ethical choices is undermined by limited or inaccessible information and inefficacy of search systems in supporting responsible decision-making. In this perspectives paper, we argue that the field of Information Retrieval (IR) has a critical role to play by empowering consumers to make more informed and more responsible choices. We present three interrelated perspectives: (1) reframing responsible consumption as an information extraction problem aimed at reducing information asymmetries; (2) redefining product search as a complex task requiring interfaces that lower the cost and burden of responsible search; and (3) reimagining search as a process of knowledge calibration that helps consumers bridge gaps in awareness when making purchasing decisions. Taken together, these perspectives outline a path from query to conscience, one where IR systems help transform everyday product searches into opportunities for more ethical and informed choices. We advocate for the development of new and novel IR systems and interfaces that address the intricacies of socially responsible consumerism, and call on the IR community to build technologies that make ethical decisions more informed, convenient, and aligned with economic realities.
Authors' comments: 12 pages, 4 figures. Published in SIGIR '25 (ACM), pp. 3853-3864. Peer reviewed

Vote

Add to Library

Recommend

Benty-search

7721. Don't Retrieve, Navigate: Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.14572v1

7722. A Unified Model and Document Representation for On-Device Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.14403v1

7723. ASTRA: Enhancing Multi-Subject Generation with Retrieval-Augmented Pose Guidance and Disentangled Position Embedding

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13938v1

7724. ToolOmni: Enabling Open-World Tool Use via Agentic learning with Proactive Retrieval and Grounded Execution

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13787v1

7725. Hybrid Retrieval for COVID-19 Literature: Comparing Rank Fusion and Projection Fusion with Diversity Reranking

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13728v1

7726. SLQ: Bridging Modalities via Shared Latent Queries for Retrieval with Frozen MLLMs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13710v1

7727. ToolSpec: Accelerating Tool Calling via Schema-Aware and Retrieval-Augmented Speculative Decoding

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13519v1

7728. From Relevance to Authority: Authority-aware Generative Retrieval in Web Search Engines

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13468v1

7729. MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.13418v1

7730. FRESCO: Benchmarking and Optimizing Re-rankers for Evolving Semantic Conflict in Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.14227v1

7731. AffectAgent: Collaborative Multi-Agent Reasoning for Retrieval-Augmented Multimodal Emotion Recognition

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.12735v1

7732. Transforming External Knowledge into Triplets for Enhanced Retrieval in RAG of LLMs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.12610v1

7733. Adaptive Query Routing: A Tier-Based Framework for Hybrid Retrieval Across Financial, Legal, and Medical Documents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.14222v1

7734. Beyond Factual Grounding: The Case for Opinion-Aware Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.12138v1

7735. Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.12133v1

7736. RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.11229v1

7737. ARHN: Answer-Centric Relabeling of Hard Negatives with Open-Source LLMs for Dense Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.11092v1

7738. RAG-KT: Cross-platform Explainable Knowledge Tracing with Multi-view Fusion Retrieval Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.10960v1

7739. CMedTEB & CARE: Benchmarking and Enabling Efficient Chinese Medical Retrieval via Asymmetric Encoders

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.10937v1

7740. From Query to Conscience: The Importance of Information Retrieval in Empowering Socially Responsible Consumerism

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.10751v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14572v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14403v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13938v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13787v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13728v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13710v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13519v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13468v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.13418v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14227v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12735v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12610v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.14222v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12138v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.12133v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.11229v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.11092v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.10960v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.10937v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.10751v1