benty-fields - Search paper

8021. RankGR: Rank-Enhanced Generative Retrieval with Listwise Direct Preference Optimization in Recommendation

Kairui Fu, Changfa Wu, Kun Yuan, Binbin Cao, Dunxian Huang, Yuliang Yan, Junjun Zheng, Jianning Zhang et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.08575v1

Vote

Add to Library

Recommend

8022. MemAdapter: Fast Alignment across Agent Memory Paradigms via Generative Subgraph Retrieval

Xin Zhang, Kailai Yang, Chenyue Li, Hao Li, Qiyu Wei, Jun'ichi Tsujii, Sophia Ananiadou

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.08369v1

Vote

Add to Library

Recommend

8023. ByteHouse: A Cloud-Native OLAP Engine with Incremental Computation and Multi-Modal Retrieval

Yuxing Han, Yu Lin, Yifeng Dong, Xuanhe Zhou, Xindong Peng, Xinhui Tian, Zhiyuan You, Yingzhong Guo et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.08226v1

Vote

Add to Library

Recommend

8024. WristMIR: Coarse-to-Fine Region-Aware Retrieval of Pediatric Wrist Radiographs with Radiology Report-Driven Learning

Mert Sonmezer, Serge Vasylechko, Duygu Atasoy, Seyda Ertekin, Sila Kurugol

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07872v1

Vote

Add to Library

Recommend

8025. IGMiRAG: Intuition-Guided Retrieval-Augmented Generation with Adaptive Mining of In-Depth Memory

Xingliang Hou, Yuyan Liu, Qi Sun, haoxiu wang, Hao Hu, Shaoyi Du, Zhiqiang Tian

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07525v1

Vote

Add to Library

Recommend

8026. Adaptive Retrieval helps Reasoning in LLMs -- but mostly if it's not used

Srijan Shakya, Anamaria-Roberta Hartl, Sepp Hochreiter, Korbinian Pöppel

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07213v1

Vote

Add to Library

Recommend

8027. The effect of JWST/NIRSpec data reduction on the retrieval of WASP-39b atmospheric properties

J. Roy-Perez, S. Pérez-Hoyos, N. Barrado-Izagirre, H. Chen-Chen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.06722v1

Vote

Add to Library

Recommend

8028. Multimodal Generative Retrieval Model with Staged Pretraining for Food Delivery on Meituan

Boyu Chen, Tai Guo, Weiyu Cui, Yuqing Li, Xingxing Wang, Chuan Shi, Cheng Yang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.06654v1

Vote

Add to Library

Recommend

8029. R2LED: Equipping Retrieval and Refinement in Lifelong User Modeling with Semantic IDs for CTR Prediction

Qidong Liu, Gengnan Wang, Zhichen Liu, Moranxin Wang, Zijian Zhang, Xiao Han, Ni Zhang, Tao Qin et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.06622v1

Vote

Add to Library

Recommend

8030. Evaluating Retrieval-Augmented Generation Variants for Natural Language-Based SQL and API Call Generation

Michael Marketsmüller, Simon Martin, Tim Schlippe

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07086v1

Vote

Add to Library

Recommend

8031. IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

Connor Shorten, Augustas Skaburskas, Daniel M. Jones, Charles Pierse, Roberto Esposito, John Trengrove, Etienne Dilocker, Bob van Luijt

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.17687v1

AI systems have achieved remarkable success in processing text and relational data, yet visual document processing remains relatively underexplored. Whereas traditional systems require OCR transcriptions to convert these visual documents into text and metadata, recent advances in multimodal foundation models offer retrieval and generation directly from document images. This raises a key question: How do image-based systems compare to established text-based methods? We introduce IRPAPERS, a benchmark of 3,230 pages from 166 scientific papers, with both an image and an OCR transcription for each page. Using 180 needle-in-the-haystack questions, we compare image- and text-based retrieval and question answering systems. Text retrieval using Arctic 2.0 embeddings, BM25, and hybrid text search achieved 46% Recall@1, 78% Recall@5, and 91% Recall@20, while image-based retrieval reaches 43%, 78%, and 93%, respectively. The two modalities exhibit complementary failures, enabling multimodal hybrid search to outperform either alone, achieving 49% Recall@1, 81% Recall@5, and 95% Recall@20. We further evaluate efficiency-performance tradeoffs with MUVERA and assess multiple multi-vector image embedding models. Among closed-source models, Cohere Embed v4 page image embeddings outperform Voyage 3 Large text embeddings and all tested open-source models, achieving 58% Recall@1, 87% Recall@5, and 97% Recall@20. For question answering, text-based RAG systems achieved higher ground-truth alignment than image-based systems (0.82 vs. 0.71), and both benefit substantially from increased retrieval depth, with multi-document retrieval outperforming oracle single-document retrieval. We analyze the complementary limitations of unimodal text and image representations and identify question types that require one modality over the other. The IRPAPERS dataset and all experimental code are publicly available.
Authors' comments: 23 pages, 6 figures

Vote

Add to Library

Recommend

8032. Evaluating the impact of word embeddings on similarity scoring in practical information retrieval

Niall McCarroll, Kevin Curran, Eugene McNamee, Angela Clist, Andrew Brammer

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05734v1

Vote

Add to Library

Recommend

8033. Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification

Taoye Yin, Haoyuan Hu, Yaxin Fan, Xinhao Chen, Xinya Wu, Kai Deng, Kezun Zhang, Feng Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05723v1

Vote

Add to Library

Recommend

8034. AI Agent Systems for Supply Chains: Structured Decision Prompts and Memory Retrieval

Konosuke Yoshizato, Kazuma Shimizu, Ryota Higa, Takanobu Otsuka

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05524v1

Vote

Add to Library

Recommend

8035. NeuCLIRTech: Chinese Monolingual and Cross-Language Information Retrieval Evaluation in a Challenging Domain

Dawn Lawrie, James Mayfield, Eugene Yang, Andrew Yates, Sean MacAvaney, Ronak Pradeep, Scott Miller, Paul McNamee et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05334v1

Vote

Add to Library

Recommend

8036. Supporting software engineering tasks with agentic AI: Demonstration on document retrieval and test scenario generation

Marian Kica, Lukas Radosky, David Slivka, Karin Kubinova, Daniel Dovhun, Tomas Uhercik, Erik Bircak, Ivan Polasek

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04726v1

Vote

Add to Library

Recommend

8037. SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation

David F. Ramirez, Tim Overman, Kristen Jaskie, Joe Marvin, Andreas Spanias

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04712v1

Vote

Add to Library

Recommend

8038. Deterministic Retrieval at Scale: Optimal-Space LCP Indexing and 308x Energy Reduction on Modern GPUs

Stanislav Byriukov

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04936v1

Vote

Add to Library

Recommend

8039. LILaC: Late Interacting in Layered Component Graph for Open-domain Multimodal Multihop Retrieval

Joohyung Yun, Doyup Lee, Wook-Shin Han

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04263v1

Vote

Add to Library

Recommend

8040. ARIA: Adaptive Retrieval Intelligence Assistant -- A Multimodal RAG Framework for Domain-Specific Engineering Education

Yue Luo, Dibakar Roy Sarkar, Rachel Herring Sangree, Somdatta Goswami

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.06179v1

Vote

Add to Library

Recommend

Benty-search

8021. RankGR: Rank-Enhanced Generative Retrieval with Listwise Direct Preference Optimization in Recommendation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.08575v1

8022. MemAdapter: Fast Alignment across Agent Memory Paradigms via Generative Subgraph Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.08369v1

8023. ByteHouse: A Cloud-Native OLAP Engine with Incremental Computation and Multi-Modal Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.08226v1

8024. WristMIR: Coarse-to-Fine Region-Aware Retrieval of Pediatric Wrist Radiographs with Radiology Report-Driven Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.07872v1

8025. IGMiRAG: Intuition-Guided Retrieval-Augmented Generation with Adaptive Mining of In-Depth Memory

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.07525v1

8026. Adaptive Retrieval helps Reasoning in LLMs -- but mostly if it's not used

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.07213v1

8027. The effect of JWST/NIRSpec data reduction on the retrieval of WASP-39b atmospheric properties

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.06722v1

8028. Multimodal Generative Retrieval Model with Staged Pretraining for Food Delivery on Meituan

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.06654v1

8029. R2LED: Equipping Retrieval and Refinement in Lifelong User Modeling with Semantic IDs for CTR Prediction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.06622v1

8030. Evaluating Retrieval-Augmented Generation Variants for Natural Language-Based SQL and API Call Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.07086v1

8031. IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.17687v1

8032. Evaluating the impact of word embeddings on similarity scoring in practical information retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.05734v1

8033. Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.05723v1

8034. AI Agent Systems for Supply Chains: Structured Decision Prompts and Memory Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.05524v1

8035. NeuCLIRTech: Chinese Monolingual and Cross-Language Information Retrieval Evaluation in a Challenging Domain

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.05334v1

8036. Supporting software engineering tasks with agentic AI: Demonstration on document retrieval and test scenario generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.04726v1

8037. SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.04712v1

8038. Deterministic Retrieval at Scale: Optimal-Space LCP Indexing and 308x Energy Reduction on Modern GPUs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.04936v1

8039. LILaC: Late Interacting in Layered Component Graph for Open-domain Multimodal Multihop Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.04263v1

8040. ARIA: Adaptive Retrieval Intelligence Assistant -- A Multimodal RAG Framework for Domain-Specific Engineering Education

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.06179v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.08575v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.08369v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.08226v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07872v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07525v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07213v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.06722v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.06654v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.06622v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.07086v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.17687v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05734v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05723v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05524v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.05334v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04726v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04712v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04936v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.04263v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.06179v1