benty-fields - Search paper

9161. Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling

Hengran Zhang, Keping Bi, Jiafeng Guo, Xiaojie Sun, Shihao Liu, Daiting Shi, Dawei Yin, Xueqi Cheng

arXiv:2504.05216v1

9162. Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG

Hengran Zhang, Minghao Tang, Keping Bi, Jiafeng Guo, Shihao Liu, Daiting Shi, Dawei Yin, Xueqi Cheng

arXiv:2504.05220v1

Retrieval models typically rely on costly human-labeled query-document relevance annotations for training and evaluation. To reduce this cost and leverage the potential of Large Language Models (LLMs) in relevance judgments, we aim to explore whether LLM-generated annotations can effectively replace human annotations in training retrieval models. Retrieval usually emphasizes relevance, which indicates "topic-relatedness" of a document to a query, while in RAG, the value of a document (or utility) depends on how it contributes to answer generation. Recognizing this mismatch, some researchers use LLM performance on downstream tasks with documents as labels, but this approach requires manual answers for specific tasks, leading to high costs and limited generalization. In another line of work, prompting LLMs to select useful documents as RAG references eliminates the need for human annotation and is not task-specific. If we leverage LLMs' utility judgments to annotate retrieval data, we may retain cross-task generalization without human annotation in large-scale corpora. Therefore, we investigate utility-focused annotation via LLMs for large-scale retriever training data across both in-domain and out-of-domain settings on the retrieval and RAG tasks. To reduce the impact of low-quality positives labeled by LLMs, we design a novel loss function, i.e., Disj-InfoNCE. Our experiments reveal that: (1) Retrievers trained on utility-focused annotations significantly outperform those trained on human annotations in the out-of-domain setting on both tasks, demonstrating superior generalization capabilities. (2) LLM annotation does not replace human annotation in the in-domain setting. However, incorporating just 20% human-annotated data enables retrievers trained with utility-focused annotations to match the performance of models trained entirely with human annotations.
Authors' comments: 12 pages, 4 figures
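
A note on the contrastive loss mentioned above: the abstract names Disj-InfoNCE but does not define it, so the sketch below is not that loss. It shows, in plain Python, the standard InfoNCE loss for a single query and one common way to handle several positives (summing the positives' probability mass inside the logarithm), so that a weak positive is not forced to win on its own. The function names, the temperature default, and the score layout are assumptions made for illustration.

```python
import math


def _log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a non-empty list."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def info_nce(scores, positive_idx, temperature=1.0):
    """Standard InfoNCE for one query.

    scores: similarity of the query to each candidate document.
    positive_idx: index of the single positive document.
    Returns -log softmax(scores / temperature)[positive_idx].
    """
    scaled = [s / temperature for s in scores]
    return _log_sum_exp(scaled) - scaled[positive_idx]


def summed_positive_nce(scores, positive_idxs, temperature=1.0):
    """Multi-positive variant (an assumption, not the paper's loss).

    Returns -log of the total softmax mass assigned to all positives,
    so the loss is small as soon as any one positive scores highly.
    """
    scaled = [s / temperature for s in scores]
    log_positive_mass = _log_sum_exp([scaled[i] for i in positive_idxs])
    return _log_sum_exp(scaled) - log_positive_mass
```

With a single positive the two functions agree; with several positives the summed form is never larger than the single-positive loss of any one of them, which is one way noisy positives can be made less harmful.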

9163. TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval

Xiaolun Jing, Genke Yang, Jian Chu

arXiv:2504.04707v1

9164. UniRVQA: A Unified Framework for Retrieval-Augmented Vision Question Answering via Self-Reflective Joint Training

Jiaqi Deng, Kaize Shi, Zonghan Wu, Huan Huo, Dingxian Wang, Guandong Xu

arXiv:2504.04065v1

9165. Precise Legal Sentence Boundary Detection for Retrieval at Scale: NUPunkt and CharBoundary

Michael J Bommarito, Daniel Martin Katz, Jillian Bommarito

arXiv:2504.04131v1

We present NUPunkt and CharBoundary, two sentence boundary detection libraries optimized for high-precision, high-throughput processing of legal text in large-scale applications such as due diligence, e-discovery, and legal research. These libraries address the critical challenges posed by legal documents containing specialized citations, abbreviations, and complex sentence structures that confound general-purpose sentence boundary detectors. Our experimental evaluation on five diverse legal datasets comprising over 25,000 documents and 197,000 annotated sentence boundaries demonstrates that NUPunkt achieves 91.1% precision while processing 10 million characters per second with modest memory requirements (432 MB). CharBoundary models offer balanced and adjustable precision-recall tradeoffs, with the large model achieving the highest F1 score (0.782) among all tested methods. Notably, NUPunkt provides a 29-32% precision improvement over general-purpose tools while maintaining exceptional throughput, processing multi-million document collections in minutes rather than hours. Both libraries run efficiently on standard CPU hardware without requiring specialized accelerators. NUPunkt is implemented in pure Python with zero external dependencies, while CharBoundary relies only on scikit-learn and optional ONNX runtime integration for optimized performance. Both libraries are available under the MIT license, can be installed via PyPI, and can be interactively tested at https://sentences.aleainstitute.ai/. These libraries address critical precision issues in retrieval-augmented generation systems by preserving coherent legal concepts across sentences, where each percentage improvement in precision yields exponentially greater reductions in context fragmentation, creating cascading benefits throughout retrieval pipelines and significantly enhancing downstream reasoning quality.
Authors' comments: 12 pages, 5 figures, 6 tables
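
To make the precision problem described above concrete, here is a minimal sketch of why citations and abbreviations confound a general-purpose splitter. It does not use or reproduce NUPunkt or CharBoundary; the abbreviation set, function names, and sample sentence are all hypothetical and far too small for real legal text.

```python
import re

# Hypothetical, deliberately tiny abbreviation list (illustration only).
LEGAL_ABBREVIATIONS = {"v.", "U.S.", "No.", "Inc.", "Corp.", "Cir."}

SAMPLE = "See Roe v. Wade, 410 U.S. 113 (1973). The court agreed."


def naive_boundaries(text):
    """Offsets just after every '.', '?' or '!' that precedes whitespace or end of text."""
    return [m.end() for m in re.finditer(r"[.?!](?=\s|$)", text)]


def abbreviation_aware_boundaries(text, abbreviations=LEGAL_ABBREVIATIONS):
    """Same candidates, minus those whose final token is a known abbreviation."""
    kept = []
    for end in naive_boundaries(text):
        last_token = text[:end].split()[-1]
        if last_token not in abbreviations:
            kept.append(end)
    return kept


def precision_recall(predicted, gold):
    """Precision and recall of predicted boundary offsets against gold offsets."""
    true_positives = len(set(predicted) & set(gold))
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall
```

On the sample sentence the naive rule also breaks after "v." and "U.S.", so only half of its predicted boundaries are correct, while the abbreviation-aware rule keeps just the two true sentence ends. Each false boundary splits a citation from the sentence it supports, which is the kind of context fragmentation the abstract refers to.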

9166. QE-RAG: A Robust Retrieval-Augmented Generation Benchmark for Query Entry Errors

Kepu Zhang, Zhongxiang Sun, Weijie Yu, Xiaoxue Zang, Kai Zheng, Yang Song, Han Li, Jun Xu

arXiv:2504.04062v1

9167. Joint Retrieval of Cloud properties using Attention-based Deep Learning Models

Zahid Hassan Tushar, Adeleke Ademakinwa, Jianwu Wang, Zhibo Zhang, Sanjay Purushotham

arXiv:2504.03133v1

9168. REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval

Shabnam Choudhury, Yash Salunkhe, Sarthak Mehrotra, Biplab Banerjee

arXiv:2504.03169v2

9169. HyperRAG: Enhancing Quality-Efficiency Tradeoffs in Retrieval-Augmented Generation with Reranker KV-Cache Reuse

Yuwei An, Yihua Cheng, Seo Jin Park, Junchen Jiang

arXiv:2504.02921v1

9170. Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval

Boseung Jeong, Jicheol Park, Sungyeon Kim, Suha Kwak

arXiv:2504.02397v1

9171. Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools

Lena Schmidt, Oshin Sharma, Chris Marshall, Sonia Garcia Gonzalez Moral

arXiv:2504.01627v1

9172. CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models

Runlong Zhou, Yi Zhang

arXiv:2504.01450v1

9173. Real-time Ad retrieval via LLM-generative Commercial Intention for Sponsored Search Advertising

Tongtong Liu, Zhaohui Wang, Meiyue Qin, Zenghui Lu, Xudong Chen, Yuekui Yang, Peng Shu

arXiv:2504.01304v1

9174. One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image

Ezzeldin Shereen, Dan Ristea, Burak Hasircioglu, Shae McFadden, Vasilios Mavroudis, Chris Hicks

arXiv:2504.02132v1

9175. Asymmetry and Dynamical Constraints in 2-Limbs Retrieval of WASP-39 b Inferring from JWST Data

Zixin Chen, Jianghui Ji, Guo Chen, Fei Yan, Xianyu Tan

arXiv:2504.00419v1

9176. CyberBOT: Towards Reliable Cybersecurity Education via Ontology-Grounded Retrieval Augmented Generation

Chengshuai Zhao, Riccardo De Maria, Tharindu Kumarage, Kumar Satvik Chaudhary, Garima Agrawal, Yiwen Li, Jongchan Park, Yuli Deng et al.

arXiv:2504.00389v1

9177. On the Consistency of Multilingual Context Utilization in Retrieval-Augmented Generation

Jirui Qi, Raquel Fernández, Arianna Bisazza

arXiv:2504.00597v2

Retrieval-augmented generation (RAG) with large language models (LLMs) has demonstrated strong performance in multilingual question-answering (QA) tasks by leveraging relevant passages retrieved from corpora. In multilingual RAG (mRAG), the retrieved passages can be written in languages other than that of the query entered by the user, making it challenging for LLMs to effectively utilize the provided information. Recent research suggests that retrieving passages from multilingual corpora can improve RAG performance, particularly for low-resource languages. However, the extent to which LLMs can leverage different kinds of multilingual contexts to generate accurate answers, *independently of retrieval quality*, remains understudied. In this paper, we conduct an extensive assessment of LLMs' ability to (i) make consistent use of a relevant passage regardless of its language, (ii) respond in the expected language, and (iii) focus on the relevant passage even when multiple 'distracting' passages in different languages are provided in the context. Our experiments with four LLMs across three QA datasets covering a total of 48 languages reveal a surprising ability of LLMs to extract the relevant information from out-language passages, but a much weaker ability to formulate a full answer in the correct language. Our analysis, based on both accuracy and feature attribution techniques, further shows that distracting passages negatively impact answer quality regardless of their language. However, distractors in the query language exert a slightly stronger influence. Taken together, our findings deepen the understanding of how LLMs utilize context in mRAG systems, providing directions for future improvements.
Authors' comments: Under review at COLM2025. All code and data are released at https://github.com/Betswish/mRAG-Context-Consistency

9178. Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding

Sakhinana Sagar Srinivas, Akash Das, Shivam Gupta, Venkataramana Runkana

arXiv:2504.01281v2

9179. Accelerating Causal Network Discovery of Alzheimer Disease Biomarkers via Scientific Literature-based Retrieval Augmented Generation

Xiaofan Zhou, Liangjie Huang, Pinyang Cheng, Wenpen Yin, Rui Zhang, Wenrui Hao, Lu Cheng

arXiv:2504.08768v1

9180. Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data

Yiqun Duan, Sameera Ramasinghe, Stephen Gould, Ajanthan Thalaiyasingam

arXiv:2504.00812v2
