benty-fields - Search paper

4761. MLDocRAG: Multimodal Long-Context Document Retrieval Augmented Generation

Yongyue Zhang, Yaxiong Wu

arXiv:2602.10271v1

4762. RAD: Retrieval-Augmented Monocular Metric Depth Estimation for Underrepresented Classes

Michael Baltaxe, Dan Levi, Sagie Benaim

arXiv:2602.09532v1

4763. AtomicRAG: Atom-Entity Graphs for Retrieval-Augmented Generation

Yanning Hou, Duanyang Yuan, Sihang Zhou, Xiaoshu Chen, Ke Liang, Siwei Wang, Xinwang Liu, Jian Huang

arXiv:2604.20844v1

4764. Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation

Zhisheng Qi, Utkarsh Sahu, Li Ma, Haoyu Han, Ryan Rossi, Franck Dernoncourt, Mahantesh Halappanavar, Nesreen Ahmed et al.

arXiv:2602.09319v1

4765. OSCAR: Optimization-Steered Agentic Planning for Composed Image Retrieval

Teng Wang, Rong Shan, Jianghao Lin, Junjie Wu, Tianyi Xu, Jianping Zhang, Wenteng Chen, Changwang Zhang et al.

arXiv:2602.08603v1

4766. DA-RAG: Dynamic Attributed Community Search for Retrieval-Augmented Generation

Xingyuan Zeng, Zuohan Wu, Yue Wang, Chen Zhang, Quanming Yao, Libin Zheng, Jian Yin

arXiv:2602.08545v1

4767. A Sketch+Text Composed Image Retrieval Dataset for Thangka

Jinyu Xu, Yi Sun, Jiangling Zhang, Qing Xie, Daomin Ji, Zhifeng Bao, Jiachen Li, Yanchun Ma et al.

arXiv:2602.08411v1

4768. VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval

Issar Tzachor, Dvir Samuel, Rami Ben-Ari

arXiv:2602.08099v1

4769. Efficient Table Retrieval and Understanding with Multimodal Large Language Models

Zhuoyan Xu, Haoyang Fang, Boran Han, Bonan Min, Bernie Wang, Cuixiong Hu, Shuai Zhang

arXiv:2602.07642v1

4770. V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Dongyang Chen, Chaoyang Wang, Dezhao SU, Xi Xiao, Zeyu Zhang, Jing Xiong, Qing Li, Yuzhang Shang et al.

arXiv:2602.06034v1

4771. SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

Tiansheng Hu, Yilun Zhao, Canyu Zhang, Arman Cohan, Chen Zhao

arXiv:2602.05975v1

4772. ArkTS-CodeSearch: A Open-Source ArkTS Dataset for Code Retrieval

Yulong He, Artem Ermakov, Sergey Kovalchuk, Artem Aliev, Dmitry Shalymov

arXiv:2602.05550v1

4773. FedMosaic: Federated Retrieval-Augmented Generation via Parametric Adapters

Zhilin Liang, Yuxiang Wang, Zimu Zhou, Hainan Zhang, Boyi Liu, Yongxin Tong

arXiv:2602.05235v1

4774. Scaling Laws for Embedding Dimension in Information Retrieval

Julian Killingback, Mahta Rafiee, Madine Manas, Hamed Zamani

arXiv:2602.05062v1

Dense retrieval, which encodes each query and document into a single dense vector, has become the dominant neural retrieval approach due to its simplicity and compatibility with fast approximate nearest neighbor algorithms. As the tasks dense retrieval performs grow in complexity, the fundamental limitations of the underlying data structure and similarity metric, namely vectors and inner products, become more apparent. Recent work has shown theoretical limitations inherent to single vectors and inner products that are generally tied to the embedding dimension. Given the importance of embedding dimension for retrieval capacity, understanding how dense retrieval performance changes as embedding dimension is scaled is fundamental to building next-generation retrieval models that balance effectiveness and efficiency. In this work, we conduct a comprehensive analysis of the relationship between embedding dimension and retrieval performance. Our experiments cover two model families, with a range of model sizes from each, to construct a detailed picture of embedding scaling behavior. We find that the scaling behavior fits a power law, allowing us to derive scaling laws for performance given only embedding dimension, as well as a joint law accounting for both embedding dimension and model size. Our analysis shows that for evaluation tasks aligned with the training task, performance continues to improve as embedding size increases, though with diminishing returns. For evaluation data that is less aligned with the training task, performance is less predictable and, for certain tasks, degrades at larger embedding dimensions. We hope our work provides additional insight into the limitations of embeddings and their behavior, and offers a practical guide for selecting model size and embedding dimension to achieve optimal performance with reduced storage and compute costs.
Authors' comments: 9 Pages, 7 figures
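
To make the power-law claim in the abstract concrete, here is a minimal sketch of how such a fit could be done. It is not the authors' code, and the functional form error = a * d^(-alpha) is an assumption chosen for illustration; the abstract does not state the exact law. The function name fit_power_law and the data points are hypothetical, and the data are synthetic, generated from a known law rather than taken from the paper.

```python
import math


def fit_power_law(dims, errors):
    """Fit errors ~= a * dims**(-alpha) by least squares in log-log space.

    Assumed form for illustration only; returns the pair (a, alpha).
    """
    xs = [math.log(d) for d in dims]
    ys = [math.log(e) for e in errors]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Ordinary least squares for the line log(e) = intercept + slope * log(d).
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope


# Synthetic, noise-free points generated from a = 2.0, alpha = 0.5.
# These are NOT results from the paper.
dims = [32, 64, 128, 256, 512, 1024]
errors = [2.0 * d ** -0.5 for d in dims]

a, alpha = fit_power_law(dims, errors)
print(f"a={a:.3f}, alpha={alpha:.3f}")

# Under this assumed form, each doubling of the dimension multiplies the
# error by the same factor 2**(-alpha), so the absolute gain shrinks as
# the dimension grows: one way to read "diminishing returns".
print(f"error ratio per doubling: {2 ** -alpha:.3f}")
```

With real measurements the points would be noisy, and a form with an additive floor (an irreducible error term) might fit better; that would need a nonlinear fit rather than the log-log line used here.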

4775. Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction

Junjie Chen, Haitao Li, Qilei Zhang, Zhenghua Li, Ya Zhang, Quan Zhou, Cheng Luo, Yiqun Liu et al.

arXiv:2602.04690v1

4776. Hierarchical Embedding Fusion for Retrieval-Augmented Code Generation

Nikita Sorokin, Ivan Sedykh, Valentin Malykh

arXiv:2603.06593v1

4777. AIANO: Enhancing Information Retrieval with AI-Augmented Annotation

Sameh Khattab, Marie Bauer, Lukas Heine, Till Rostalski, Jens Kleesiek, Julian Friedrich

arXiv:2602.04579v1

4778. Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation

Ning Wang, Kuanyan Zhu, Daniel Yuehwoon Yee, Yitang Gao, Shiying Huang, Zirun Xu, Sainyam Galhotra

arXiv:2602.04926v1

4779. RAGTurk: Best Practices for Retrieval Augmented Generation in Turkish

Süha Kağan Köse, Mehmet Can Baytekin, Burak Aktaş, Bilge Kaan Görür, Evren Ayberk Munis, Deniz Yılmaz, Muhammed Yusuf Kartal, Çağrı Toraman

arXiv:2602.03652v1

4780. RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval

Tyler Skow, Alexander Martin, Benjamin Van Durme, Rama Chellappa, Reno Kriz

arXiv:2602.02444v1
