benty-fields - Search paper

9661. LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.18050v2

Vote

Add to Library

Recommend

9662. Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval

Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gaopeng Gou, Gang Xiong, Qi Wu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.17393v1

Zero-Shot Composed Image Retrieval (ZS-CIR) supports diverse tasks with a broad range of visual content manipulation intentions that can be related to domain, scene, object, and attribute. A key challenge for ZS-CIR is to accurately map image representation to a pseudo-word token that captures the manipulation intention relevant image information for generalized CIR. However, existing methods between the retrieval and pre-training stages lead to significant redundancy in the pseudo-word tokens. In this paper, we propose a novel denoising image-to-word mapping approach, named Denoise-I2W, for mapping images into denoising pseudo-word tokens that, without intention-irrelevant visual information, enhance accurate ZS-CIR. Specifically, a pseudo triplet construction module first automatically constructs pseudo triples (\textit{i.e.,} a pseudo-reference image, a pseudo-manipulation text, and a target image) for pre-training the denoising mapping network. Then, a pseudo-composed mapping module maps the pseudo-reference image to a pseudo-word token and combines it with the pseudo-manipulation text with manipulation intention. This combination aligns with the target image, facilitating denoising intention-irrelevant visual information for mapping. Our proposed Denoise-I2W is a model-agnostic and annotation-free approach. It demonstrates strong generalization capabilities across three state-of-the-art ZS-CIR models on four benchmark datasets. By integrating Denoise-I2W with existing best models, we obtain consistent and significant performance boosts ranging from 1.45\% to 4.17\% over the best methods without increasing inference costs. and achieve new state-of-the-art results on ZS-CIR. Our code is available at \url{https://github.com/Pter61/denoise-i2w-tmm}.
Authors' comments: This work was submitted to IJCAI 2024, with a score of weak accept and borderline accept

Vote

Add to Library

Recommend

9663. Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

Zongmeng Zhang, Yufeng Shi, Jinhua Zhu, Wengang Zhou, Xiang Qi, Peng Zhang, Houqiang Li

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.16843v1

Vote

Add to Library

Recommend

9664. Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?

Gustavo Penha, Ali Vardasbi, Enrico Palumbo, Marco de Nadai, Hugues Bouchard

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.16823v1

Generative retrieval for search and recommendation is a promising paradigm for retrieving items, offering an alternative to traditional methods that depend on external indexes and nearest-neighbor searches. Instead, generative models directly associate inputs with item IDs. Given the breakthroughs of Large Language Models (LLMs), these generative systems can play a crucial role in centralizing a variety of Information Retrieval (IR) tasks in a single model that performs tasks such as query understanding, retrieval, recommendation, explanation, re-ranking, and response generation. Despite the growing interest in such a unified generative approach for IR systems, the advantages of using a single, multi-task model over multiple specialized models are not well established in the literature. This paper investigates whether and when such a unified approach can outperform task-specific models in the IR tasks of search and recommendation, broadly co-existing in multiple industrial online platforms, such as Spotify, YouTube, and Netflix. Previous work shows that (1) the latent representations of items learned by generative recommenders are biased towards popularity, and (2) content-based and collaborative-filtering-based information can improve an item's representations. Motivated by this, our study is guided by two hypotheses: [H1] the joint training regularizes the estimation of each item's popularity, and [H2] the joint training regularizes the item's latent representations, where search captures content-based aspects of an item and recommendation captures collaborative-filtering aspects. Our extensive experiments with both simulated and real-world data support both [H1] and [H2] as key contributors to the effectiveness improvements observed in the unified search and recommendation generative models over the single-task approaches.
Authors' comments: Accepted for publication in the 18th ACM Conference on Recommender Systems (RecSys'24)

Vote

Add to Library

Recommend

9665. Unleashing the Potential of Multi-Channel Fusion in Retrieval for Personalized Recommendations

Junjie Huang, Jiarui Qin, Jianghao Lin, Ziming Feng, Yong Yu, Weinan Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.16080v1

Recommender systems (RS) are pivotal in managing information overload in modern digital services. A key challenge in RS is efficiently processing vast item pools to deliver highly personalized recommendations under strict latency constraints. Multi-stage cascade ranking addresses this by employing computationally efficient retrieval methods to cover diverse user interests, followed by more precise ranking models to refine the results. In the retrieval stage, multi-channel retrieval is often used to generate distinct item subsets from different candidate generators, leveraging the complementary strengths of these methods to maximize coverage. However, forwarding all retrieved items overwhelms downstream rankers, necessitating truncation. Despite advancements in individual retrieval methods, multi-channel fusion, the process of efficiently merging multi-channel retrieval results, remains underexplored. We are the first to identify and systematically investigate multi-channel fusion in the retrieval stage. Current industry practices often rely on heuristic approaches and manual designs, which often lead to suboptimal performance. Moreover, traditional gradient-based methods like SGD are unsuitable for this task due to the non-differentiable nature of the selection process. In this paper, we explore advanced channel fusion strategies by assigning systematically optimized weights to each channel. We utilize black-box optimization techniques, including the Cross Entropy Method and Bayesian Optimization for global weight optimization, alongside policy gradient-based approaches for personalized merging. Our methods enhance both personalization and flexibility, achieving significant performance improvements across multiple datasets and yielding substantial gains in real-world deployments, offering a scalable solution for optimizing multi-channel fusion in retrieval.
Authors' comments: 12 pages, 8 figures

Vote

Add to Library

Recommend

9666. Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report

Ayman Asad Khan, Md Toufique Hasan, Kai Kristian Kemell, Jussi Rasku, Pekka Abrahamsson

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15944v1

Vote

Add to Library

Recommend

9667. Leveraging Retrieval-Augmented Generation for Culturally Inclusive Hakka Chatbots: Design Insights and User Perceptions

Chen-Chi Chang, Han-Pi Chang, Hung-Shin Lee

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15572v1

Vote

Add to Library

Recommend

9668. ConTReGen: Context-driven Tree-structured Retrieval for Open-domain Long-form Text Generation

Kashob Kumar Roy, Pritom Saha Akash, Kevin Chen-Chuan Chang, Lucian Popa

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15511v1

Vote

Add to Library

Recommend

9669. Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15438v1

Vote

Add to Library

Recommend

9670. When Machine Unlearning Meets Retrieval-Augmented Generation (RAG): Keep Secret or Forget Knowledge?

Shang Wang, Tianqing Zhu, Dayong Ye, Wanlei Zhou

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15267v1

Vote

Add to Library

Recommend

9671. BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression

Yuankai Li, Jia-Chen Gu, Di Wu, Kai-Wei Chang, Nanyun Peng

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15277v2

Vote

Add to Library

Recommend

9672. YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary

Hao-Tang Tsui, Chien-Yao Wang, Hong-Yuan Mark Liao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15346v2

Vote

Add to Library

Recommend

9673. Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization

Zichen Wang, Yaokun Ji, Jianing Tian, Shuangjia Zheng

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15040v1

Vote

Add to Library

Recommend

9674. Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems

Jiajing Chen, Runyuan Bao, Hongye Zheng, Zhen Qi, Jianjun Wei, Jiacheng Hu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14167v1

Vote

Add to Library

Recommend

9675. RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training

Muhe Ding, Yang Ma, Pengda Qin, Jianlong Wu, Yuhong Li, Liqiang Nie

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14154v1

Vote

Add to Library

Recommend

9676. DiSCo: LLM Knowledge Distillation for Efficient Sparse Retrieval in Conversational Search

Simon Lupart, Mohammad Aliannejadi, Evangelos Kanoulas

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14609v2

Conversational Search (CS) involves retrieving relevant documents from a corpus while considering the conversational context, integrating retrieval with context modeling. Recent advancements in Large Language Models (LLMs) have significantly enhanced CS by enabling query rewriting based on conversational context. However, employing LLMs during inference poses efficiency challenges. Existing solutions mitigate this issue by distilling embeddings derived from human-rewritten queries, focusing primarily on learning the context modeling task. These methods, however, often separate the contrastive retrieval task from the distillation process, treating it as an independent loss term. To overcome these limitations, we introduce DiSCo (Distillation of Sparse Conversational retrieval), a novel approach that unifies retrieval and context modeling through a relaxed distillation objective. Instead of relying exclusively on representation learning, our method distills similarity scores between conversations and documents, providing more freedom in the representation space and better leveraging the contrastive nature of document relevance. Extensive experiments on Learned Sparse Retrieval (LSR) across five CS datasets demonstrate that DiSCo achieves substantial improvements in both in-domain and out-of-domain retrieval tasks, achieving up to a six-point gain in recall for out-of-domain datasets over state-of-the-art methods. Additionally, DiSCo employs a multi-teacher distillation strategy, using multiple LLMs as teachers, further enhancing performance and surpassing the individual teachers in in-domain settings. Furthermore, analysis of model sparsity reveals that DiSCo allows for more effective control over the sparsity of the trained models.
Authors' comments: 11 pages, 6 figures. SIGIR '25 Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval July 13--18, 2025 Padua, Italy

Vote

Add to Library

Recommend

9677. Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs

Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14057v1

Vote

Add to Library

Recommend

9678. FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation

Dongfang Zhao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.13272v1

Vote

Add to Library

Recommend

9679. Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Sreyan Ghosh, Mohammad Sadegh Rasooli, Michael Levit, Peidong Wang, Jian Xue, Dinesh Manocha, Jinyu Li

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.13198v1

Vote

Add to Library

Recommend

9680. CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Shangda Wu, Yashan Wang, Ruibin Yuan, Zhancheng Guo, Xu Tan, Ge Zhang, Monan Zhou, Jing Chen et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.13267v2

Vote

Add to Library

Recommend

Benty-search

9661. LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.18050v2

9662. Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.17393v1

9663. Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.16843v1

9664. Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.16823v1

9665. Unleashing the Potential of Multi-Channel Fusion in Retrieval for Personalized Recommendations

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.16080v1

9666. Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15944v1

9667. Leveraging Retrieval-Augmented Generation for Culturally Inclusive Hakka Chatbots: Design Insights and User Perceptions

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15572v1

9668. ConTReGen: Context-driven Tree-structured Retrieval for Open-domain Long-form Text Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15511v1

9669. Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15438v1

9670. When Machine Unlearning Meets Retrieval-Augmented Generation (RAG): Keep Secret or Forget Knowledge?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15267v1

9671. BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15277v2

9672. YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15346v2

9673. Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.15040v1

9674. Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.14167v1

9675. RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.14154v1

9676. DiSCo: LLM Knowledge Distillation for Efficient Sparse Retrieval in Conversational Search

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.14609v2

9677. Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.14057v1

9678. FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.13272v1

9679. Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.13198v1

9680. CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2410.13267v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.18050v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.17393v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.16843v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.16823v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.16080v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15944v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15572v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15511v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15438v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15267v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15277v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15346v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.15040v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14167v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14154v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14609v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.14057v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.13272v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.13198v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2410.13267v2