benty-fields - Search paper

8801. Retrieval-Augmented Clinical Benchmarking for Contextual Model Testing in Kenyan Primary Care: A Methodology Paper

Fred Mutisya, Shikoh Gitau, Christine Syovata, Diana Oigara, Ibrahim Matende, Muna Aden, Munira Ali, Ryan Nyotu et al.

arXiv:2507.14615v1

8802. Multi-Class-Token Transformer for Multitask Self-supervised Music Information Retrieval

Yuexuan Kong, Vincent Lostanlen, Romain Hennequin, Mathieu Lagrange, Gabriel Meseguer-Brocal

arXiv:2507.12996v1

8803. Bridging the Gap: Leveraging Retrieval-Augmented Generation to Better Understand Public Concerns about Vaccines

Muhammad Javed, Sedigh Khademi Habibabadi, Christopher Palmer, Hazel Clothier, Jim Buttery, Gerardo Luis Dimaguila

arXiv:2507.12840v1

8804. MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic Learning

Hongxu Ma, Guanshuo Wang, Fufu Yu, Qiong Jia, Shouhong Ding

arXiv:2507.12062v1

8805. DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric Reasoning

Qingyun Sun, Jiaqi Yuan, Shan He, Xiao Guan, Haonan Yuan, Xingcheng Fu, Jianxin Li, Philip S. Yu

arXiv:2507.13396v1

8806. When Retriever Meets Generator: A Joint Model for Code Comment Generation

Tien P. T. Le, Anh M. T. Bui, Huy N. D. Pham, Alessio Bucaioni, Phuong T. Nguyen

arXiv:2507.12558v1

8807. Aligned Query Expansion: Efficient Query Expansion for Information Retrieval through LLM Alignment

Adam Yang, Gustavo Penha, Enrico Palumbo, Hugues Bouchard

arXiv:2507.11042v1

8808. PRISM: Fine-Grained Paper-to-Paper Retrieval with Multi-Aspect-Aware Query Optimization

Sangwoo Park, Jinheon Baek, Soyeong Jeong, Sung Ju Hwang

arXiv:2507.10057v1

8809. MixLoRA-DSI: Dynamically Expandable Mixture-of-LoRA Experts for Rehearsal-Free Generative Retrieval over Dynamic Corpora

Tuan-Luc Huynh, Thuy-Trang Vu, Weiqing Wang, Trung Le, Dragan Gašević, Yuan-Fang Li, Thanh-Toan Do

arXiv:2507.09924v1

8810. I2I-PR: Deep Iterative Refinement for Phase Retrieval using Image-to-Image Diffusion Models

Mehmet Onurcan Kaya, Figen S. Oktem

arXiv:2507.09609v1

8811. Correcting the LogQ Correction: Revisiting Sampled Softmax for Large-Scale Retrieval

Kirill Khrylchenko, Vladimir Baikalov, Sergei Makeev, Artem Matveev, Sergei Liamaev

arXiv:2507.09331v1

Two-tower neural networks are a popular architecture for the retrieval stage in recommender systems. These models are typically trained with a softmax loss over the item catalog. However, in web-scale settings, the item catalog is often prohibitively large, making full softmax infeasible. A common solution is sampled softmax, which approximates the full softmax using a small number of sampled negatives. One practical and widely adopted approach is to use in-batch negatives, where negatives are drawn from items in the current mini-batch. However, this introduces a bias: items that appear more frequently in the batch (i.e., popular items) are penalized more heavily. To mitigate this issue, a popular industry technique known as logQ correction adjusts the logits during training by subtracting the log-probability of an item appearing in the batch. This correction is derived by analyzing the bias in the gradient and applying importance sampling, effectively twice, using the in-batch distribution as a proposal distribution. While this approach improves model quality, it does not fully eliminate the bias. In this work, we revisit the derivation of logQ correction and show that it overlooks a subtle but important detail: the positive item in the denominator is not Monte Carlo-sampled - it is always present with probability 1. We propose a refined correction formula that accounts for this. Notably, our loss introduces an interpretable sample weight that reflects the model's uncertainty - the probability of misclassification under the current parameters. We evaluate our method on both public and proprietary datasets, demonstrating consistent improvements over the standard logQ correction.
Authors' comments: Accepted at ACM RecSys 2025. Author's version. To appear in the Proceedings of the 18th ACM Conference on Recommender Systems
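As a rough illustration of the standard logQ correction described in the abstract above (subtracting the log-probability of an item appearing in the batch from its logit), here is a minimal NumPy sketch. It shows only the conventional correction, not the refined formula the paper proposes. The function name, argument names, and the assumption that `log_q` is supplied as a precomputed per-item estimate are all hypothetical choices made for this example.

```python
import numpy as np

def logq_corrected_inbatch_loss(user_emb, item_emb, log_q):
    """Mean in-batch sampled-softmax loss with the standard logQ correction.

    user_emb: (B, d) outputs of the user/query tower.
    item_emb: (B, d) outputs of the item tower; row i is the positive for user i.
    log_q:    (B,) assumed estimate of log P(item appears in a batch).
    """
    logits = user_emb @ item_emb.T                 # (B, B) similarity scores
    logits = logits - log_q[None, :]               # subtract log Q per item column
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_softmax = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_softmax))          # positives lie on the diagonal
```

If every item has the same `log_q`, the correction shifts all logits in a row equally and the loss is unchanged, since softmax is shift-invariant. The correction only matters when in-batch frequencies differ across items.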

8812. Back to the Basics: Rethinking Issue-Commit Linking with LLM-Assisted Retrieval

Huihui Huang, Ratnadira Widyasari, Ting Zhang, Ivana Clairine Irsan, Jieke Shi, Han Wei Ang, Frank Liauw, Eng Lieh Ouh et al.

arXiv:2507.09199v1

8813. RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking

Shuo Yang, Zijian Yu, Zhenzhe Ying, Yuqin Dai, Guoqing Wang, Jun Lan, Jinfeng Xu, Jinze Li et al.

arXiv:2507.09174v1

8814. HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving

Zhengding Hu, Vibha Murthy, Zaifeng Pan, Wanlu Li, Xiaoyi Fang, Yufei Ding, Yuke Wang

arXiv:2507.09138v1

8815. DS@GT at Touché: Large Language Models for Retrieval-Augmented Debate

Anthony Miyaguchi, Conor Johnston, Aaryan Potdar

arXiv:2507.09090v1

8816. GraphRunner: A Multi-Stage Framework for Efficient and Accurate Graph-Based Retrieval

Savini Kashmira, Jayanaka L. Dantanarayana, Krisztián Flautner, Lingjia Tang, Jason Mars

arXiv:2507.08945v1

8817. RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features

Inye Na, Nejung Rue, Jiwon Chung, Hyunjin Park

arXiv:2507.08546v1

8818. Improving Korean-English Cross-Lingual Retrieval: A Data-Centric Study of Language Composition and Model Merging

Youngjoon Jang, Junyoung Son, Taemin Lee, Seongtae Hong, Heuiseok Lim

arXiv:2507.08480v1

8819. KGRAG-Ex: Explainable Retrieval-Augmented Generation with Knowledge Graph-based Perturbations

Georgios Balanos, Evangelos Chasanis, Konstantinos Skianis, Evaggelia Pitoura

arXiv:2507.08443v1

8820. xpSHACL: Explainable SHACL Validation using Retrieval-Augmented Generation and Large Language Models

Gustavo Correa Publio, José Emilio Labra Gayo

arXiv:2507.08432v1
