benty-fields - Search paper

Most image-text retrieval work adopts binary labels indicating whether a pair of image and text matches or not. Such a binary indicator covers only a limited subset of image-text semantic relations, which is insufficient to represent relevance degrees between images and texts described by continuous labels such as image captions. The visual-semantic embedding space obtained by learning binary labels is incoherent and cannot fully characterize the relevance degrees. In addition to the use of binary labels, this paper further incorporates continuous pseudo labels (generally approximated by text similarity between captions) to indicate the relevance degrees. To learn a coherent embedding space, we propose an image-text retrieval framework with Binary and Continuous Label Supervision (BCLS), where binary labels are used to guide the retrieval model to learn limited binary correlations, and continuous labels are complementary to the learning of image-text semantic relations. For the learning of binary labels, we improve the common Triplet ranking loss with Soft Negative mining (Triplet-SN) to improve convergence. For the learning of continuous labels, we design Kendall ranking loss inspired by Kendall rank correlation coefficient (Kendall), which improves the correlation between the similarity scores predicted by the retrieval model and the continuous labels. To mitigate the noise introduced by the continuous pseudo labels, we further design Sliding Window sampling and Hard Sample mining strategy (SW-HS) to alleviate the impact of noise and reduce the complexity of our framework to the same order of magnitude as the triplet ranking loss. Extensive experiments on two image-text retrieval benchmarks demonstrate that our method can improve the performance of state-of-the-art image-text retrieval models.
Authors' comments: 13 pages, 7 figures

Vote

Add to Library

Recommend

6325. VTC: Improving Video-Text Retrieval with User Comments

Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.10820v1

Vote

Add to Library

Recommend

6326. Towards Proactive Information Retrieval in Noisy Text with Wikipedia Concepts

Tabish Ahmed, Sahan Bulathwela

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.09877v1

Vote

Add to Library

Recommend

6327. Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Joemon M. Jose

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.08908v1

Vote

Add to Library

Recommend

6328. Efficient Cross-Modal Video Retrieval with Meta-Optimized Frames

Ning Han, Xun Yang, Ee-Peng Lim, Hao Chen, Qianru Sun

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.08452v1

Vote

Add to Library

Recommend

6329. Selective Query-guided Debiasing for Video Corpus Moment Retrieval

Sunjae Yoon, Ji Woo Hong, Eunseop Yoon, Dahyun Kim, Junyeong Kim, Hee Suk Yoon, Chang D. Yoo

Lecture Notes in Computer Science, 185-200 (2022)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.08714v2

Vote

Add to Library

Recommend

6330. Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge

Kosuke Nishida, Naoki Yoshinaga, Kyosuke Nishida

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.07523v3

Vote

Add to Library

Recommend

6331. Language Agnostic Multilingual Information Retrieval with Contrastive Learning

Xiyang Hu, Xinchi Chen, Peng Qi, Deguang Kong, Kunlun Liu, William Yang Wang, Zhiheng Huang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.06633v3

Vote

Add to Library

Recommend

6332. Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers

Odunayo Ogundepo, Xinyu Zhang, Jimmy Lin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05481v1

Vote

Add to Library

Recommend

6333. Retrieval Augmentation for T5 Re-ranker using External Sources

Kai Hui, Tao Chen, Zhen Qin, Honglei Zhuang, Fernando Diaz, Mike Bendersky, Don Metzler

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05145v1

Vote

Add to Library

Recommend

6334. Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval

Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, Jing Yao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05521v3

Vote

Add to Library

Recommend

6335. Improving Robustness of Retrieval Augmented Translation via Shuffling of Suggestions

Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05059v1

Vote

Add to Library

Recommend

6336. CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation

Tanay Dixit, Bhargavi Paranjape, Hannaneh Hajishirzi, Luke Zettlemoyer

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.04873v2

Vote

Add to Library

Recommend

6337. ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval

Adriano Fragomeni, Michael Wray, Dima Damen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.04341v1

Vote

Add to Library

Recommend

6338. Multi-Objective Personalized Product Retrieval in Taobao Search

Yukun Zheng, Jiang Bian, Guanghao Meng, Chao Zhang, Honggang Wang, Zhixuan Zhang, Sen Li, Tao Zhuang et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.04170v1

Vote

Add to Library

Recommend

6339. Enhanced vectors for top-k document retrieval in Question Answering

Mohammed Hammad

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.10584v1

Vote

Add to Library

Recommend

6340. Learning to embed semantic similarity for joint image-text retrieval

Noam Malali, Yosi Keller

IEEE Transactions on Pattern Analysis and Machine Intelligence, 1-1 (2021)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.03838v1

Vote

Add to Library

Recommend

Benty-search

6321. PENTATRON: PErsonalized coNText-Aware Transformer for Retrieval-based cOnversational uNderstanding

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.12308v1

6322. SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.11773v2

6323. An Analysis of Fusion Functions for Hybrid Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.11934v2

6324. Image-Text Retrieval with Binary and Continuous Label Supervision

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.11319v1

6325. VTC: Improving Video-Text Retrieval with User Comments

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.10820v1

6326. Towards Proactive Information Retrieval in Noisy Text with Wikipedia Concepts

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.09877v1

6327. Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.08908v1

6328. Efficient Cross-Modal Video Retrieval with Meta-Optimized Frames

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.08452v1

6329. Selective Query-guided Debiasing for Video Corpus Moment Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.08714v2

6330. Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.07523v3

6331. Language Agnostic Multilingual Information Retrieval with Contrastive Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.06633v3

6332. Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.05481v1

6333. Retrieval Augmentation for T5 Re-ranker using External Sources

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.05145v1

6334. Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.05521v3

6335. Improving Robustness of Retrieval Augmented Translation via Shuffling of Suggestions

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.05059v1

6336. CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.04873v2

6337. ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.04341v1

6338. Multi-Objective Personalized Product Retrieval in Taobao Search

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.04170v1

6339. Enhanced vectors for top-k document retrieval in Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.10584v1

6340. Learning to embed semantic similarity for joint image-text retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2210.03838v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.12308v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.11773v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.11934v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.11319v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.10820v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.09877v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.08908v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.08452v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.08714v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.07523v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.06633v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05481v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05145v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05521v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.05059v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.04873v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.04341v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.04170v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.10584v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2210.03838v1