benty-fields - Search paper

An important goal of online platforms is to enable content discovery, i.e. to allow users to find catalog entities they were not familiar with. A prerequisite for discovering an entity, e.g. a book, with a search engine is that the entity is retrievable, i.e. there are queries for which the system surfaces the entity in the top results. However, machine-learned search engines have a high retrievability bias: the majority of queries return the same entities. This happens partly because of the predominance of narrow-intent queries, where users build queries from the title of an already known entity, e.g. 'harry potter' in book search. Broad queries, where users want to discover new entities and have a higher tolerance for what they might find, e.g. 'chill lyrical electronica with an atmospheric feeling to it' in music search, are rare in comparison. We focus here on two factors that have a negative impact on the retrievability of entities: (I) the training data used for dense retrieval models and (II) the distribution of narrow- and broad-intent queries issued in the system. We propose CtrlQGen, a method that generates queries for a chosen underlying intent, narrow or broad. CtrlQGen can address factor (I) by generating training data for dense retrieval models composed of diverse synthetic queries. It can also address factor (II) by suggesting queries with broader intents to users. Our results on datasets from the domains of music, podcasts, and books reveal that CtrlQGen can significantly decrease the retrievability bias of a dense retrieval model. First, using the generated queries as training data for dense models makes 9% of the entities retrievable (they go from zero to non-zero retrievability). Second, suggesting broader queries to users makes 12% of the entities retrievable in the best case.
Authors' comments: Accepted for publication in the International World Wide Web Conference 2023
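
The abstract's notions of "retrievable" and "retrievability bias" can be illustrated with a small sketch. It assumes a cutoff-based definition (an entity's retrievability is the number of queries that rank it in the top k) and uses the Gini coefficient as a bias summary; both are common choices in the retrievability literature, not necessarily the exact measures used in the paper. The function names, toy catalog, and toy queries are hypothetical.

```python
def retrievability(rankings, catalog, k=10):
    """Count, for each entity, how many queries rank it in the top k.

    `rankings` maps a query string to its ranked list of entity ids
    (assumed to contain no duplicates).
    """
    counts = {entity: 0 for entity in catalog}
    for ranked in rankings.values():
        for entity in ranked[:k]:
            counts[entity] = counts.get(entity, 0) + 1
    return counts


def gini(values):
    """Gini coefficient: 0 is perfectly even, values near 1 are concentrated."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return (2 * weighted) / (n * total) - (n + 1) / n


# Toy data (hypothetical): three narrow queries over a four-entity catalog.
catalog = ["a", "b", "c", "d"]
rankings = {
    "harry potter": ["a", "b"],
    "harry potter books": ["a", "b"],
    "wizard school novels": ["a", "c"],
}

scores = retrievability(rankings, catalog, k=2)
unretrievable = sum(1 for s in scores.values() if s == 0) / len(catalog)
bias = gini(scores.values())
```

Here entity "d" is never surfaced, so its retrievability is zero; adding queries that bring such entities into the top k is what moves them "from zero to non-zero retrievability" in the sense of the abstract.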

6209. CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion

Geonmo Gu, Sanghyuk Chun, Wonjae Kim, HeeJae Jun, Yoohoon Kang, Sangdoo Yun

arXiv:2303.11916v4

6210. Scene Graph Based Fusion Network For Image-Text Retrieval

Guoliang Wang, Yanlei Shang, Yong Chen

arXiv:2303.11090v1

6211. Controllable Ancient Chinese Lyrics Generation Based on Phrase Prototype Retrieving

Li Yi

arXiv:2303.11005v1

6212. Retrieving Multimodal Information for Augmented Generation: A Survey

Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo et al.

arXiv:2303.10868v3

6213. UNREAL:Unlabeled Nodes Retrieval and Labeling for Heavily-imbalanced Node Classification

Liang Yan, Shengzhong Zhang, Bisheng Li, Min Zhou, Zengfeng Huang

arXiv:2303.10371v1

6214. Textless Speech-to-Music Retrieval Using Emotion Similarity

SeungHeon Doh, Minz Won, Keunwoo Choi, Juhan Nam

arXiv:2303.10539v1

6215. DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen

arXiv:2303.09867v2

6216. Retrieving false claims on Twitter during the Russia-Ukraine conflict

Valerio La Gatta, Chiyu Wei, Luca Luceri, Francesco Pierri, Emilio Ferrara

Companion Proceedings of the ACM Web Conference 2023 (2023)

arXiv:2303.10121v1

6217. Data Roaming and Quality Assessment for Composed Image Retrieval

Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

arXiv:2303.09429v2

6218. UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation

Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Furu Wei et al.

arXiv:2303.08518v4

6219. VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression

Won Jo, Geuntaek Lim, Gwangjin Lee, Hyunwoo Kim, Byungsoo Ko, Yukyung Choi

arXiv:2303.08906v2

6220. Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening

Min Cao, Yang Bai, Jingyao Wang, Ziqiang Cao, Liqiang Nie, Min Zhang

arXiv:2303.07740v1

Benty-search

6201. QUADRo: Dataset and Models for QUestion-Answer Database Retrieval

arXiv:2304.01003v1

6202. Zero-Shot Composed Image Retrieval with Textual Inversion

arXiv:2303.15247v2

6203. Lexicon-Enhanced Self-Supervised Training for Multilingual Dense Retrieval

arXiv:2303.14979v1

6204. Resolution Complete In-Place Object Retrieval given Known Object Models

arXiv:2303.14562v1

6205. Query-Dependent Video Representation for Moment Retrieval and Highlight Detection

arXiv:2303.13874v1

6206. Parameter-Efficient Sparse Retrievers and Rerankers using Adapters

arXiv:2303.13220v1

6207. RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation

arXiv:2303.12570v3

6208. Improving Content Retrievability in Search with Controllable Query Generation

arXiv:2303.11648v1
