benty-fields - Search paper

The paper proposes a Federated Content-Based Medical Image Retrieval (FedCBMIR) platform that utilizes Federated Learning (FL) to address the challenges of acquiring a diverse medical data set for training CBMIR models. CBMIR assists pathologists in diagnosing breast cancer more rapidly by identifying similar medical images and relevant patches in prior cases compared to traditional cancer detection methods. However, CBMIR in histopathology necessitates a pool of Whole Slide Images (WSIs) to train to extract an optimal embedding vector that leverages search engine performance, which may not be available in all centers. The strict regulations surrounding data sharing in medical data sets also hinder research and model development, making it difficult to collect a rich data set. The proposed FedCBMIR distributes the model to collaborative centers for training without sharing the data set, resulting in shorter training times than local training. FedCBMIR was evaluated in two experiments with three scenarios on BreaKHis and Camelyon17 (CAM17). The study shows that the FedCBMIR method increases the F1-Score (F1S) of each client to 98%, 96%, 94%, and 97% in the BreaKHis experiment with a generalized model of four magnifications and does so in 6.30 hours less time than total local training. FedCBMIR also achieves 98% accuracy with CAM17 in 2.49 hours less training time than local training, demonstrating that our FedCBMIR is both fast and accurate for both pathologists and engineers. In addition, our FedCBMIR provides similar images with higher magnification for non-developed countries where participate in the worldwide FedCBMIR with developed countries to facilitate mitosis measuring in breast cancer diagnosis. We evaluate this scenario by scattering BreaKHis into four centers with different magnifications.
Authors' comments: This paper has been submitted in IEEE Access

Vote

Add to Library

Recommend

6165. COLA: A Benchmark for Compositional Text-to-image Retrieval

Arijit Ray, Filip Radenovic, Abhimanyu Dubey, Bryan A. Plummer, Ranjay Krishna, Kate Saenko

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.03689v3

Compositional reasoning is a hallmark of human visual intelligence. Yet, despite the size of large vision-language models, they struggle to represent simple compositions by combining objects with their attributes. To measure this lack of compositional capability, we design Cola, a text-to-image retrieval benchmark to Compose Objects Localized with Attributes. To solve Cola, a model must retrieve images with the correct configuration of attributes and objects and avoid choosing a distractor image with the same objects and attributes but in the wrong configuration. Cola contains about 1.2k composed queries of 168 objects and 197 attributes on around 30K images. Our human evaluation finds that Cola is 83.33% accurate, similar to contemporary compositionality benchmarks. Using Cola as a testbed, we explore empirical modeling designs to adapt pre-trained vision-language models to reason compositionally. We explore 6 adaptation strategies on 2 seminal vision-language models, using compositionality-centric test benchmarks - Cola and CREPE. We find the optimal adaptation strategy is to train a multi-modal attention layer that jointly attends over the frozen pre-trained image and language features. Surprisingly, training multimodal layers on CLIP performs better than tuning a larger FLAVA model with already pre-trained multimodal layers. Furthermore, our adaptation strategy improves CLIP and FLAVA to comparable levels, suggesting that training multimodal layers using contrastive attribute-object data is key, as opposed to using them pre-trained. Lastly, we show that Cola is harder than a closely related contemporary benchmark, CREPE, since simpler fine-tuning strategies without multimodal layers suffice on CREPE but not on Cola. However, we still see a significant gap between our best adaptation and human accuracy, suggesting considerable room for further research.
Authors' comments: Accepted to NeurIPS 2023. Webpage: https://cs-people.bu.edu/array/research/cola/

Vote

Add to Library

Recommend

6166. Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory

Xin Cheng, Di Luo, Xiuying Chen, Lemao Liu, Dongyan Zhao, Rui Yan

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.02437v3

Vote

Add to Library

Recommend

6167. Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Yifan Qiao, Yingrui Yang, Haixin Lin, Tao Yang

Proceedings of the ACM Web Conference 2023 (2023)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.01203v1

Vote

Add to Library

Recommend

6168. Large Language Models are Strong Zero-Shot Retriever

Tao Shen, Guodong Long, Xiubo Geng, Chongyang Tao, Tianyi Zhou, Daxin Jiang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.14233v2

Vote

Add to Library

Recommend

6169. A Personalized Dense Retrieval Framework for Unified Information Access

Hansi Zeng, Surya Kallumadi, Zaid Alibadi, Rodrigo Nogueira, Hamed Zamani

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.13654v1

Vote

Add to Library

Recommend

6170. Retrieval-based Knowledge Augmented Vision Language Pre-training

Jiahua Rao, Zifei Shan, Longpo Liu, Yao Zhou, Yuedong Yang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.13923v2

Vote

Add to Library

Recommend

6171. A Static Pruning Study on Sparse Neural Retrievers

Carlos Lassance, Simon Lupart, Hervé Dejean, Stéphane Clinchant, Nicola Tonellotto

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12702v1

Vote

Add to Library

Recommend

6172. A Preliminary Evaluation of ChatGPT in Requirements Information Retrieval

Jianzhang Zhang, Yiyang Chen, Nan Niu, Chuang Liu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12562v1

Vote

Add to Library

Recommend

6173. Learnable Pillar-based Re-ranking for Image-Text Retrieval

Leigang Qu, Meng Liu, Wenjie Wang, Zhedong Zheng, Liqiang Nie, Tat-Seng Chua

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12570v1

Vote

Add to Library

Recommend

6174. Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes

Xueguang Ma, Tommaso Teofili, Jimmy Lin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12139v1

Vote

Add to Library

Recommend

6175. Constructing Tree-based Index for Efficient and Effective Dense Retrieval

Haitao Li, Qingyao Ai, Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Zheng Liu, Zhao Cao

SIGIR 2023

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.11943v1

Vote

Add to Library

Recommend

6176. Complementarity between decoherence and information retrieval from the environment

Tae-Hun Lee, Jarosław K. Korbicz

Physical Review A, 109 (2024)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12222v2

Vote

Add to Library

Recommend

6177. Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval

Mehdi Rafiei, Alexandros Iosifidis

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.11734v1

Vote

Add to Library

Recommend

6178. Rethinking Benchmarks for Cross-modal Image-text Retrieval

Weijing Chen, Linli Yao, Qin Jin

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.10824v1

Image-text retrieval, as a fundamental and important branch of information retrieval, has attracted extensive research attentions. The main challenge of this task is cross-modal semantic understanding and matching. Some recent works focus more on fine-grained cross-modal semantic matching. With the prevalence of large scale multimodal pretraining models, several state-of-the-art models (e.g. X-VLM) have achieved near-perfect performance on widely-used image-text retrieval benchmarks, i.e. MSCOCO-Test-5K and Flickr30K-Test-1K. In this paper, we review the two common benchmarks and observe that they are insufficient to assess the true capability of models on fine-grained cross-modal semantic matching. The reason is that a large amount of images and texts in the benchmarks are coarse-grained. Based on the observation, we renovate the coarse-grained images and texts in the old benchmarks and establish the improved benchmarks called MSCOCO-FG and Flickr30K-FG. Specifically, on the image side, we enlarge the original image pool by adopting more similar images. On the text side, we propose a novel semi-automatic renovation approach to refine coarse-grained sentences into finer-grained ones with little human effort. Furthermore, we evaluate representative image-text retrieval models on our new benchmarks to demonstrate the effectiveness of our method. We also analyze the capability of models on fine-grained semantic comprehension through extensive experiments. The results show that even the state-of-the-art models have much room for improvement in fine-grained semantic understanding, especially in distinguishing attributes of close objects in images. Our code and improved benchmark datasets are publicly available at: https://github.com/cwj1412/MSCOCO-Flikcr30K_FG, which we hope will inspire further in-depth research on cross-modal retrieval.
Authors' comments: Accepted to SIGIR2023

Vote

Add to Library

Recommend

6179. Image-text Retrieval via Preserving Main Semantics of Vision

Xu Zhang, Xinzheng Niu, Philippe Fournier-Viger, Xudong Dai

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.10254v2

Vote

Add to Library

Recommend

6180. Image retrieval outperforms diffusion models on data augmentation

Max F. Burg, Florian Wenzel, Dominik Zietlow, Max Horn, Osama Makansi, Francesco Locatello, Chris Russell

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.10253v2

Vote

Add to Library

Recommend

Benty-search

6161. SRTK: A Toolkit for Semantic-relevant Subgraph Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.04101v4

6162. Image to Multi-Modal Retrieval for Industrial Scenarios

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.03972v2

6163. A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.03347v1

6164. WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.03383v1

6165. COLA: A Benchmark for Compositional Text-to-image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.03689v3

6166. Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.02437v3

6167. Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2305.01203v1

6168. Large Language Models are Strong Zero-Shot Retriever

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.14233v2

6169. A Personalized Dense Retrieval Framework for Unified Information Access

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.13654v1

6170. Retrieval-based Knowledge Augmented Vision Language Pre-training

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.13923v2

6171. A Static Pruning Study on Sparse Neural Retrievers

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.12702v1

6172. A Preliminary Evaluation of ChatGPT in Requirements Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.12562v1

6173. Learnable Pillar-based Re-ranking for Image-Text Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.12570v1

6174. Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.12139v1

6175. Constructing Tree-based Index for Efficient and Effective Dense Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.11943v1

6176. Complementarity between decoherence and information retrieval from the environment

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.12222v2

6177. Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.11734v1

6178. Rethinking Benchmarks for Cross-modal Image-text Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.10824v1

6179. Image-text Retrieval via Preserving Main Semantics of Vision

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.10254v2

6180. Image retrieval outperforms diffusion models on data augmentation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2304.10253v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.04101v4

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.03972v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.03347v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.03383v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.03689v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.02437v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2305.01203v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.14233v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.13654v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.13923v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12702v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12562v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12570v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12139v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.11943v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.12222v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.11734v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.10824v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.10254v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2304.10253v2