benty-fields - Search paper

Large language models (LLMs) are often augmented with tools to solve complex tasks. By generating code snippets and executing them through task-specific Application Programming Interfaces (APIs), they can offload certain functions to dedicated external modules, such as image encoding and performing calculations. However, most existing approaches to augment LLMs with tools are constrained by general-purpose APIs and lack the flexibility for tailoring them to specific tasks. In this work, we present CRAFT, a general tool creation and retrieval framework for LLMs. It creates toolsets specifically curated for the tasks and equips LLMs with a component that retrieves tools from these sets to enhance their capability to solve complex tasks. For each task, we collect specific code solutions by prompting GPT-4 to solve the training examples. Following a validation step ensuring the correctness, these solutions are abstracted into code snippets to enhance reusability, and deduplicated for higher quality. At inference time, the language model retrieves snippets from the toolsets and then executes them or generates the output conditioning on the retrieved snippets. Our method is designed to be flexible and offers a plug-and-play approach to adapt off-the-shelf LLMs to unseen domains and modalities, without any finetuning. Experiments on vision-language, tabular processing, and mathematical reasoning tasks show that our approach achieves substantial improvements compared to strong baselines. In addition, our in-depth analysis reveals that: (1) consistent performance improvement can be achieved by scaling up the number of tools and the capability of the backbone models; (2) each component of our approach contributes to the performance gains; (3) the created tools are well-structured and reliable with low complexity and atomicity. The code is available at https://github.com/lifan-yuan/CRAFT.
Authors' comments: Accepted to ICLR 2024. Code is available at https://github.com/lifan-yuan/CRAFT

6013. Strong-Field Bloch Electron Interferometry for Band Structure Retrieval

Tobias Weitz, Christian Heide, Peter Hommelhoff

arXiv:2309.16313v1

6014. FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding

Pengxiang Wu, Siman Wang, Kevin Dela Rosa, Derek Hao Hu

arXiv:2309.16249v1

6015. Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections

Tom Bamford, Andrea Coletta, Elizabeth Fons, Sriram Gopalakrishnan, Svitlana Vyetrenko, Tucker Balch, Manuela Veloso

arXiv:2309.16741v2

6016. MKRAG: Medical Knowledge Retrieval Augmented Generation for Medical Question Answering

Yucheng Shi, Shaochen Xu, Tianze Yang, Zhengliang Liu, Tianming Liu, Quanzheng Li, Xiang Li, Ninghao Liu

arXiv:2309.16035v3

6017. Video-adverb retrieval with compositional adverb-action embeddings

Thomas Hummel, Otniel-Bogdan Mercea, A. Sophia Koepke, Zeynep Akata

arXiv:2309.15086v1

6018. Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features

Hila Levi, Guy Heller, Dan Levi, Ethan Fetaya

arXiv:2309.14999v2

6019. Towards Robust and Truly Large-Scale Audio-Sheet Music Retrieval

Luis Carvalho, Gerhard Widmer

arXiv:2309.12158v1

6020. Passage Summarization with Recurrent Models for Audio-Sheet Music Retrieval

Luis Carvalho, Gerhard Widmer

arXiv:2309.12111v1

Benty-search

6001. Self-Knowledge Guided Retrieval Augmentation for Large Language Models

arXiv:2310.05002v1

6002. Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models

arXiv:2310.04027v1

6003. RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation

arXiv:2310.04408v1

6004. An Efficient Content-based Time Series Retrieval System

arXiv:2310.03919v1

6005. Dual-Polarization Phase Retrieval Receiver in Silicon Photonics

arXiv:2310.02467v1

6006. Making Retrieval-Augmented Language Models Robust to Irrelevant Context

arXiv:2310.01558v1

6007. NEUCORE: Neural Concept Reasoning for Composed Image Retrieval

arXiv:2310.01358v1

6008. Scaling Up Music Information Retrieval Training with Semi-Supervised Learning

arXiv:2310.01353v1

6009. BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models

arXiv:2310.01329v1

6010. EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval

arXiv:2310.00970v1

6011. Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval

arXiv:2309.17093v3

6012. CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

arXiv:2309.17428v2
