benty-fields - Search paper

Integrating multiple (sub-)systems is essential to create advanced Information Systems (ISs). Difficulties mainly arise when integrating dynamic environments across the IS lifecycle. A traditional approach is a registry that provides the API documentation of the systems' endpoints. Large Language Models (LLMs) have shown to be capable of automatically creating system integrations (e.g., as service composition) based on this documentation but require concise input due to input token limitations, especially regarding comprehensive API descriptions. Currently, it is unknown how best to preprocess these API descriptions. Within this work, we (i) analyze the usage of Retrieval Augmented Generation (RAG) for endpoint discovery and the chunking, i.e., preprocessing, of OpenAPIs to reduce the input token length while preserving the most relevant information. To further reduce the input token length for the composition prompt and improve endpoint retrieval, we propose (ii) a Discovery Agent that only receives a summary of the most relevant endpoints and retrieves details on demand. We evaluate RAG for endpoint discovery using the RestBench benchmark, first, for the different chunking possibilities and parameters measuring the endpoint retrieval recall, precision, and F1 score. Then, we assess the Discovery Agent using the same test set. With our prototype, we demonstrate how to successfully employ RAG for endpoint discovery to reduce the token count. While revealing high values for recall, precision, and F1, further research is necessary to retrieve all requisite endpoints. Our experiments show that for preprocessing, LLM-based and format-specific approaches outperform na\"ive chunking methods. Relying on an agent further enhances these results as the agent splits the tasks into multiple fine granular subtasks, improving the overall RAG performance in the token count, precision, and F1 score.

Vote

Add to Library

Recommend

5658. EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval

Muhammad Huzaifa, Yova Kementchedjhieva

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.00139v1

Vote

Add to Library

Recommend

5659. Deep Plug-and-Play HIO Approach for Phase Retrieval

Cagatay Isil, Figen S. Oktem

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2411.18967v2

Vote

Add to Library

Recommend

5660. Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Tian Yu, Shaolei Zhang, Yang Feng

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2411.19443v1

Iterative retrieval refers to the process in which the model continuously queries the retriever during generation to enhance the relevance of the retrieved knowledge, thereby improving the performance of Retrieval-Augmented Generation (RAG). Existing work typically employs few-shot prompting or manually constructed rules to implement iterative retrieval. This introduces additional inference overhead and overlooks the remarkable reasoning capabilities of Large Language Models (LLMs). In this paper, we introduce Auto-RAG, an autonomous iterative retrieval model centered on the LLM's powerful decision-making capabilities. Auto-RAG engages in multi-turn dialogues with the retriever, systematically planning retrievals and refining queries to acquire valuable knowledge. This process continues until sufficient external information is gathered, at which point the results are presented to the user. To this end, we develop a method for autonomously synthesizing reasoning-based decision-making instructions in iterative retrieval and fine-tuned the latest open-source LLMs. The experimental results indicate that Auto-RAG is capable of autonomous iterative interaction with the retriever, effectively leveraging the remarkable reasoning and decision-making abilities of LLMs, which lead to outstanding performance across six benchmarks. Further analysis reveals that Auto-RAG can autonomously adjust the number of iterations based on the difficulty of the questions and the utility of the retrieved knowledge, without requiring any human intervention. Moreover, Auto-RAG expresses the iterative retrieval process in natural language, enhancing interpretability while providing users with a more intuitive experience\footnote{Code is available at \url{https://github.com/ictnlp/Auto-RAG}.
Authors' comments: Code is available at https://github.com/ictnlp/Auto-RAG

Vote

Add to Library

Recommend

Benty-search

5641. Bilingual BSARD: Extending Statutory Article Retrieval to Dutch

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.07462v1

5642. Automatic Database Configuration Debugging using Retrieval-Augmented Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.07548v2

5643. Towards Brain Passage Retrieval -- An Investigation of EEG Query Representations

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.06695v3

5644. Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.05831v1

5645. Compositional Image Retrieval via Instruction-Aware Contrastive Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.05756v1

5646. Privacy-Preserving Retrieval Augmented Generation with Differential Privacy

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.04697v1

5647. Ranking Narrative Query Graphs for Biomedical Document Retrieval (Technical Report)

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.15232v1

5648. Deep-Unrolling Multidimensional Harmonic Retrieval Algorithms on Neuromorphic Hardware

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.04008v1

5649. Composed Image Retrieval for Training-Free Domain Conversion

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.03297v1

5650. Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.13205v1

5651. Advancing Similarity Search with GenAI: A Retrieval Augmented Generation Approach

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2501.04006v1

5652. RARE: Retrieval-Augmented Reasoning Enhancement for Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.02830v2

5653. Multi-Facet Blending for Faceted Query-by-Example Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.01443v1

5654. QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.00934v1

5655. Improving Vietnamese Legal Document Retrieval using Synthetic Data

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.00657v1

5656. Zero-shot Musical Stem Retrieval with Joint-Embedding Predictive Architectures

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2411.19806v2

5657. Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2411.19804v1

5658. EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2412.00139v1

5659. Deep Plug-and-Play HIO Approach for Phase Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2411.18967v2

5660. Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2411.19443v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.07462v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.07548v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.06695v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.05831v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.05756v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.04697v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.15232v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.04008v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.03297v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.13205v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2501.04006v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.02830v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.01443v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.00934v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.00657v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2411.19806v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2411.19804v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2412.00139v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2411.18967v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2411.19443v1