benty-fields - Search paper

Integrating multiple (sub-)systems is essential to create advanced Information Systems. Difficulties mainly arise when integrating dynamic environments, e.g., the integration at design time of not yet existing services. This has been traditionally addressed using a registry that provides the API documentation of the endpoints. Large Language Models have shown to be capable of automatically creating system integrations (e.g., as service composition) based on this documentation but require concise input due to input oken limitations, especially regarding comprehensive API descriptions. Currently, it is unknown how best to preprocess these API descriptions. In the present work, we (i) analyze the usage of Retrieval Augmented Generation for endpoint discovery and the chunking, i.e., preprocessing, of state-of-practice OpenAPIs to reduce the input oken length while preserving the most relevant information. To further reduce the input token length for the composition prompt and improve endpoint retrieval, we propose (ii) a Discovery Agent that only receives a summary of the most relevant endpoints nd retrieves specification details on demand. We evaluate RAG for endpoint discovery using (iii) a proposed novel service discovery benchmark SOCBench-D representing a general setting across numerous domains and the real-world RestBench enchmark, first, for the different chunking possibilities and parameters measuring the endpoint retrieval accuracy. Then, we assess the Discovery Agent using the same test data set. The prototype shows how to successfully employ RAG for endpoint discovery to reduce the token count. Our experiments show that endpoint-based approaches outperform naive chunking methods for preprocessing. Relying on an agent significantly improves precision while being prone to decrease recall, disclosing the need for further reasoning capabilities.
Authors' comments: arXiv admin note: substantial text overlap with arXiv:2411.19804

Vote

Add to Library

Recommend

5297. POQD: Performance-Oriented Query Decomposer for Multi-vector retrieval

Yaoyang Liu, Junlin Li, Yinjun Wu, Zhen Chen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.19189v1

Vote

Add to Library

Recommend

5298. Federated Retrieval-Augmented Generation: A Systematic Mapping Study

Abhijit Chakraborty, Chahana Dahal, Vivek Gupta

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.18906v1

Vote

Add to Library

Recommend

5299. RoleRAG: Enhancing LLM Role-Playing via Graph Guided Retrieval

Yongjie Wang, Jonathan Leung, Zhiqi Shen

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.18541v1

Vote

Add to Library

Recommend

5300. BRIT: Bidirectional Retrieval over Unified Image-Text Graph

Ainulla Khan, Yamada Moyuru, Srinidhi Akella

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.18450v1

Vote

Add to Library

Recommend

Benty-search

5281. Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.23275v1

5282. Retrieval Augmented Generation based Large Language Models for Causality Mining

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.23944v1

5283. Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.23952v1

5284. Detecting Undesired Process Behavior by Means of Retrieval Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.22041v1

5285. Evaluating the Retrieval Robustness of Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.21870v1

5286. LegalSearchLM: Rethinking Legal Case Retrieval as Legal Elements Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.23832v1

5287. Scientific Paper Retrieval with LLM-Guided Semantic-Based Ranking

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.21815v1

5288. Aligning Proteins and Language: A Foundation Model for Protein Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2506.08023v1

5289. Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.20825v1

5290. DISRetrieval: Harnessing Discourse Structure for Long Document Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2506.06313v1

5291. LogiCoL: Logically-Informed Contrastive Learning for Set-based Dense Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.19588v1

5292. Rethinking Text-based Protein Understanding: Retrieval or LLM?

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.20354v1

5293. REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.20613v2

5294. Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.19356v1

5295. DocMMIR: A Framework for Document Multi-modal Information Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.19312v1

5296. Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.19310v1

5297. POQD: Performance-Oriented Query Decomposer for Multi-vector retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.19189v1

5298. Federated Retrieval-Augmented Generation: A Systematic Mapping Study

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.18906v1

5299. RoleRAG: Enhancing LLM Role-Playing via Graph Guided Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.18541v1

5300. BRIT: Bidirectional Retrieval over Unified Image-Text Graph

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2505.18450v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.23275v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.23944v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.23952v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.22041v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.21870v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.23832v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.21815v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2506.08023v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.20825v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2506.06313v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.19588v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.20354v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.20613v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.19356v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.19312v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.19310v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.19189v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.18906v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.18541v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2505.18450v1