benty-fields - Search paper

5521. Improving Retrieval-Augmented Deep Assertion Generation via Joint Training

Quanjun Zhang, Chunrong Fang, Yi Zheng, Ruixiang Qian, Shengcheng Yu, Yuan Zhao, Jianyi Zhou, Yun Yang et al.

arXiv:2502.10696v2

Unit testing attempts to validate the correctness of basic units of the software system under test and has a crucial role in software development and testing. Very recent work proposes a retrieve-and-edit approach to generate unit test oracles, i.e., assertions. Despite being promising, it is still far from perfect due to some limitations, such as splitting assertion retrieval and generation into two separate components without benefiting each other. In this paper, we propose AG-RAG, a retrieval-augmented automated assertion generation approach that leverages external codebases and joint training to address various technical limitations of prior work. Inspired by the plastic surgery hypothesis, AG-RAG attempts to combine relevant unit tests and advanced pre-trained language models (PLMs) with retrieval-augmented fine-tuning. AG-RAG builds a dense retriever to search for relevant test-assert pairs (TAPs) with semantic matching and a retrieval-augmented generator to synthesize accurate assertions with the focal-test and retrieved TAPs as input. Besides, AG-RAG leverages a code-aware language model CodeT5 as the cornerstone to facilitate both assertion retrieval and generation tasks. Furthermore, the retriever is optimized in conjunction with the generator as a whole pipeline with a joint training strategy. This unified design fully adapts both components specifically for retrieving more useful TAPs, thereby generating accurate assertions. We extensively evaluate AG-RAG against six state-of-the-art AG approaches on two benchmarks and three metrics. Experimental results show that AG-RAG significantly outperforms previous AG approaches on all benchmarks and metrics, e.g., improving the most recent baseline EditAS by 20.82% and 26.98% in terms of accuracy. AG-RAG also correctly generates 1739 and 2866 unique assertions that all baselines fail to generate, 3.45X and 9.20X more than EditAS.
Authors' comments: Accepted to IEEE Transactions on Software Engineering (TSE 2025)
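
The retrieve-then-generate flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-tokens `embed` function is a toy stand-in for the CodeT5-based dense retriever, the `[SEP]` separator and all function names are assumptions made for illustration, and the joint training of retriever and generator is not shown.

```python
import math
import re
from collections import Counter

def embed(code: str) -> Counter:
    """Toy stand-in for a learned encoder: counts of word-like tokens."""
    return Counter(re.findall(r"\w+", code))

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(focal_test: str, corpus: list, k: int = 1) -> list:
    """Return the k test-assert pairs (TAPs) whose test is most similar to the focal test."""
    query = embed(focal_test)
    ranked = sorted(corpus, key=lambda tap: cosine(query, embed(tap[0])), reverse=True)
    return ranked[:k]

def build_generator_input(focal_test: str, retrieved: list) -> str:
    """Concatenate the focal test with retrieved TAPs (hypothetical [SEP] separator)."""
    parts = [focal_test]
    for test, assertion in retrieved:
        parts.extend([test, assertion])
    return " [SEP] ".join(parts)

# Hypothetical external codebase of (test prefix, assertion) pairs.
CORPUS = [
    ("stack.push(1); int size = stack.size();", "assertEquals(1, size)"),
    ("list.clear(); boolean empty = list.isEmpty();", "assertTrue(empty)"),
]

focal = "stack.push(7); stack.push(8); int size = stack.size();"
top = retrieve(focal, CORPUS, k=1)
generator_input = build_generator_input(focal, top)
```

In this sketch the stack-related TAP is ranked first because it shares tokens with the focal test, and its assertion becomes part of the input a generator model would edit or adapt.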

Benty-search

5522. Retrieval-augmented Encoders for Extreme Multi-label Text Classification

arXiv:2502.10615v1

5523. Dataset Protection via Watermarked Canaries in Retrieval-Augmented LLMs

arXiv:2502.10673v1

5524. Retrieving maximum information of symmetric states from their corrupted copies

arXiv:2502.10627v1

5525. ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation

arXiv:2502.09411v1

5526. ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation

arXiv:2502.09891v1

5527. ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval

arXiv:2502.07971v1

5528. Fast and Accurate Antibody Sequence Design via Structure Retrieval

arXiv:2502.19395v1

5529. DOGR: Leveraging Document-Oriented Contrastive Learning in Generative Retrieval

arXiv:2502.07219v1

5530. PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval

arXiv:2502.07215v1

5531. Repository-level Code Search with Neural Retrieval Methods

arXiv:2502.07067v1

5532. Retrieving Filter Spectra in CNN for Explainable Sleep Stage Classification

arXiv:2502.06478v1

5533. Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection

arXiv:2502.06148v1

5534. FlashCheck: Exploration of Efficient Evidence Retrieval for Fast Fact-Checking

arXiv:2502.05803v2

5535. Retrieval-augmented Large Language Models for Financial Time Series Forecasting

arXiv:2502.05878v2

5536. On Memory Construction and Retrieval for Personalized Conversational Agents

arXiv:2502.05589v2

5537. Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation

arXiv:2502.15734v1

5538. Circuit Diagram Retrieval Based on Hierarchical Circuit Graph Representation

arXiv:2503.11658v1

5539. Expertized Caption Auto-Enhancement for Video-Text Retrieval

arXiv:2502.02885v3

5540. VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

arXiv:2502.01549v1
