benty-fields - Search paper

8321. PathMind: A Retrieve-Prioritize-Reason Framework for Knowledge Graph Reasoning with Large Language Models

Yu Liu, Xixun Lin, Yanmin Shang, Yangxi Li, Shi Wang, Yanan Cao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14256v1

Vote

Add to Library

Recommend

8322. Towards Authentic Movie Dubbing with Retrieve-Augmented Director-Actor Interaction Learning

Rui Liu, Yuan Zhao, Zhenqi Jia

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14249v1

The automatic movie dubbing model generates vivid speech from given scripts, replicating a speaker's timbre from a brief timbre prompt while ensuring lip-sync with the silent video. Existing approaches simulate a simplified workflow where actors dub directly without preparation, overlooking the critical director-actor interaction. In contrast, authentic workflows involve a dynamic collaboration: directors actively engage with actors, guiding them to internalize the context cues, specifically emotion, before performance. To address this issue, we propose a new Retrieve-Augmented Director-Actor Interaction Learning scheme to achieve authentic movie dubbing, termed Authentic-Dubber, which contains three novel mechanisms: (1) We construct a multimodal Reference Footage library to simulate the learning footage provided by directors. Note that we integrate Large Language Models (LLMs) to achieve deep comprehension of emotional representations across multimodal signals. (2) To emulate how actors efficiently and comprehensively internalize director-provided footage during dubbing, we propose an Emotion-Similarity-based Retrieval-Augmentation strategy. This strategy retrieves the most relevant multimodal information that aligns with the target silent video. (3) We develop a Progressive Graph-based speech generation approach that incrementally incorporates the retrieved multimodal emotional knowledge, thereby simulating the actor's final dubbing process. The above mechanisms enable the Authentic-Dubber to faithfully replicate the authentic dubbing workflow, achieving comprehensive improvements in emotional expressiveness. Both subjective and objective evaluations on the V2C Animation benchmark dataset validate the effectiveness. The code and demos are available at https://github.com/AI-S2-Lab/Authentic-Dubber.
Authors' comments: Accepted by AAAI 2026

Vote

Add to Library

Recommend

8323. SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM

An Yu, Weiheng Lu, Jian Li, Zhenfei Zhang, Yunhang Shen, Felix X. -F. Ye, Ming-Ching Chang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14143v1

Vote

Add to Library

Recommend

8324. MalRAG: A Retrieval-Augmented LLM Framework for Open-set Malicious Traffic Identification

Xiang Luo, Chang Liu, Gang Xiong, Chen Yang, Gaopeng Gou, Yaochen Ren, Zhen Li

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14129v1

Fine-grained identification of IDS-flagged suspicious traffic is crucial in cybersecurity. In practice, cyber threats evolve continuously, making the discovery of novel malicious traffic a critical necessity as well as the identification of known classes. Recent studies have advanced this goal with deep models, but they often rely on task-specific architectures that limit transferability and require per-dataset tuning. In this paper we introduce MalRAG, the first LLM driven retrieval-augmented framework for open-set malicious traffic identification. MalRAG freezes the LLM and operates via comprehensive traffic knowledge construction, adaptive retrieval, and prompt engineering. Concretely, we construct a multi-view traffic database by mining prior malicious traffic from content, structural, and temporal perspectives. Furthermore, we introduce a Coverage-Enhanced Retrieval Algorithm that queries across these views to assemble the most probable candidates, thereby improving the inclusion of correct evidence. We then employ Traffic-Aware Adaptive Pruning to select a variable subset of these candidates based on traffic-aware similarity scores, suppressing incorrect matches and yielding reliable retrieved evidence. Moreover, we develop a suite of guidance prompts where task instruction, evidence referencing, and decision guidance are integrated with the retrieved evidence to improve LLM performance. Across diverse real-world datasets and settings, MalRAG delivers state-of-the-art results in both fine-grained identification of known classes and novel malicious traffic discovery. Ablation and deep-dive analyses further show that MalRAG effective leverages LLM capabilities yet achieves open-set malicious traffic identification without relying on a specific LLM.
Authors' comments: 13 pages, 13 figures. Intended for submission to IEEE Transactions on Information Forensics and Security (TIFS)

Vote

Add to Library

Recommend

8325. NeuroPath: Neurobiology-Inspired Path Tracking and Reflection for Semantically Coherent Retrieval

Junchen Li, Rongzheng Wang, Yihong Huang, Qizhi Chen, Jiasheng Zhang, Shuang Liang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14096v1

Vote

Add to Library

Recommend

8326. AISAC: An Integrated multi-agent System for Transparent, Retrieval-Grounded Scientific Assistance

Chandrachur Bhattacharya, Sibendu Som

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14043v1

Vote

Add to Library

Recommend

8327. Searching in Space and Time: Unified Memory-Action Loops for Open-World Object Retrieval

Taijing Chen, Sateesh Kumar, Junhong Xu, George Pavlakos, J oydeep Biswas, Roberto Martín-Martín

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14004v1

Vote

Add to Library

Recommend

8328. TaoSearchEmb: A Multi-Objective Reinforcement Learning Framework for Dense Retrieval in Taobao Search

Xingxian Liu, Dongshuai Li, Tao Wen, Jiahui Wan, Gui Ling, Fuyu Lv, Dan Ou, Haihong Tang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13885v1

Vote

Add to Library

Recommend

8329. Hierarchical Retrieval with Out-Of-Vocabulary Queries: A Case Study on SNOMED CT

Jonathon Dilworth, Hui Yang, Jiaoyan Chen, Yongsheng Gao

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.16698v1

Vote

Add to Library

Recommend

8330. Automated Construction of Medical Indicator Knowledge Graphs Using Retrieval Augmented Large Language Models

Zhengda Wang, Daqian Shi, Jingyi Zhao, Xiaolei Diao, Xiongfeng Tang, Yanguo Qin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13526v1

Vote

Add to Library

Recommend

8331. Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval

Chuang Zhao, Hui Tang, Hongke Zhao, Xiaofang Zhou, Xiaomeng Li

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13293v1

Vote

Add to Library

Recommend

8332. Cog-RAG: Cognitive-Inspired Dual-Hypergraph with Theme Alignment Retrieval-Augmented Generation

Hao Hu, Yifan Feng, Ruoxue Li, Rundong Xue, Xingliang Hou, Zhiqiang Tian, Yue Gao, Shaoyi Du

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13201v1

Vote

Add to Library

Recommend

8333. Optimal Foraging in Memory Retrieval: Evaluating Random Walks and Metropolis-Hastings Sampling in Modern Semantic Spaces

James Moore

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.12759v1

Vote

Add to Library

Recommend

8334. TAdaRAG: Task Adaptive Retrieval-Augmented Generation via On-the-Fly Knowledge Graph Construction

Jie Zhang, Bo Tang, Wanzi Shao, Wenqiang Wei, Jihao Zhao, Jianqing Zhu, Zhiyu li, Wen Xi et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.12520v1

Vote

Add to Library

Recommend

8335. LLM-Powered Text-Attributed Graph Anomaly Detection via Retrieval-Augmented Reasoning

Haoyan Xu, Ruizhi Qian, Zhengtao Yao, Ziyi Liu, Li Li, Yuqi Li, Yanshu Li, Wenqing Zheng et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.17584v1

Vote

Add to Library

Recommend

8336. Phase-Coded Memory and Morphological Resonance: A Next-Generation Retrieval-Augmented Generator Architecture

Denis V. Saklakov

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.11848v1

Vote

Add to Library

Recommend

8337. GRIN Transfer: A production-ready tool for libraries to retrieve digital copies from Google Books

Liza Daly, Matteo Cargnelutti, Catherine Brobston, John Hess, Greg Leppert, Amanda Watson, Jonathan Zittrain

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.11447v1

Vote

Add to Library

Recommend

8338. CLARITY: Contextual Linguistic Adaptation and Accent Retrieval for Dual-Bias Mitigation in Text-to-Speech Generation

Crystal Min Hui Poon, Pai Chet Ng, Xiaoxiao Miao, Immanuel Jun Kai Loh, Bowen Zhang, Haoyu Song, Ian Mcloughlin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.11104v1

Vote

Add to Library

Recommend

8339. Expert-Guided Prompting and Retrieval-Augmented Generation for Emergency Medical Service Question Answering

Xueren Ge, Sahil Murtaza, Anthony Cortez, Homa Alemzadeh

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.10900v1

Vote

Add to Library

Recommend

8340. URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding

Yongxin Shi, Jiapeng Wang, Zeyu Shan, Dezhi Peng, Zening Lin, Lianwen Jin

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.10552v1

Vote

Add to Library

Recommend

Benty-search

8321. PathMind: A Retrieve-Prioritize-Reason Framework for Knowledge Graph Reasoning with Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14256v1

8322. Towards Authentic Movie Dubbing with Retrieve-Augmented Director-Actor Interaction Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14249v1

8323. SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14143v1

8324. MalRAG: A Retrieval-Augmented LLM Framework for Open-set Malicious Traffic Identification

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14129v1

8325. NeuroPath: Neurobiology-Inspired Path Tracking and Reflection for Semantically Coherent Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14096v1

8326. AISAC: An Integrated multi-agent System for Transparent, Retrieval-Grounded Scientific Assistance

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14043v1

8327. Searching in Space and Time: Unified Memory-Action Loops for Open-World Object Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.14004v1

8328. TaoSearchEmb: A Multi-Objective Reinforcement Learning Framework for Dense Retrieval in Taobao Search

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.13885v1

8329. Hierarchical Retrieval with Out-Of-Vocabulary Queries: A Case Study on SNOMED CT

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.16698v1

8330. Automated Construction of Medical Indicator Knowledge Graphs Using Retrieval Augmented Large Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.13526v1

8331. Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.13293v1

8332. Cog-RAG: Cognitive-Inspired Dual-Hypergraph with Theme Alignment Retrieval-Augmented Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.13201v1

8333. Optimal Foraging in Memory Retrieval: Evaluating Random Walks and Metropolis-Hastings Sampling in Modern Semantic Spaces

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.12759v1

8334. TAdaRAG: Task Adaptive Retrieval-Augmented Generation via On-the-Fly Knowledge Graph Construction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.12520v1

8335. LLM-Powered Text-Attributed Graph Anomaly Detection via Retrieval-Augmented Reasoning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.17584v1

8336. Phase-Coded Memory and Morphological Resonance: A Next-Generation Retrieval-Augmented Generator Architecture

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.11848v1

8337. GRIN Transfer: A production-ready tool for libraries to retrieve digital copies from Google Books

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.11447v1

8338. CLARITY: Contextual Linguistic Adaptation and Accent Retrieval for Dual-Bias Mitigation in Text-to-Speech Generation

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.11104v1

8339. Expert-Guided Prompting and Retrieval-Augmented Generation for Emergency Medical Service Question Answering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.10900v1

8340. URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2511.10552v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14256v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14249v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14143v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14129v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14096v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14043v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.14004v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13885v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.16698v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13526v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13293v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.13201v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.12759v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.12520v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.17584v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.11848v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.11447v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.11104v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.10900v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2511.10552v1