benty-fields - Search paper

2881. IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios

Hai Lin, Shaoxiong Zhan, Junyou Su, Haitao Zheng, Hui Wang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2409.15763v2

Vote

Add to Library

Recommend

2882. Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems

Yunxiao Shi, Xing Zi, Zijing Shi, Haimin Zhang, Qiang Wu, Min Xu

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2407.10670v1

Vote

Add to Library

Recommend

2883. Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model

Sai Ganesh, Anupam Purwar, Gautam B

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2406.16383v1

Vote

Add to Library

Recommend

2884. Npix2Cpix: A GAN-based Image-to-Image Translation Network with Retrieval-Classification Integration for Watermark Retrieval from Historical Document Images

Utsab Saha, Sawradip Saha, Shaikh Anowarul Fattah, Mohammad Saquib

IEEE Access, 12, 95857-95870 (2024)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2406.03556v2

Vote

Add to Library

Recommend

2885. Domain Adaptation for Dense Retrieval and Conversational Dense Retrieval through Self-Supervision by Meticulous Pseudo-Relevance Labeling

Minghan Li, Eric Gaussier

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2403.08970v1

Vote

Add to Library

Recommend

2886. Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2206.14381v2

Vote

Add to Library

Recommend

2887. Pixel Wised Lesion Prediction on COVID-19 CT Imagery: A Comparative Analysis of Automated Image Segmentation Architectures

Sarmad Khan, Arslan Shaukat, Umer Asgher, Basim Azam

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.20459v1

Vote

Add to Library

Recommend

2888. Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning

Jan Netík, Patrícia Martinková

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.16991v1

Vote

Add to Library

Recommend

2889. DeepTumorVQA: A Hierarchical 3D CT Benchmark for Stage-Wise Evaluation of Medical VLMs and Tool-Augmented Agents

Yixiong Chen, Wenjie Xiao, Pedro R. A. S. Bassi, Boyan Wang, Liang He, Xinze Zhou, Sezgin Er, Ibrahim Ethem Hamamci et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.09679v1

Vote

Add to Library

Recommend

2890. The faint voice of a radio-weak BL Lacertae: modeling the broadband emission of WISE~J141046.00+740511.2

A. M. Carulli, F. L. Vieyro, M. M. Reynoso, E. J. Marchesini, I. Andruchow

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.28074v1

Vote

Add to Library

Recommend

2891. ROSA: Robust and Energy-Efficient Microring-Based Optical Neural Networks via Optical Shift-and-Add and Layer-Wise Hybrid Mapping

Huifan Zhang, Yun Hu, Caizhi Sheng, Yurui Qu, Pingqiang Zhou

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.00032v1

Vote

Add to Library

Recommend

2892. LKV: End-to-End Learning of Head-wise Budgets and Token Selection for LLM KV Cache Eviction

Enshuai Zhou, Yifan Hao, Chao Wang, Rui Zhang, Di Huang, Jiaming Guo, Xing Hu, Zidong Du et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.06676v1

Vote

Add to Library

Recommend

2893. Beyond Semantic Similarity: A Component-Wise Evaluation Framework for Medical Question Answering Systems with Health Equity Implications

Abu Noman Md Sakib, Md. Main Oddin Chisty, Zijie Zhang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.19281v1

The use of Large Language Models (LLMs) to support patients in addressing medical questions is becoming increasingly prevalent. However, most of the measures currently used to evaluate the performance of these models in this context only measure how closely a model's answers match semantically, and therefore do not provide a true indication of the model's medical accuracy or of the health equity risks associated with it. To address these shortcomings, we present a new evaluation framework for medical question answering called VB-Score (Verification-Based Score) that provides a separate evaluation of the four components of entity recognition, semantic similarity, factual consistency, and structured information completeness for medical question-answering models. We perform rigorous reviews of the performance of three well-known and widely used LLMs on 48 public health-related topics taken from high-quality, authoritative information sources. Based on our analyses, we discover a major discrepancy between the models' semantic and entity accuracy. Our assessments of the performance of all three models show that each of them has almost uniformly severe performance failures when evaluated against our criteria. Our findings indicate alarming performance disparities across various public health topics, with most of the models exhibiting 13.8% lower performance (compared to an overall average) for all the public health topics that relate to chronic conditions that occur in older and minority populations, which indicates the existence of what's known as condition-based algorithmic discrimination. Our findings also demonstrate that prompt engineering alone does not compensate for basic architectural limitations on how these models perform in extracting medical entities and raise the question of whether semantic evaluation alone is a sufficient measure of medical AI safety.
Authors' comments: Accepted in the Ninth Annual ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT) 2026

Vote

Add to Library

Recommend

2894. Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy

Shun-ichiro Hayashi, Daichi Mukunoki, Tetsuya Hoshino, Takahiro Katagiri

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.17182v1

Vote

Add to Library

Recommend

2895. Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Columns under Transient Operating Conditions

Debadutta Patra, Ayush Bardhan Tripathy, Soumya Ranjan Sahu, Sucheta Panda

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.24644v1

Digital twin technology, when combined with physics-informed machine learning with simulation results of Aspen, offers transformative capabilities for industrial process monitoring, control, and optimization. In this work, the proposed model presents a Physics-Informed Neural Network (PINN) digital twin framework for the dynamic, tray-wise modeling of binary distillation columns operating under transient conditions. The architecture of the proposed model embeds fundamental thermodynamic constraints, including vapor-liquid equilibrium (VLE) described by modified Raoult's law, tray-level mass and energy balances, and the McCabe-Thiele graphical methodology directly into the neural network loss function via physics residual terms. The model is trained and evaluated on a high-fidelity synthetic dataset of 961 timestamped measurements spanning 8 hours of transient operation, generated in Aspen HYSYS for a binary HX/TX distillation system comprising 16 sensor streams. An adaptive loss-weighting scheme balances the data fidelity and physics consistency objectives during training. Compared to five data-driven baselines (LSTM, vanilla MLP, GRU, Transformer, DeepONet), the proposed PINN achieves an RMSE of 0.00143 for HX mole fraction prediction (R^2 = 0.9887), representing a 44.6% reduction over the best data-only baseline, while strictly satisfying thermodynamic constraints. Tray-wise temperature and composition profiles predicted under transient perturbations demonstrate that the digital twin accurately captures column dynamics including feed tray responses, reflux ratio variations, and pressure transients. These results establish the proposed PINN digital twin as a robust foundation for real-time soft sensing, model-predictive control, and anomaly detection in industrial distillation processes.
Authors' comments: 17 pages, 10 figures

Vote

Add to Library

Recommend

2896. Training-Free Sparse Attention for Fast Video Generation via Offline Layer-Wise Sparsity Profiling and Online Bidirectional Co-Clustering

Jiayi Luo, Jiayu Chen, Jiankun Wang, Cong Wang, Hanxin Zhu, Qingyun Sun, Chen Gao, Zhibo Chen et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.18636v1

Vote

Add to Library

Recommend

2897. Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity

Hengyuan Zhang, Xinrong Chen, Zunhai Su, Xiao Liang, Jing Xiong, Wendong Xu, He Xiao, Chaofan Tao et al.

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.17354v1

Vote

Add to Library

Recommend

2898. Continual Learning via Ensemble-Based Depth-Wise Masked Autoencoders for Data Quality Monitoring in High-Energy Physics

Dale Julson, Eric Reinhardt, Andrii Krutsylo, Resham Sohal, Guillermo Fidalgo, Sergei Gleyzer, Emanuele Usai, The CMS HCAL Collaboration

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.02369v1

Vote

Add to Library

Recommend

2899. DynaMoE: Dynamic Token-Level Expert Activation with Layer-Wise Adaptive Capacity for Mixture-of-Experts Neural Networks

Gökdeniz Gülmez

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.01697v1

Vote

Add to Library

Recommend

2900. OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis

Junuk Cha, Jihyeon Kim, Han-Mu Park

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.22949v1

Fingerspelling is a component of sign languages in which words are spelled out letter by letter using specific hand poses. Automatic fingerspelling recognition plays a crucial role in bridging the communication gap between Deaf and hearing communities, yet it remains challenging due to the signing-hand ambiguity issue, the lack of appropriate training losses, and the out-of-vocabulary (OOV) problem. Prior fingerspelling recognition methods rely on explicit signing-hand detection, which often leads to recognition failures, and on a connectionist temporal classification (CTC) loss, which exhibits the peaky behavior problem. To address these issues, we develop OpenFS, an open-source approach for fingerspelling recognition and synthesis. We propose a multi-hand-capable fingerspelling recognizer that supports both single- and multi-hand inputs and performs implicit signing-hand detection by incorporating a dual-level positional encoding and a signing-hand focus (SF) loss. The SF loss encourages cross-attention to focus on the signing hand, enabling implicit signing-hand detection during recognition. Furthermore, without relying on the CTC loss, we introduce a monotonic alignment (MA) loss that enforces the output letter sequence to follow the temporal order of the input pose sequence through cross-attention regularization. In addition, we propose a frame-wise letter-conditioned generator that synthesizes realistic fingerspelling pose sequences for OOV words. This generator enables the construction of a new synthetic benchmark, called FSNeo. Through comprehensive experiments, we demonstrate that our approach achieves state-of-the-art performance in recognition and validate the effectiveness of the proposed recognizer and generator. Codes and data are available in: https://github.com/JunukCha/OpenFS.
Authors' comments: Accepted to CVPR 2026

Vote

Add to Library

Recommend

Benty-search

2881. IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2409.15763v2

2882. Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2407.10670v1

2883. Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2406.16383v1

2884. Npix2Cpix: A GAN-based Image-to-Image Translation Network with Retrieval-Classification Integration for Watermark Retrieval from Historical Document Images

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2406.03556v2

2885. Domain Adaptation for Dense Retrieval and Conversational Dense Retrieval through Self-Supervision by Meticulous Pseudo-Relevance Labeling

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2403.08970v1

2886. Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2206.14381v2

2887. Pixel Wised Lesion Prediction on COVID-19 CT Imagery: A Comparative Analysis of Automated Image Segmentation Architectures

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2605.20459v1

2888. Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2605.16991v1

2889. DeepTumorVQA: A Hierarchical 3D CT Benchmark for Stage-Wise Evaluation of Medical VLMs and Tool-Augmented Agents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2605.09679v1

2890. The faint voice of a radio-weak BL Lacertae: modeling the broadband emission of WISE~J141046.00+740511.2

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.28074v1

2891. ROSA: Robust and Energy-Efficient Microring-Based Optical Neural Networks via Optical Shift-and-Add and Layer-Wise Hybrid Mapping

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2605.00032v1

2892. LKV: End-to-End Learning of Head-wise Budgets and Token Selection for LLM KV Cache Eviction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2605.06676v1

2893. Beyond Semantic Similarity: A Component-Wise Evaluation Framework for Medical Question Answering Systems with Health Equity Implications

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.19281v1

2894. Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2604.17182v1

2895. Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Columns under Transient Operating Conditions

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.24644v1

2896. Training-Free Sparse Attention for Fast Video Generation via Offline Layer-Wise Sparsity Profiling and Online Bidirectional Co-Clustering

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.18636v1

2897. Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.17354v1

2898. Continual Learning via Ensemble-Based Depth-Wise Masked Autoencoders for Data Quality Monitoring in High-Energy Physics

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.02369v1

2899. DynaMoE: Dynamic Token-Level Expert Activation with Layer-Wise Adaptive Capacity for Mixture-of-Experts Neural Networks

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2603.01697v1

2900. OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2602.22949v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2409.15763v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2407.10670v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2406.16383v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2406.03556v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2403.08970v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2206.14381v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.20459v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.16991v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.09679v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.28074v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.00032v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2605.06676v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.19281v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2604.17182v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.24644v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.18636v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.17354v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.02369v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2603.01697v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2602.22949v1