benty-fields - Search paper

1541. L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers

Sofia Casarin, Sergio Escalera, Oswald Lanz

arXiv:2505.07300v1

1542. YANNs: Y-wise Affine Neural Networks for Exact and Efficient Representations of Piecewise Linear Functions

Austin Braniff, Yuhe Tian

arXiv:2505.07054v1

1543. From Pixels to Perception: Interpretable Predictions via Instance-wise Grouped Feature Selection

Moritz Vandenhirtz, Julia E. Vogt

arXiv:2505.06003v2

1544. A targeted search for binary white dwarf pulsars using Gaia and WISE

Ingrid Pelisoli, T. R. Marsh, G. Tovmassian, L. A. Amaral, Amornrat Aungwerojwit, M. J. Green, R. P. Ashley, David A. H. Buckley et al.

arXiv:2505.04693v1

1545. SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting

Shiwei Guo, Ziang Chen, Yupeng Ma, Yunfei Han, Yi Wang

arXiv:2505.02655v1

1546. Wise Goose Chase: A Predictive Path Planning Algorithm for Dynamic Rebalancing in Ride-Hailing Systems

Avalpreet Singh Brar, Rong Su, Christos G. Cassandras, Gioele Zardini

arXiv:2505.02603v1

1547. Gateformer: Advancing Multivariate Time Series Forecasting through Temporal and Variate-Wise Attention with Gated Representations

Yu-Hsiang Lan, Eric K. Oermann

arXiv:2505.00307v2

1548. Element-wise description of the $\mathcal I$-characterized subgroups of the circle

Raffaele Di Santo, Dikran Dikranjan, Anna Giordano Bruno, Hans Weber

arXiv:2504.21642v1

1549. Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning

Pengxiang Li, Zhi Gao, Bofei Zhang, Yapeng Mi, Xiaojian Ma, Chenrui Shi, Tao Yuan, Yuwei Wu et al.

arXiv:2504.21561v3

Multimodal agents, which integrate a controller (e.g., a vision language model) with external tools, have demonstrated remarkable capabilities in tackling complex multimodal tasks. Existing approaches for training these agents, both supervised fine-tuning and reinforcement learning, depend on extensive human-annotated task-answer pairs and tool trajectories. However, for complex multimodal tasks, such annotations are prohibitively expensive or impractical to obtain. In this paper, we propose an iterative tool usage exploration method for multimodal agents without any pre-collected data, namely SPORT, via step-wise preference optimization to refine the trajectories of tool usage. Our method enables multimodal agents to autonomously discover effective tool usage strategies through self-exploration and optimization, eliminating the bottleneck of human annotation. SPORT has four iterative components: task synthesis, step sampling, step verification, and preference tuning. We first synthesize multimodal tasks using language models. Then, we introduce a novel trajectory exploration scheme, where step sampling and step verification are executed alternately to solve synthesized tasks. In step sampling, the agent tries different tools and obtains corresponding results. In step verification, we employ a verifier to provide AI feedback to construct step-wise preference data. The data is subsequently used to update the controller for tool usage through preference tuning, producing a SPORT agent. By interacting with real environments, the SPORT agent gradually evolves into a more refined and capable system. Evaluation in the GTA and GAIA benchmarks shows that the SPORT agent achieves 6.41% and 3.64% improvements, underscoring the generalization and effectiveness introduced by our method. The project page is https://SPORT-Agents.github.io.
Authors' comments: 24 pages


1550. Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction

Changjun Li, Runqing Jiang, Zhuo Song, Pengpeng Yu, Ye Zhang, Yulan Guo

arXiv:2505.00259v1

1551. semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage

Ke Hong, Lufang Chen, Zhong Wang, Xiuhong Li, Qiuli Mao, Jianping Ma, Chao Xiong, Guanyu Wu et al.

arXiv:2504.19867v1

Existing large language model (LLM) serving systems fall into two categories: 1) a unified system where prefill phase and decode phase are co-located on the same GPU, sharing the unified computational resource and storage, and 2) a disaggregated system where the two phases are disaggregated to different GPUs. The design of the disaggregated system addresses the latency interference and sophisticated scheduling issues in the unified system but leads to storage challenges including 1) replicated weights for both phases that prevent flexible deployment, 2) KV cache transfer overhead between the two phases, 3) storage imbalance that causes substantial wasted space of the GPU capacity, and 4) suboptimal resource adjustment arising from the difficulties in migrating KV cache. Such storage inefficiency delivers poor serving performance under high request rates. In this paper, we identify that the advantage of the disaggregated system lies in the disaggregated computation, i.e., partitioning the computational resource to enable the asynchronous computation of two phases. Thus, we propose a novel LLM serving system, semi-PD, characterized by disaggregated computation and unified storage. In semi-PD, we introduce a computation resource controller to achieve disaggregated computation at the streaming multi-processor (SM) level, and a unified memory manager to manage the asynchronous memory access from both phases. semi-PD has a low-overhead resource adjustment mechanism between the two phases, and a service-level objective (SLO) aware dynamic partitioning algorithm to optimize the SLO attainment. Compared to state-of-the-art systems, semi-PD maintains lower latency at higher request rates, reducing the average end-to-end latency per request by 1.27-2.58x on DeepSeek series models, and serves 1.55-1.72x more requests adhering to latency constraints on Llama series models.
Authors' comments: 18 pages, 16 figures


1552. Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection

Brian K. S. Isaac-Medina, Toby P. Breckon

arXiv:2504.18746v1

1553. Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models

Dasol Jeong, Donggoo Kang, Jiwon Park, Hyebean Lee, Joonki Paik

arXiv:2504.15723v2

1554. Non-Uniform Class-Wise Coreset Selection: Characterizing Category Difficulty for Data-Efficient Transfer Learning

Hanyu Zhang, Zhen Xing, Wenxuan Yang, Chenxi Ma, Weimin Tan, Bo Yan

arXiv:2504.13234v1

1555. GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image Classification

Thanh-Dung Le, Vu Nguyen Ha, Ti Ti Nguyen, Geoffrey Eappen, Prabhu Thiruvasagam, Hong-fu Chou, Duc-Dung Tran, Hung Nguyen-Kha et al.

arXiv:2504.12484v1

1556. A 55-nm SRAM Chip Scanning Errors Every 125 ns for Event-Wise Soft Error Measurement

Yuibi Gomi, Akira Sato, Waleed Madany, Kenichi Okada, Satoshi Adachi, Masatoshi Itoh, Masanori Hashimoto

arXiv:2504.08305v1

1557. Exploring a Patch-Wise Approach for Privacy-Preserving Fake ID Detection

Javier Muñoz-Haro, Ruben Tolosana, Ruben Vera-Rodriguez, Aythami Morales, Julian Fierrez

arXiv:2504.07761v1

1558. Datum-wise Transformer for Synthetic Tabular Data Detection in the Wild

G. Charbel N. Kindji, Elisa Fromont, Lina Maria Rojas-Barahona, Tanguy Urvoy

arXiv:2504.08829v1

1559. PatchTrAD: A Patch-Based Transformer focusing on Patch-Wise Reconstruction Error for Time Series Anomaly Detection

Samy-Melwan Vilhes, Gilles Gasso, Mokhtar Z Alaya

arXiv:2504.08827v1

1560. LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections

Rita Sevastjanova, Robin Gerling, Thilo Spinner, Mennatallah El-Assady

arXiv:2504.10504v1
