benty-fields - Search paper

1961. State-space Models with Layer-wise Nonlinearity are Universal Approximators with Exponential Decaying Memory

Shida Wang, Beichen Xue

arXiv:2309.13414v2

1962. SPION: Layer-Wise Sparse Training of Transformer via Convolutional Flood Filling

Bokyeong Yoon, Yoonsang Han, Gordon Euhyun Moon

arXiv:2309.12578v1

1963. Activation Compression of Graph Neural Networks using Block-wise Quantization with Improved Variance Minimization

Sebastian Eliassen, Raghavendra Selvan

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024)

arXiv:2309.11856v2

1964. EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise Optimization

Ofir Gordon, Elad Cohen, Hai Victor Habi, Arnon Netzer

arXiv:2309.11531v2

1965. PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

arXiv:2309.10400v3

1966. Maximum-likelihood fits of piece-wise Pareto distributions with finite and non-zero core

Benjamin F. Maier

arXiv:2309.09589v1

1967. LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data

Shaocong Xu, Pengfei Li, Qianpu Sun, Xinyu Liu, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang et al.

arXiv:2309.10230v3

1968. MHLAT: Multi-hop Label-wise Attention Model for Automatic ICD Coding

Junwen Duan, Han Jiang, Ying Yu

arXiv:2309.08868v1

1969. DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input

Nicolas Jonason, Xin Wang, Erica Cooper, Lauri Juvela, Bob L. T. Sturm, Junichi Yamagishi

arXiv:2309.07658v1

1970. iHAS: Instance-wise Hierarchical Architecture Search for Deep Learning Recommendation Models

Yakun Yu, Shi-ang Qi, Jiuding Yang, Liyao Jiang, Di Niu

arXiv:2309.07967v1

1971. Self-supervised Extraction of Human Motion Structures via Frame-wise Discrete Features

Tetsuya Abe, Ryusuke Sagawa, Ko Ayusawa, Wataru Takano

arXiv:2309.05972v1

1972. Towards Federated Learning Under Resource Constraints via Layer-wise Training and Depth Dropout

Pengfei Guo, Warren Richard Morningstar, Raviteja Vemulapalli, Karan Singhal, Vishal M. Patel, Philip Andrew Mansfield

arXiv:2309.05213v1

1973. SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition

Shaojie Zhang, Jianqin Yin, Yonghao Dang, Jiajun Fu

arXiv:2308.16018v4

1974. Textual and Visual Prompt Fusion for Image Editing via Step-Wise Alignment

Zhanbo Feng, Zenan Ling, Xinyu Lu, Ci Gong, Feng Zhou, Wugedele Bao, Jie Li, Fan Yang et al.

arXiv:2308.15854v3

1975. Multi-agent Coordination Under Temporal Logic Tasks and Team-Wise Intermittent Communication

Junjie Wang, Meng Guo, Zhongkui Li

arXiv:2308.14042v2

1976. Mesh-Wise Prediction of Demographic Composition from Satellite Images Using Multi-Head Convolutional Neural Network

Yuta Sato

arXiv:2308.13441v1

1977. Learn With Imagination: Safe Set Guided State-wise Constrained Policy Optimization

Yifan Sun, Feihan Li, Weiye Zhao, Rui Chen, Tianhao Wei, Changliu Liu

arXiv:2308.13140v5

1978. Addressing Selection Bias in Computerized Adaptive Testing: A User-Wise Aggregate Influence Function Approach

Soonwoo Kwon, Sojung Kim, Seunghyun Lee, Jin-Young Kim, Suyeong An, Kyuseok Kim

arXiv:2308.11912v1

1979. Efficient and Flexible Neural Network Training through Layer-wise Feedback Propagation

Leander Weber, Jim Berend, Moritz Weckbecker, Alexander Binder, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

arXiv:2308.12053v2

Gradient-based optimization has been a cornerstone of machine learning enabling the vast advances of AI development over the past decades. However, since this type of optimization requires differentiation, it reduces flexibility in the choice of model and objective. With recent evidence of the benefits of non-differentiable (e.g. neuromorphic) architectures over classical models, such constraints can become limiting in the future. We present Layer-wise Feedback Propagation (LFP), a novel training principle for neural network-like predictors utilizing methods from the domain of explainability to decompose a reward to individual neurons based on their respective contributions to solving a given task without imposing any differentiability requirements. Leveraging these neuron-wise rewards, our method then implements a greedy approach reinforcing helpful parts of the network and weakening harmful ones. While having comparable computational complexity to gradient descent, LFP offers the advantage that it obtains sparse models due to an implicit weight scaling. We establish the convergence of LFP theoretically and empirically, demonstrating its effectiveness on various models and datasets. We further investigate two applications for LFP: Firstly, neural network pruning, and secondly, the optimization of neuromorphic architectures such as Heaviside step function activated Spiking Neural Networks (SNNs). In the first setting, LFP naturally generates sparse models that are easily prunable and thus efficiently encode and compute information. In the second setting, LFP achieves comparable performance to surrogate gradient descent, but provides approximation-free training, which eases the implementation on neuromorphic hardware. Consequently, LFP combines efficiency in terms of computation and representation with flexibility w.r.t. model architecture and objective function. Our code is available.

1980. Semi-blind-trace algorithm for self-supervised attenuation of trace-wise coherent noise

Mohammad Mahdi Abedi, David Pardo, Tariq Alkhalifah

Geophysical Prospecting, 72, 965-977 (2023)

arXiv:2308.10772v2
