benty-fields - Search paper

Continual learning aims to emulate the human ability to continually accumulate knowledge over sequential tasks. The main challenge is to maintain performance on previously learned tasks after learning new tasks, i.e., to avoid catastrophic forgetting. We propose a Channel-wise Lightweight Reprogramming (CLR) approach that helps convolutional neural networks (CNNs) overcome catastrophic forgetting during continual learning. We show that a CNN model trained on an old task (or self-supervised proxy task) could be ``reprogrammed" to solve a new task by using our proposed lightweight (very cheap) reprogramming parameter. With the help of CLR, we have a better stability-plasticity trade-off to solve continual learning problems: To maintain stability and retain previous task ability, we use a common task-agnostic immutable part as the shared ``anchor" parameter set. We then add task-specific lightweight reprogramming parameters to reinterpret the outputs of the immutable parts, to enable plasticity and integrate new knowledge. To learn sequential tasks, we only train the lightweight reprogramming parameters to learn each new task. Reprogramming parameters are task-specific and exclusive to each task, which makes our method immune to catastrophic forgetting. To minimize the parameter requirement of reprogramming to learn new tasks, we make reprogramming lightweight by only adjusting essential kernels and learning channel-wise linear mappings from anchor parameters to task-specific domain knowledge. We show that, for general CNNs, the CLR parameter increase is less than 0.6\% for any new task. Our method outperforms 13 state-of-the-art continual learning baselines on a new challenging sequence of 53 image classification datasets. Code and data are available at https://github.com/gyhandy/Channel-wise-Lightweight-Reprogramming
Authors' comments: ICCV 2023

Vote

Add to Library

Recommend

580. Communication-Efficient Split Learning via Adaptive Feature-Wise Compression

Yongjeong Oh, Jaeho Lee, Christopher G. Brinton, Yo-Seb Jeon

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2307.10805v2

Vote

Add to Library

Recommend

Benty-search

561. Learning to Transform for Generalizable Instance-wise Invariance

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2309.16672v3

562. Byzantine-Resilient Federated PCA and Low Rank Column-wise Sensing

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2309.14512v2

563. Size and Albedo Constraints for (152830) Dinkinesh Using WISE Data

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2309.13158v1

564. Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2309.13150v1

565. An Element-wise RSAV Algorithm for Unconstrained Optimization Problems

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2309.04013v1

566. Layer-wise training for self-supervised learning on graphs

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2309.01503v1

567. On the hardness of inclusion-wise minimal separators enumeration

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.15444v1

568. MMBAttn: Max-Mean and Bit-wise Attention for CTR Prediction

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.13187v1

569. HumanLiff: Layer-wise 3D Human Generation with Diffusion Model

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.09712v1

570. Component-wise dimensionally reduced flows with local models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2311.11938v1

571. Block-Wise Encryption for Reliable Vision Transformer models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.07612v1

572. Channel-Wise Contrastive Learning for Learning with Noisy Labels

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.06952v1

573. Learning Fine-Grained Features for Pixel-wise Video Correspondences

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.03040v1

574. Music De-limiter Networks via Sample-wise Gain Inversion

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.01187v1

575. Copula for Instance-wise Feature Selection and Ranking

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.00549v1

576. Patch-wise Auto-Encoder for Visual Anomaly Detection

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2308.00429v2

577. Instance-Wise Adaptive Tuning and Caching for Vision-Language Models

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2307.15983v1

578. Patch-Wise Point Cloud Generation: A Divide-and-Conquer Approach

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2307.12049v1

579. CLR: Channel-wise Lightweight Reprogramming for Continual Learning

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2307.11386v1

580. Communication-Efficient Split Learning via Adaptive Feature-Wise Compression

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 2307.10805v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2309.16672v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2309.14512v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2309.13158v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2309.13150v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2309.04013v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2309.01503v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.15444v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.13187v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.09712v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2311.11938v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.07612v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.06952v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.03040v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.01187v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.00549v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2308.00429v2

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2307.15983v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2307.12049v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2307.11386v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 2307.10805v2