LegoNet: Memory Footprint Reduction Through Block Weight Clustering
LegoNet is a post-training compression technique that clusters 4x4 weight blocks across entire neural network architectures to achieve memory footprint reductions of up to 128x with negligible accuracy loss, without requiring any retraining or architectural modifications.