StructSAM: Structure- and Spectrum-Preserving Token Merging for Segment Anything Models
This paper introduces StructSAM, a novel token merging framework that preserves structural boundaries and spectral properties in Segment Anything Models (SAM) by using gradient-based energy scores and grid-based screening to achieve significant computational savings with minimal accuracy loss across natural and medical imaging benchmarks.
Duy M. H. Nguyen, Tuan A. Tran, Duong Nguyen, Siwei Xie, Trung Q. Nguyen, Mai T. N. Truong, Daniel Palenicek, An T. Le, Michael Barz, TrungTin Nguyen, Tuan Dam, Ngan Le, Minh Vu, Khoa Doan, Vien Ngo, Pengtao Xie, James Zou, Daniel Sonntag, Jan Peters, Mathias NiepertTue, 10 Ma🤖 cs.LG