SAD

Soft Anisotropic Diagrams for Differentiable Image Representation

Laki Iinbor1, Zhiyang Dou2,*, Wojciech Matusik2,*

1Independent Researcher   2MIT   * Joint Last Author

ACM Transactions on Graphics (SIGGRAPH 2026)

SAD teaser comparison

TL;DR

SAD represents an image as a soft, anisotropic, differentiable diagram over learnable sites. Each pixel is a softmax blend over its top-K nearby sites under a site-dependent distance, yielding a differentiable partition of unity with explicit ownership and content-aligned boundaries. A GPU-friendly top-K propagation scheme keeps cost constant per pixel, enabling fast fitting at matched or better quality.

Fitting Process

Each clip is a triple: RGB reconstruction, diagram view, and tau-position heatmap. It shows how sites migrate, temperatures sharpen, and the representation converges to content-aligned regions during optimization.

Method

SAD method diagram

We scatter many small sites across the image. Each site has a position, a color, a reach radius, an oriented anisotropic shape, and a temperature that controls boundary sharpness.

\[ s_i(\mathbf{x}) = d_{\mathbf{A}_i}\!\left(\mathbf{x}, \mathbf{p}_i\right) - r_i \]

The pixel color is a soft blend of nearby sites. The temperature \(\tau_i\) controls how abruptly ownership changes between adjacent sites.

\[ \hat{\mathbf{c}}(\mathbf{x}) = \sum_{i \in \mathcal{N}_K(\mathbf{x})} w_i(\mathbf{x})\,\mathbf{c}_i \] \[ w_i(\mathbf{x}) = \frac{\exp\!\left(-\tau_i s_i(\mathbf{x})\right)} {\sum_{j \in \mathcal{N}_K(\mathbf{x})} \exp\!\left(-\tau_j s_j(\mathbf{x})\right)} \]

Because each pixel only depends on a fixed-size nearby-site list, both rendering and fitting remain GPU-friendly.

Comparisons

Reconstruction quality on the Image-GS benchmark. Average metrics over 45 images at varying bitrates.
MethodMetric0.2 BPP0.3 BPP0.4 BPP0.5 BPP
Image-GSPSNR up31.3232.7933.8034.57
SSIM up0.89230.91120.92280.9307
LPIPS down0.13090.10330.08730.0769
Instant-NGPPSNR up26.6629.4129.8630.69
SSIM up0.77030.82530.83040.8461
LPIPS down0.24720.17010.16560.1463
SADPSNR up33.8735.7236.9737.86
SSIM up0.89830.92020.93340.9422
LPIPS down0.09140.06780.05460.0458
Image compression on Kodak. Average reconstruction quality and training time over 24 images at N = 50,000 primitives.
MethodPSNR upSSIM upLPIPS downTime (s) down
Image-GS36.900.95210.027228
Instant-NGP37.720.94940.02498.2
Fast 2DGS43.13--10
SAD46.000.98710.00322.2
Reconstruction quality on DIV2K. Average metrics over 100 images at varying bitrates and variable resolution.
MethodMetric0.5 BPP2.0 BPP
Image-GSPSNR up28.4832.15
SSIM up0.79140.8820
LPIPS down0.25150.1480
Instant-NGPPSNR up26.4429.24
SSIM up0.70450.7940
LPIPS down0.27780.1755
VBNFPSNR up27.1331.28
SSIM up0.74950.8737
LPIPS down0.43210.2765
Instant-GIPSNR up-38.01
Fast 2DGSPSNR up-37.81
SADPSNR up30.0034.73
SSIM up0.79820.9115
LPIPS down0.19950.0844
Reconstruction quality on CLIC. Average metrics over 41 images at varying bitrates.
MethodMetric0.5 BPP2.0 BPP
Image-GSPSNR up30.6534.15
SSIM up0.82230.8907
LPIPS down0.22800.1449
Instant-NGPPSNR up28.5932.67
SSIM up0.75590.8475
LPIPS down0.23510.1287
SADPSNR up31.8236.13
SSIM up0.81760.9112
LPIPS down0.18700.0884

BibTeX

@article{iinbor2026sad,
  title   = {Soft Anisotropic Diagrams for Differentiable Image Representation},
  author  = {Iinbor, Laki and Dou, Zhiyang and Matusik, Wojciech},
  journal = {ACM Transactions on Graphics},
  year    = {2026},
  note    = {SIGGRAPH 2026}
}