Explainers
1.68x Faster Diffusion on Blackwell with NVFP4 [Benchmarks]
Blackwell's NVFP4 format just turned diffusion models into speed demonsβ1.68x faster on Flux.1-Dev. But is this the quantization silver bullet, or just NVIDIA's latest flex?