Skip to content

NVFP4: cache GEMM-swizzled weight scale factors across micro-batches#3093

Open
cael-ling wants to merge 2 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-weight-swizzle-cache
Open

NVFP4: cache GEMM-swizzled weight scale factors across micro-batches#3093
cael-ling wants to merge 2 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-weight-swizzle-cache

Commits

Commits on Jun 5, 2026