NVFP4: cache GEMM-swizzled weight scale factors across micro-batches#3093
Open
cael-ling wants to merge 2 commits into
Open
NVFP4: cache GEMM-swizzled weight scale factors across micro-batches#3093cael-ling wants to merge 2 commits into
cael-ling wants to merge 2 commits into