🏠 Working from home
GPU dev
- Mumbai
- 06:02 (UTC +05:30)
- in/shlok-l-50180120b
- @shlok_fx
- https://leetcode.com/u/Shlok_Fx/
Pinned
- 100-days-cuda (Public): This repository documents my 100-day journey of learning and writing CUDA kernels.
- SageAttention (Public, forked from thu-ml/SageAttention): [ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models. Language: Cuda
- ThunderKittens (Public, forked from HazyResearch/ThunderKittens): Tile primitives for speedy kernels. Language: Cuda




