Pranav Thombare pranavthombare

👋 Hey, I'm Pranav Thombare

🚀 Machine Learning Engineer | Systems Builder | LLM Infra

I build high-performance AI systems that actually run in production.

From optimizing LLM inference pipelines to working deep in operating systems, I enjoy solving problems across the entire stack — from kernels to large language models.

🧠 What I Do

⚡ Optimize LLM/VLM systems for latency, throughput, and scale
🏗 Build end-to-end AI pipelines (RAG, agents, training systems)
🔍 Design intelligent systems that replace manual workflows
🧩 Work across systems: OS → backend → ML → infra

🚀 Current Focus

🧠 LLM inference optimization (TensorRT-LLM, vLLM, SGLang)
🔗 Retrieval-Augmented Generation (RAG) systems
🤖 Agentic workflows using LangGraph
⚙️ Distributed systems & Kubernetes-based deployments

🛠 Tech Stack

Languages
Python C++ Rust

ML / AI
PyTorch TensorRT-LLM Triton vLLM SGLang

Systems & Infra
Kubernetes Docker AWS GCP

Other
AOSP Linux Kernel RAG Pipelines LoRA Quantization

🏗 Notable Work

⚡ Improved LLM performance by 60%+ using speculative decoding & KV cache optimization
📄 Built a VLM-based document parsing system (98%+ accuracy)
🤖 Developed autonomous agents for processing real-world business workflows
🧪 Built RAG pipelines to generate automated integration tests from codebases
📱 Former Android OS engineer working on kernel, SELinux & device security

🤝 Open to Collaborate

🐧 Linux Kernel / systems programming
🤖 LLM / ML infrastructure
🛠 Developer tools & infra-heavy projects

📫 Reach Me

📧 Email: check profile
🌐 LinkedIn
💻 GitHub
📱 Telegram / Instagram / Unsplash: @pranavthombare

⚡ Fun Facts

🥋 I can use nunchucks
🏔️ Trekked to Everest Base Camp

🧭 Philosophy

Build things that are not just impressive — but useful, scalable, and real.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pranav Thombare pranavthombare

Achievements