Pinned
-
rag-from-scratch (Public)
RAG pipeline from scratch: FAISS + Sentence-Transformers + Mistral 7B via Ollama. No LangChain. FastAPI backend with chunking benchmarks, latency profiling, and an LLM abstraction layer.
Python
-
autonomous-ai-agent (Public)
Autonomous ReAct agent built from scratch, with no LangChain. Features a circuit breaker, dual database (ChromaDB + SQLite), Prometheus/Grafana monitoring, and a provider-agnostic LLM layer (Ollama/OpenA…
Python
-
production-rag (Public)
Production RAG system with LangChain, Qdrant, and Docker. Multi-user isolation, 3 retrieval strategies (Similarity/MMR/HyDE), reranking, query rewriting, and an async ingestion worker.
Python 1
-
fine-tuned-with-lora (Public)
LoRA fine-tuning of TinyLlama on coding instructions · MLflow registry · SSE streaming FastAPI · Blue-green Docker deployment · 37% perplexity improvement
Python 1
-
multi-agent-system (Public)
Production multi-agent system: LangGraph supervisor graph, MCP tool server, ChromaDB memory, SQLite observability, FastAPI. Runs free on local Ollama.
Python
-
advanced-multi-object-tracking (Public)
Advanced multi-object tracking pipeline: YOLOv8m + StrongSORT with OSNet appearance embeddings on MOT17. HOTA 41.6 | MOTA 38.1 | IDF1 50.8. Benchmarked against a ByteTrack baseline with TrackEval.
Python 1