GitHub - XplnHUB/Quilix: Stop overpaying for AI. Quillix routes every request to the cheapest model that works — automatically.

 ________  ___  ___  ___  ___       ___     ___    ___ 
|\   __  \|\  \|\  \|\  \|\  \     |\  \   |\  \  /  /|
\ \  \|\  \ \  \\\  \ \  \ \  \    \ \  \  \ \  \/  / /
 \ \  \\\  \ \  \\\  \ \  \ \  \    \ \  \  \ \    / / 
  \ \  \\\  \ \  \\\  \ \  \ \  \____\ \  \  /     \/  
   \ \_____  \ \_______\ \__\ \_______\ \__\/  /\   \  
    \|___| \__\|_______|\|__|\|_______|\|__/__/ /\ __\ 
          \|__|                            |__|/ \|__|

One line. Every LLM provider. Optimized automatically.

Route AI inference across OpenAI, Anthropic, Google, Groq, and more — optimized for cost, latency, or quality. Built by Axion Labs.

Get Started · Documentation · Dashboard · Discord · Status

The Problem

You are paying too much for AI inference. You probably know it.

A GPT-4o call costs $5 per million tokens. The same task on Groq costs $0.27. That is an 18x price difference for identical output quality on most tasks.

But switching providers manually is painful. Different SDKs. Different APIs. Different response formats. Different rate limits. So most teams pick one provider and stay there — burning money they don't need to burn.

Quilix fixes this with a unified routing layer.

What Quilix Does

Quilix sits between your application and every LLM provider. When you send a request, Quilix:

Analyzes the request type and your optimization goal.
Checks real-time cost, latency, and availability across all providers.
Routes to the optimal provider automatically.
Returns the response in a unified, provider-agnostic format.
Logs metrics directly to your dashboard for total transparency.

Optimization Modes

Quilix supports three strategic optimization modes. You can set a global default or override per request.

Mode	Strategy	Use Case
Cost	Routes to the cheapest provider meeting your quality floor	Summarization, classification, embeddings
Latency	Routes to the lowest latency provider in real-time	User-facing chat, real-time products
Quality	Routes to the highest-performing model for the task	Complex reasoning, code generation, analysis

Supported Providers

Quilix connects you to all major LLM ecosystems through a single interface. No separate API keys or billing models required.

Ecosystem	Key Models	Primary Strength
OpenAI	GPT-4o, o1-preview	Advanced reasoning & versatility
Anthropic	Claude 3.5 Sonnet	Coding, safety, & logic
Google	Gemini 1.5 Pro	Context window & multimodal
Groq	Llama 3.1 (70B/8B)	Extreme inference speed
Together AI	Pro-grade Open Source	Fine-tuning & scale

The Dashboard

Gain full visibility into your AI infrastructure at app.quilix.ai.

Key Insights:

Financials: Total spend vs. savings achieved through intelligent routing.
Performance: P50/P99 latency trends across all your providers.
ShadowEval: Continuous quality scoring of model outputs.
Provider Health: Real-time availability and error rate monitoring.

Reliability & Governance

Automatic Failover

If a provider degrades or hits a rate limit, Quilix seamlessly switches to the next best available provider without dropping the request.

Cost Control

Set monthly budgets and hard limits at the API key level to prevent unexpected overages.

Unified Response

Stop rebuilding your parsers. Every response follows a predictive, provider-agnostic schema regardless of the backend model.

Architecture & Privacy

Quilix is built for speed and security. We add less than 15ms of overhead to your requests.

Privacy Policy:

We log metadata only (cost, latency, token counts). We never store or log your prompt content or model outputs. Your data remains completely private between you and the provider.

Enterprise Features

For teams requiring higher security or compliance standards:

VPC Deployment: Run the router inside your own private network.
SAML/SSO: Integrated with your identity provider.
Audit Logs: Full decision-history for compliance teams.
Guaranteed SLAs: 99.9% uptime for production workloads.

Built By

Quilix is maintained by Axion Labs — a venture studio building performance infrastructure for the AI-first world.

License

MIT License — see LICENSE for details.

Quilix · Built by Axion Labs

Website · Dashboard · Docs · Discord · Twitter

If Quilix saves you money, tell a friend. That's our entire marketing strategy.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
api		api
docs		docs
examples		examples
sdk/python		sdk/python
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
quilix.db		quilix.db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Problem

What Quilix Does

Optimization Modes

Supported Providers

The Dashboard

Key Insights:

Reliability & Governance

Automatic Failover

Cost Control

Unified Response

Architecture & Privacy

Privacy Policy:

Enterprise Features

Built By

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

The Problem

What Quilix Does

Optimization Modes

Supported Providers

The Dashboard

Key Insights:

Reliability & Governance

Automatic Failover

Cost Control

Unified Response

Architecture & Privacy

Privacy Policy:

Enterprise Features

Built By

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages