Skip to content

XplnHUB/Quilix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation



 ________  ___  ___  ___  ___       ___     ___    ___ 
|\   __  \|\  \|\  \|\  \|\  \     |\  \   |\  \  /  /|
\ \  \|\  \ \  \\\  \ \  \ \  \    \ \  \  \ \  \/  / /
 \ \  \\\  \ \  \\\  \ \  \ \  \    \ \  \  \ \    / / 
  \ \  \\\  \ \  \\\  \ \  \ \  \____\ \  \  /     \/  
   \ \_____  \ \_______\ \__\ \_______\ \__\/  /\   \  
    \|___| \__\|_______|\|__|\|_______|\|__/__/ /\ __\ 
          \|__|                            |__|/ \|__| 
                                                       
                                                   

One line. Every LLM provider. Optimized automatically.

Route AI inference across OpenAI, Anthropic, Google, Groq, and more — optimized for cost, latency, or quality. Built by Axion Labs.


Get Started · Documentation · Dashboard · Discord · Status


The Problem

You are paying too much for AI inference. You probably know it.

A GPT-4o call costs $5 per million tokens. The same task on Groq costs $0.27. That is an 18x price difference for identical output quality on most tasks.

But switching providers manually is painful. Different SDKs. Different APIs. Different response formats. Different rate limits. So most teams pick one provider and stay there — burning money they don't need to burn.

Quilix fixes this with a unified routing layer.


What Quilix Does

Quilix sits between your application and every LLM provider. When you send a request, Quilix:

  1. Analyzes the request type and your optimization goal.
  2. Checks real-time cost, latency, and availability across all providers.
  3. Routes to the optimal provider automatically.
  4. Returns the response in a unified, provider-agnostic format.
  5. Logs metrics directly to your dashboard for total transparency.

Optimization Modes

Quilix supports three strategic optimization modes. You can set a global default or override per request.

Mode Strategy Use Case
Cost Routes to the cheapest provider meeting your quality floor Summarization, classification, embeddings
Latency Routes to the lowest latency provider in real-time User-facing chat, real-time products
Quality Routes to the highest-performing model for the task Complex reasoning, code generation, analysis

Supported Providers

Quilix connects you to all major LLM ecosystems through a single interface. No separate API keys or billing models required.

Ecosystem Key Models Primary Strength
OpenAI GPT-4o, o1-preview Advanced reasoning & versatility
Anthropic Claude 3.5 Sonnet Coding, safety, & logic
Google Gemini 1.5 Pro Context window & multimodal
Groq Llama 3.1 (70B/8B) Extreme inference speed
Together AI Pro-grade Open Source Fine-tuning & scale

The Dashboard

Gain full visibility into your AI infrastructure at app.quilix.ai.

Key Insights:

  • Financials: Total spend vs. savings achieved through intelligent routing.
  • Performance: P50/P99 latency trends across all your providers.
  • ShadowEval: Continuous quality scoring of model outputs.
  • Provider Health: Real-time availability and error rate monitoring.

Reliability & Governance

Automatic Failover

If a provider degrades or hits a rate limit, Quilix seamlessly switches to the next best available provider without dropping the request.

Cost Control

Set monthly budgets and hard limits at the API key level to prevent unexpected overages.

Unified Response

Stop rebuilding your parsers. Every response follows a predictive, provider-agnostic schema regardless of the backend model.


Architecture & Privacy

Quilix is built for speed and security. We add less than 15ms of overhead to your requests.

Privacy Policy:

We log metadata only (cost, latency, token counts). We never store or log your prompt content or model outputs. Your data remains completely private between you and the provider.


Enterprise Features

For teams requiring higher security or compliance standards:

  • VPC Deployment: Run the router inside your own private network.
  • SAML/SSO: Integrated with your identity provider.
  • Audit Logs: Full decision-history for compliance teams.
  • Guaranteed SLAs: 99.9% uptime for production workloads.

Built By

Quilix is maintained by Axion Labs — a venture studio building performance infrastructure for the AI-first world.


License

MIT License — see LICENSE for details.


Quilix · Built by Axion Labs

Website · Dashboard · Docs · Discord · Twitter

If Quilix saves you money, tell a friend. That's our entire marketing strategy.

About

Stop overpaying for AI. Quillix routes every request to the cheapest model that works — automatically.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages