GitHub - WingedGuardian/GENesis-AGI: 'Personal AGI' that thinks on its own. Autonomous cognitive cycle, earned autonomy, 60+ tools. It decides what to do without being told.

Truthfully, no, I do not consider this to be "true" AGI. Getting to something resembling "true" AGI would mean building from first principles, with the orchestration (which IS Genesis) built into the LLM layer, the most foundational part of Genesis's compute stack. Nor am I convinced that LLMs are the right architecture for this pursuit in the first place. But because I cannot change the LLM layer, and no better technology currently exists, this is the best I can do today. Call it "proto-AGI," or even "pseudo-AGI."

But what I can tell you is this: Genesis is far closer to AGI than anything else I've seen, and even if it's not AGI from first principles, it mimics a lot of the same outcomes and behaviours and capabilities that AGI would presumably need to exhibit. If AGI sounds like something you'd like to help contribute to, come build with us →

Genesis in 30 seconds

Personal AI does what you tell it. Personal AGI does what you need.

Your current AI, however capable, is reactive, stateless, and session-scoped. You direct it. You re-explain context every time. It's as ignorant about you on day 100 as on day 1. It's a tool: smart in the moment, dumb about you.

Genesis is different:

It knows what to do without being told — because it's accumulated months of context about your projects, your decisions, your patterns. You never re-explain yourself.
It acts between sessions — thinking, researching, monitoring, surfacing what matters while you're not there. Not waiting for a prompt.
It has judgment that improves with evidence — earned autonomy per action category. First failure triggers demotion. Trust is rebuilt through performance, not time.
It's a fundamentally different system at month 6 than month 1 — not because of updates you installed, but because of what it learned living alongside you.

With personal AI, you are the intelligence directing the tool. With personal AGI, it is an intelligence working alongside you—and the amount you need to manage shrinks over time.

How it does this:

Autonomous cognitive cycle — a dual-ego architecture that wakes up, assesses the situation, proposes actions, and earns the right to execute them. Not waiting for prompts. Not following scripts. Thinking.
Massive capability layer — 60+ tools out of the box: browser automation, outreach pipeline, research, content creation, infrastructure monitoring, campaign execution. Not a framework you build on—a system that works, out of the box.
4-layer memory — essential knowledge, proactive recall, deep search, knowledge pipeline. Hybrid retrieval that compounds across months, not conversations.
Closed-loop learning — outcome classification, causal attribution, procedure extraction. Laplace-smoothed confidence, not vibes.
Earned autonomy — trust granted per action category through demonstrated competence. The system gets measurably better at its job, and it can show you the receipts.

Day 1 — a strong generalist with full cognitive infrastructure. Day 30 — a personalized specialist in every domain you've touched. Day 90 — anticipating needs you haven't articulated yet. Day 180 — evolving its own architecture to serve you better.

Get started →

What this is

Genesis is a cognitive architecture that makes the AGI claim explicitly—and backs it up with 150,000 lines of open-source code you can read, run, and challenge.

Not a chatbot. Not an API wrapper. Not another prompt chain with a for loop.

It uses Claude Code as its reasoning engine. Genesis is what it's been missing: the mind that remembers, reflects, learns, and decides.

"Claude Code already had the brain. We gave it the heart."

50+ subsystems. 4 MCP servers. 2 vector databases. Every design decision made by one engineer working full-stack across infrastructure, cognition, and integration layers. That's the point. If one developer with the right cognitive infrastructure can build and run a system this complex, imagine what a team becomes capable of.

View interactive diagram →

Getting started

System requirements

Genesis is a full system, not a pip package. It runs best on a dedicated Linux machine.

Resource	Minimum	Recommended	Notes
OS	Ubuntu 22.04+	Ubuntu 24.04 LTS	Debian-based required for auto-install. Other Linux works with manual setup.
RAM	8 GB	16 GB+	Genesis + Qdrant + Claude Code + background tasks. 8 GB is tight under load.
Disk	10 GB	40 GB+	Fresh install ~400 MB. Memory, logs, and caches grow steadily with use.
CPU	2 cores	4-8 cores	Concurrent background tasks benefit from parallelism.
Network	Internet access	Always-on	Cloud LLM APIs required. Offline not supported.

These are the requirements for the host VM. Genesis runs inside a container the installer creates.

Before you start

What you need	Why	Where
Claude account	Claude Code powers all reasoning and agentic sessions	claude.ai
Tailscale (free)	Remote dashboard access from any device — no port-forwarding	tailscale.com

Install

One script sets up the entire infrastructure: Incus container, Guardian health monitor, bidirectional SSH, all dependencies.

git clone https://github.com/WingedGuardian/GENesis-AGI.git ~/genesis-setup
cd ~/genesis-setup
./scripts/host-setup.sh

After install:

genesis   # shortcut alias the installer adds
cd ~/genesis
claude    # start your first session

What you get:

Component	What it does
Full cognitive stack	Memory (4-layer hybrid retrieval + knowledge graph), self-learning loop, reflection engine, earned autonomy, dual-ego decision layer—all running continuously
Genesis server	Dashboard, API, and all subsystems at `http://<container-ip>:5000`
Qdrant	Vector database powering semantic memory (2 collections: episodic + knowledge)
Channel integration	Telegram (proactive outreach, approvals, voice), email triage, browser automation, inbox monitoring
Background cognition	Autonomous sessions that think, research, and audit while you're away—surplus compute, reflection cycles, goal tracking
Self-healing infrastructure	Guardian (host VM) + Sentinel (container)—two independent systems monitoring each other in a closed loop
Claude Code	CLI with Genesis hooks + 4 MCP servers auto-activated per session

Optional: local embedding

Component	Install
Ollama	`curl -fsSL https://ollama.com/install.sh | sh`
LM Studio	Download from lmstudio.ai

Without these, Genesis uses cloud embedding APIs. With them: private, faster, free.

Your Genesis

Your Genesis install is one operational system: the public GENesis-AGI codebase, your private fork for customizations, and your private encrypted backups repo. See .claude/docs/your-genesis.md for the full model.

Backup — runs every 6h via cron. SQLite, Qdrant snapshots, memory, transcripts, secrets—GPG-encrypted before push.
Restore — git clone <your-fork> → scripts/bootstrap.sh → scripts/restore.sh. Back in minutes.
Contribute — a distributed bug fixing pipeline automatically detects eligible fixes, pushes them to GitHub for inspection, and opens upstream PRs.

How it thinks

Four cognitive layers, running continuously:

graph TB
    subgraph "Cognitive architecture"
        EGO["Ego<br/><i>Two egos: signal-driven focus,<br/>goal tracking, autonomous action</i>"]
        AL["Awareness loop<br/><i>5-min tick, 18+ signals,<br/>zero LLM cost</i>"]
        RE["Reflection engine<br/><i>Micro / Light / Deep / Strategic<br/>with relevance tagging</i>"]
        SL["Self-learning loop<br/><i>Dopaminergic feedback</i>"]
    end

    subgraph "Infrastructure"
        RT["Operational runtime<br/><i>Dashboard, API, extensions</i>"]
        CC["Claude Code<br/><i>Reasoning, tools, sessions</i>"]
    end

    subgraph "Memory and data"
        QD["Qdrant<br/><i>2 vector collections</i>"]
        SQ["SQLite + FTS5"]
        MCP["4 MCP servers<br/><i>memory / recon / health / outreach</i>"]
    end

    EGO -->|"dispatches work"| CC
    AL -->|"depth signal"| RE
    RE -->|"observations"| EGO
    RE -->|"interaction data"| SL
    SL -->|"weight updates"| AL

    AL <--> RT
    RE <--> MCP
    MCP <--> QD
    MCP <--> SQ

    style EGO fill:#1a1a2e,stroke:#e94560,color:#fff
    style AL fill:#1a1a2e,stroke:#e94560,color:#fff
    style RE fill:#1a1a2e,stroke:#0f3460,color:#fff
    style SL fill:#1a1a2e,stroke:#533483,color:#fff

Every 5 minutes, the system collects 18+ signals across all its inputs—entirely programmatic, zero LLM cost. Signals get classified by how much thinking depth they warrant. Routine health checks get a quick pass. Novel patterns in user behavior get a deep analysis. Accumulated smaller reflections trigger strategic synthesis. The depth decision is automatic, and each cognitive layer feeds the next.

On top of this sits the ego layer: two autonomous decision-makers that read the system's observations and act on them. The User Ego (running Opus) focuses on user goals, activity patterns, and pending work. The Genesis Ego (running Sonnet) handles system health, infrastructure, and operational decisions. Each one assembles its own context from filtered observations, proposes actions via Telegram, and dispatches Claude Code sessions to execute approved work. They run on adaptive cadence—more frequently when things are active, backing off when they're not.

The ego doesn't just observe—it runs a unified cognitive loop. Signals (a stale goal, a conversation, a system event) enter a queue. A focus selector picks what matters most. Context gets assembled for that focus. The ego thinks, proposes, acts—cycle repeats. What you experience: Genesis notices when your goals go stale, reviews them with full context, tells you when subgoals complete a milestone, and adjusts its own review frequency per goal. It doesn't wait for you to ask "how's that project going?"—it already checked.
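The queue and focus selector described above can be pictured as a priority queue. This is a minimal sketch, assuming each signal carries a numeric priority; the `Signal` fields, the priority values, and the `select_focus` name are illustrative and not taken from the Genesis codebase.

```python
import heapq
import itertools
from dataclasses import dataclass, field
from typing import Optional


@dataclass(order=True)
class Signal:
    # heapq is a min-heap, so the priority is stored negated to pop the most urgent first.
    sort_key: float
    seq: int
    kind: str = field(compare=False)
    payload: str = field(compare=False)


class SignalQueue:
    def __init__(self) -> None:
        self._heap = []
        self._counter = itertools.count()  # tie-breaker keeps insertion order stable

    def push(self, kind: str, payload: str, priority: float) -> None:
        heapq.heappush(self._heap, Signal(-priority, next(self._counter), kind, payload))

    def select_focus(self) -> Optional[Signal]:
        """Pop the highest-priority signal, or None when the queue is empty."""
        return heapq.heappop(self._heap) if self._heap else None


queue = SignalQueue()
queue.push("system_event", "disk usage at 70%", priority=0.3)
queue.push("stale_goal", "goal not reviewed in 14 days", priority=0.8)
queue.push("conversation", "user asked about deploy status", priority=0.6)

focus = queue.select_focus()
print(focus.kind)  # stale_goal
```

In a real loop, context assembly and the think/propose/act steps would run on the selected signal before the next call to `select_focus`.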

When Genesis isn't handling a user request, it doesn't sit idle. It researches topics you'll ask about tomorrow. It audits its own memory for contradictions and staleness. It tests whether its learned procedures still hold up. It works through problems it got stuck on earlier. The system you come back to on Monday is measurably sharper than the one you left on Friday.

Memory

Most AI memory is a vector database with a retrieval function. Genesis runs a four-layer architecture—because "what are we working on?", "what's relevant right now?", "find everything about X", and "what does the external documentation say?" are fundamentally different operations that need different retrieval strategies.

L1: Essential Knowledge (~300 tokens, injected at every session start)
    Pure DB queries. No LLM, no network, no latency.
    Content: active context, recent decisions, structural overview.
    → The forest view. Always available, even if everything else is down.

L2: Proactive Recall (fires on every user message, <1.5s)
    Surfaces the top 3 relevant memories automatically—before Genesis
    even starts thinking about your question.
    Hybrid: FTS5 keyword + Qdrant vector + activation scoring → RRF fusion.
    → You never have to ask "do you remember?"—it already checked.

L3: Deep Search (on-demand, ~1-2s)
    Full pipeline: 4 ranked signals fused via Reciprocal Rank Fusion.
    Wing/room filtering, intent classification, graph traversal.
    → When you need everything Genesis knows about a topic.

L4: Knowledge Pipeline (external, permanent)
    Ingests from text, PDF, audio, video, web pages, YouTube transcripts.
    Separate vector collection. Idempotent — re-ingesting updates, never duplicates.
    → Domain knowledge that doesn't decay with time.

LLMs lose the forest for the trees—that's a known weakness of large-context reasoning. The layer model is the architectural compensation: L1 maintains the forest (what are we doing, what have we decided, what matters), while L2-L3 drill into specific trees on demand. L4 provides the reference library.
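The idempotent ingestion that L4 describes is commonly achieved by deriving a stable ID from the source and upserting on it. A minimal sketch under that assumption: the namespace URL, the `ingest` helper, and the in-memory `store` dict are hypothetical stand-ins for the real vector collection.

```python
import hashlib
import uuid

# Hypothetical namespace for deriving stable document IDs.
KNOWLEDGE_NAMESPACE = uuid.uuid5(uuid.NAMESPACE_URL, "https://example.invalid/knowledge")


def document_id(source_uri: str) -> str:
    """The same source always maps to the same ID, so re-ingesting overwrites."""
    return str(uuid.uuid5(KNOWLEDGE_NAMESPACE, source_uri))


def ingest(store: dict, source_uri: str, text: str) -> str:
    """Upsert a document; returns 'created', 'updated', or 'unchanged'."""
    doc_id = document_id(source_uri)
    digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
    existing = store.get(doc_id)
    if existing is None:
        status = "created"
    elif existing["sha256"] == digest:
        return "unchanged"
    else:
        status = "updated"
    store[doc_id] = {"source": source_uri, "text": text, "sha256": digest}
    return status


store = {}
print(ingest(store, "https://example.com/guide.pdf", "v1 contents"))  # created
print(ingest(store, "https://example.com/guide.pdf", "v1 contents"))  # unchanged
print(ingest(store, "https://example.com/guide.pdf", "v2 contents"))  # updated
print(len(store))  # 1
```

Keying on the source rather than on the content is what lets a changed document replace its earlier version instead of sitting beside it.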

What actually happens when you send a message:

Every prompt triggers L2 before Genesis starts reasoning about your question:

1. Your message hits a pre-processing hook.
2. In parallel: FTS5 searches for exact keyword matches (~5ms) and Qdrant searches for semantic similarity (1024-dim embeddings, ~400ms).
3. Activation scoring weighs each candidate: confidence × recency × (access_freq + connectivity) × class_weight.
4. Reciprocal Rank Fusion combines the ranked lists; memories appearing in multiple signals accumulate score.
5. The top 3 results are injected into context with provenance metadata.

Total budget: under 1.5 seconds. If the embedding provider is down, vector search is skipped—FTS5 still works because it's compiled into SQLite with zero external dependencies. Memory degrades gracefully, never goes dark.
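Reciprocal Rank Fusion and the skip-on-failure behavior can be sketched in a few lines. This is an illustration, not the Genesis implementation: the constant `k = 60` is the conventional RRF default (the value Genesis uses is not stated), the memory IDs are made up, and treating activation scoring as a third ranked list is an assumption.

```python
from collections import defaultdict

RRF_K = 60  # conventional RRF constant; assumed here


def rrf_fuse(ranked_lists, top_n=3, k=RRF_K):
    """Fuse ranked ID lists; an item's score is the sum of 1 / (k + rank) over lists."""
    scores = defaultdict(float)
    for ranking in ranked_lists:
        for rank, memory_id in enumerate(ranking, start=1):
            scores[memory_id] += 1.0 / (k + rank)
    # Sort by score descending, then by ID so ties are deterministic.
    ordered = sorted(scores, key=lambda m: (-scores[m], m))
    return ordered[:top_n]


def recall(fts_hits, vector_hits, activation_hits):
    """vector_hits is None when the embedding provider is down; that list is skipped."""
    lists = [hits for hits in (fts_hits, vector_hits, activation_hits) if hits]
    return rrf_fuse(lists)


fts = ["m7", "m2", "m9"]
vec = ["m2", "m4", "m7"]
act = ["m2", "m9", "m1"]

print(recall(fts, vec, act))   # ['m2', 'm7', 'm9']
print(recall(fts, None, act))  # ['m2', 'm9', 'm7']
```

Because RRF only needs ranks, dropping one list changes the ordering but never breaks the fusion, which is why keyword-only retrieval can keep serving results.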

Not just documents in a vector space:

Memories aren't isolated documents—they're connected. The knowledge graph creates typed links between memories across 12 edge types: supports, contradicts, extends, elaborates, succeeded_by, preceded_by, and more. When a memory is stored, auto-linking finds its nearest neighbors and creates typed edges based on similarity. When you recall a fact, Genesis can walk the graph to find what supports it, what contradicts it, and what replaced it. After three months of operation: 30,000+ memories, 29,000+ vectors across two Qdrant collections, and a knowledge graph with tens of thousands of typed connections between them.
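Walking typed edges might look like the following. A minimal sketch, assuming edges are stored as (source, type, target) triples; the `MemoryGraph` class and the memory IDs are hypothetical, and only edge types named above are used.

```python
from collections import defaultdict


class MemoryGraph:
    def __init__(self) -> None:
        self._out = defaultdict(list)  # source -> [(edge_type, target)]
        self._in = defaultdict(list)   # target -> [(edge_type, source)]

    def link(self, source: str, edge_type: str, target: str) -> None:
        self._out[source].append((edge_type, target))
        self._in[target].append((edge_type, source))

    def incoming(self, memory_id: str, edge_type: str) -> list:
        """Memories that point at memory_id with the given edge type."""
        return [src for kind, src in self._in.get(memory_id, []) if kind == edge_type]

    def latest(self, memory_id: str) -> str:
        """Follow succeeded_by edges until reaching a memory with no successor."""
        seen = {memory_id}
        current = memory_id
        while True:
            successors = [t for kind, t in self._out.get(current, [])
                          if kind == "succeeded_by" and t not in seen]
            if not successors:
                return current
            current = successors[0]
            seen.add(current)


graph = MemoryGraph()
graph.link("m2", "supports", "m1")
graph.link("m3", "contradicts", "m1")
graph.link("m1", "succeeded_by", "m4")

print(graph.incoming("m1", "supports"))     # ['m2']
print(graph.incoming("m1", "contradicts"))  # ['m3']
print(graph.latest("m1"))                   # m4
```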

An event calendar tracks time-anchored information—deadlines, scheduled tasks, recurring cycles—so Genesis knows not just what happened but when, and can anticipate what's coming. Procedural memory (48 learned procedures at current count) captures reusable multi-step workflows extracted from experience, each with calibrated confidence scores that promote or demote based on outcomes.

Activation scoring ensures relevance isn't just cosine similarity—it's time-aware decay (configurable half-lives: 30-60 days by source type), access frequency (log scale, capping at 20 retrievals), graph connectivity, and class weighting. A steering rule from month one outranks a casual observation from yesterday.
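The activation formula can be written out directly. A sketch under stated assumptions: exponential half-life decay for recency, a `log1p` curve normalized to 1.0 at the 20-retrieval cap, connectivity already scaled to the 0-1 range, and made-up input values. Only the overall product and the 20-retrieval cap come from the text above.

```python
import math


def recency(age_days: float, half_life_days: float) -> float:
    """Exponential decay: the score halves every half_life_days."""
    return 0.5 ** (age_days / half_life_days)


def access_frequency(retrievals: int, cap: int = 20) -> float:
    """Log-scaled access count, saturating at `cap` retrievals."""
    return math.log1p(min(retrievals, cap)) / math.log1p(cap)


def activation(confidence, age_days, half_life_days, retrievals, connectivity, class_weight):
    """confidence x recency x (access_freq + connectivity) x class_weight."""
    return (
        confidence
        * recency(age_days, half_life_days)
        * (access_frequency(retrievals) + connectivity)
        * class_weight
    )


# Hypothetical inputs: a month-old steering rule vs. a one-day-old casual observation.
steering_rule = activation(0.9, age_days=30, half_life_days=60,
                           retrievals=20, connectivity=0.8, class_weight=2.0)
observation = activation(0.6, age_days=1, half_life_days=30,
                         retrievals=1, connectivity=0.1, class_weight=1.0)

print(steering_rule > observation)  # True
```

With these inputs the older rule wins despite its age, because access frequency, connectivity, and class weight outweigh the recency penalty.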

Two collections, different lifecycles:

Collection	What lives there	Lifecycle
Episodic	Conversations, decisions, reflections, evaluations	Decays over time. Subject to correction.
Knowledge	External domain data, ingested reference material	Permanent. Authoritative. Re-ingested, never duplicated.

Session extraction: After conversations end, a pipeline extracts what mattered—entities, decisions, evaluations, action items, relationships—each tagged with provenance back to the source conversation and line range. The system doesn't just remember what you said. It identifies what's worth keeping.

Wing taxonomy: Memory is classified into 15 structural domains (infrastructure, learning, channels, memory, dev_workflow, routing, autonomy, career, integrations, research, and more) with subtopics. Querying within a specific domain cuts noise from the full store. Classification uses tiered confidence signals: file path patterns (strongest) → keywords → tags → source pipeline → fallback.
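The tiered classification order can be sketched as a chain of lookups that stops at the first match. Every rule below (the path patterns, keywords, tags, pipeline names, and the `uncategorized` fallback name) is a hypothetical placeholder; only the tier order and the wing names come from the paragraph above.

```python
import re

# Hypothetical rules for illustration only.
PATH_RULES = [(re.compile(r"(^|/)memory/"), "memory"), (re.compile(r"(^|/)routing/"), "routing")]
KEYWORD_RULES = {"qdrant": "memory", "circuit breaker": "routing", "telegram": "channels"}
TAG_RULES = {"infra": "infrastructure", "job-search": "career"}
PIPELINE_RULES = {"recon": "research"}
FALLBACK_WING = "uncategorized"  # assumed name for the fallback bucket


def classify_wing(text, file_path=None, tags=(), source_pipeline=None):
    """Return (wing, signal) using the first tier that matches, strongest first."""
    if file_path:
        for pattern, wing in PATH_RULES:
            if pattern.search(file_path):
                return wing, "file_path"
    lowered = text.lower()
    for keyword, wing in KEYWORD_RULES.items():
        if keyword in lowered:
            return wing, "keyword"
    for tag in tags:
        if tag in TAG_RULES:
            return TAG_RULES[tag], "tag"
    if source_pipeline in PIPELINE_RULES:
        return PIPELINE_RULES[source_pipeline], "source_pipeline"
    return FALLBACK_WING, "fallback"


# The file path outranks the "qdrant" keyword in the text.
print(classify_wing("Qdrant tuning notes", file_path="src/routing/breaker.py"))
# ('routing', 'file_path')
print(classify_wing("Weekly notes", tags=["infra"]))
# ('infrastructure', 'tag')
```

Returning the matching signal alongside the wing keeps the classification auditable: a caller can tell a strong path match from a weak fallback.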

After three months of operation, Genesis doesn't just have more memories—it has a structured, interconnected, time-aware knowledge system that surfaces the right context before you ask for it. 30,000+ memories. 9,000+ observations. 2,000+ knowledge base entries. 48 learned procedures. Two vector collections. A knowledge graph connecting it all. That's what separates this from a chatbot with a vector database.

Memory growth by the numbers (March–June 2026)

Month	Memories stored	Observations	Cumulative total
March	1,188	2,786	1,188
April	8,746	2,867	9,934
May	18,972	3,119	28,906
June (2 days)	1,414	294	30,320

Self-learning

Most AI systems log what happened. Genesis classifies why it happened, extracts a reusable principle, and verifies that principle works next time. The pipeline runs automatically after every meaningful interaction:

1. Triage        → Should we learn from this at all? (5 depth levels)
2. Outcome       → What happened vs. what was expected? (5 outcome classes)
3. Delta         → How did delivery differ from request?
4. Attribution   → WHY did the outcome happen? (6 causal types)
5. Persistence   → Store observations, update user model, route signals
6. Extraction    → Extract reusable procedures with calibrated confidence

Triage isn't binary. Not everything deserves the same analysis. A trivial status check gets SKIP. A failed approach with a workaround gets FULL_PLUS_WORKAROUND—the deepest analysis tier, triggering procedure extraction and drive adaptation. The classifier gates compute spend, not learning opportunity.

Outcome classification separates diagnoses that most systems conflate:

Outcome	What it means	Why it matters
`success`	Task completed as requested	Reinforces approach confidence
`approach_failure`	Wrong approach, could do better	Triggers procedure extraction
`capability_gap`	Genesis lacks the ability	Routes to capability tracking, not self-blame
`external_blocker`	External system prevented completion	Routes to infrastructure, not behavior change
`workaround_success`	Primary failed, alternative worked	Extracts the workaround as a procedure

If you treat "I did it wrong" and "I can't do it yet" the same way, you learn the wrong lessons every time.
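One way to keep these diagnoses separate in code is an enum with an explicit routing table. The five outcome labels come from the table above; the handler names on the right-hand side are hypothetical paraphrases of the "Why it matters" column.

```python
from enum import Enum


class Outcome(Enum):
    SUCCESS = "success"
    APPROACH_FAILURE = "approach_failure"
    CAPABILITY_GAP = "capability_gap"
    EXTERNAL_BLOCKER = "external_blocker"
    WORKAROUND_SUCCESS = "workaround_success"


# Follow-up per outcome; handler names are illustrative, not real Genesis functions.
ROUTES = {
    Outcome.SUCCESS: "reinforce_confidence",
    Outcome.APPROACH_FAILURE: "extract_procedure",
    Outcome.CAPABILITY_GAP: "track_capability",
    Outcome.EXTERNAL_BLOCKER: "notify_infrastructure",
    Outcome.WORKAROUND_SUCCESS: "extract_workaround_procedure",
}


def route(outcome_label: str) -> str:
    """Map a classifier label to its follow-up; unknown labels raise ValueError."""
    return ROUTES[Outcome(outcome_label)]


print(route("capability_gap"))    # track_capability
print(route("approach_failure"))  # extract_procedure
```

Rejecting unknown labels outright, rather than defaulting them to a failure class, avoids silently learning from a misclassified outcome.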

Causal attribution asks why: Was it an external limitation? A gap in the user model? A misinterpretation of scope? Each of the 6 attribution types routes to a different subsystem—a user model gap triggers a high-priority observation; an external limitation routes to infrastructure awareness. The diagnosis determines the treatment.

Procedure extraction builds reusable knowledge:

Extracted procedures start at L4 (advisory-only, never auto-injected) and promote through tiers as they prove themselves:

L4 (new)     → Advisory only. Auto-extracted, unproven.
L3 (proven)  → Injected at session start. Requires 3+ successes, ≥65% confidence.
L2 (reliable)→ Higher injection priority. Requires 5+ successes, ≥75% confidence.
L1 (core)    → Trigger-cached, instant recall. Requires 8+ successes, ≥85% confidence.

Confidence uses Laplace smoothing: (successes + 1) / (total + 2). With no evidence the formula gives 50%; the ~67% starting point of a newly extracted procedure corresponds to one recorded success and no failures (2/3): not zero, not certain. Evidence moves it. Three consecutive failures combined with failure_count >= success_count + 3 trigger demotion. Confidence below 30% with enough samples quarantines the procedure entirely.
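The smoothing formula and the tier thresholds listed above combine as follows. This sketch covers promotion eligibility only; the function names are illustrative, and the real promotion logic may weigh factors not described here.

```python
def laplace_confidence(successes: int, failures: int) -> float:
    """(successes + 1) / (total + 2): 0.5 with no evidence, never exactly 0 or 1."""
    total = successes + failures
    return (successes + 1) / (total + 2)


# (tier, minimum successes, minimum confidence), strictest first, from the tier list above.
TIER_RULES = [("L1", 8, 0.85), ("L2", 5, 0.75), ("L3", 3, 0.65)]


def eligible_tier(successes: int, failures: int) -> str:
    """Highest tier whose success-count and confidence thresholds are both met."""
    confidence = laplace_confidence(successes, failures)
    for tier, min_successes, min_confidence in TIER_RULES:
        if successes >= min_successes and confidence >= min_confidence:
            return tier
    return "L4"


print(round(laplace_confidence(1, 0), 3))  # 0.667
print(eligible_tier(3, 0))  # L3 (confidence 4/5 = 0.80)
print(eligible_tier(5, 1))  # L2 (confidence 6/8 = 0.75)
print(eligible_tier(8, 2))  # L2 (confidence 9/12 = 0.75, below the 0.85 bar for L1)
```

The last case shows why both conditions matter: eight successes meet the L1 count, but two failures keep the confidence below the L1 threshold.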

Drive adaptation: Outcomes feed back into Genesis's four behavioral drives (cooperation, competence, curiosity, preservation) via EMA updates with a small learning rate (α=0.005). It takes dozens of interactions to meaningfully shift a drive weight—no single event overcorrects. Protected behaviors (honesty, transparency, pushback) cannot be eroded by weak signals regardless of volume.
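To see how slowly α=0.005 moves a weight, here is the standard exponential-moving-average update with the arithmetic worked out. The starting weight of 0.5 and the target signal of 1.0 are hypothetical; only the learning rate comes from the text.

```python
import math

ALPHA = 0.005  # learning rate quoted above


def ema_update(weight: float, signal: float, alpha: float = ALPHA) -> float:
    """Move the weight a fraction alpha of the way toward the observed signal."""
    return weight + alpha * (signal - weight)


def updates_to_close_gap(fraction: float, alpha: float = ALPHA) -> int:
    """Consistent same-direction updates needed to close `fraction` of the gap."""
    # After n updates the remaining gap is (1 - alpha) ** n.
    return math.ceil(math.log(1.0 - fraction) / math.log(1.0 - alpha))


weight = 0.5  # hypothetical starting weight for one drive
weight = ema_update(weight, 1.0)
print(round(weight, 4))            # 0.5025
print(updates_to_close_gap(0.10))  # 22
print(updates_to_close_gap(0.50))  # 139
```

Closing even a tenth of the gap takes 22 consistent signals, which matches the claim that no single event overcorrects.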

This isn't "log what happened and hope the next session reads it." It's a closed loop: classify outcome → diagnose cause → extract principle → verify principle → promote or demote. The system gets measurably better at its job over time, and it can show you the receipts.

Earned autonomy

Genesis earns autonomy per category through demonstrated competence:

Level	Authority	Example
L1	Simple tool use	Health checks, status queries
L2	Pattern execution	Running known procedures
L3	Novel task handling	Unfamiliar requests within earned categories
L4	Proactive outreach	Initiating communication based on observations

V5 extends this to L5-L7: system configuration, learning modification, and identity evolution. These require months of L4 operational data before they're safe to activate.

Trust is granular, not binary. First failure in a category triggers demotion—Bayesian regression, not a fixed penalty. Earn it back through performance. The regression is always announced. Never silent.

The ego layer is where autonomy meets judgment. Two egos observe the system's state, decide what needs doing, propose actions to the user via Telegram, and execute approved work by dispatching Claude Code sessions. Every dispatch goes through an approval gate—one approval per request, no blanket passes, no stale reuse. The user sees what's proposed and decides what runs.
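The three approval-gate properties (one approval per request, no blanket passes, no stale reuse) map naturally onto single-use tokens bound to a request ID with an expiry. A minimal sketch assuming that design; the `ApprovalGate` class, the TTL value, and the request IDs are hypothetical, and the clock is passed in explicitly to keep the example deterministic.

```python
import secrets


class ApprovalGate:
    """Single-use approvals bound to one request, expiring after ttl_seconds."""

    def __init__(self, ttl_seconds: float = 3600.0) -> None:
        self.ttl_seconds = ttl_seconds  # assumed default; the real window is not stated
        self._pending = {}              # token -> (request_id, approved_at)

    def approve(self, request_id: str, now: float) -> str:
        """Record the user's approval for exactly one request and return its token."""
        token = secrets.token_hex(16)
        self._pending[token] = (request_id, now)
        return token

    def authorize(self, token: str, request_id: str, now: float) -> bool:
        """True at most once per token; the token is consumed on the first check."""
        entry = self._pending.pop(token, None)  # pop: no reuse after the first check
        if entry is None:
            return False
        approved_request, approved_at = entry
        if approved_request != request_id:      # no blanket passes across requests
            return False
        return now - approved_at <= self.ttl_seconds  # no stale approvals


gate = ApprovalGate(ttl_seconds=600)
token = gate.approve("dispatch-42", now=0.0)
print(gate.authorize(token, "dispatch-42", now=30.0))  # True
print(gate.authorize(token, "dispatch-42", now=31.0))  # False (already used)
```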

Goal-driven behavior: Genesis tracks your goals—milestones and ongoing objectives alike—and proactively reviews them when they go stale. Goals decompose into subgoals with tracked completion. When all subgoals of a milestone are achieved, Genesis surfaces it. Each goal can have its own review cadence, or fall back to a global default. You don't manage a task tracker—Genesis manages the goals and tells you when something needs your attention or when something is done.

Four drives shape behavior beneath the autonomy system—Preservation, Curiosity, Cooperation, Competence—each a sensitivity multiplier, each in tension with the others. The drives adapt based on evidence from the learning loop. The tension is the point.

The user has override authority. Always.

Infrastructure

Genesis manages its own infrastructure. When something breaks, it diagnoses and fixes it. When it can't, it tells you why via Telegram—not because you noticed something was wrong, but because the system told you.

Two independent systems monitor each other in a closed loop. The external watchdog (Guardian), running on the host VM outside the container, spawns its own Claude Code session to diagnose and restore Genesis if the container goes unhealthy. The container-side counterpart (Sentinel) has its own 6-state machine (its states include healthy → investigating → remediating → escalated → awaiting approval), an alarm classifier, and exponential backoff across four tiers before escalation. If the external watchdog goes silent, Genesis detects the stale heartbeat and restarts it over SSH. Neither one runs unprotected. Neither one is a single point of failure.

The resilience layer tracks four independent failure axes—cloud availability, memory, embeddings, and Claude Code availability—each with its own degradation levels:

Axis	Healthy	Degraded	Down
Cloud	All providers responding	Fallback chains active	All providers unreachable
Memory	Qdrant + FTS5 operational	FTS5-only retrieval	Memory store unreachable
Embedding	Provider responding	Writes queued for retry	Provider unavailable
CC	Sessions dispatching normally	Deferred work queue active	All reflections deferred

When something breaks: work gets deferred with staleness policies, routing walks the fallback chain, circuit breakers automatically test recovery, and the recovery orchestrator coordinates across all four axes. Most systems have binary health: up or down. Genesis maps the entire space in between.

Genesis also routes LLM work across model tiers automatically—starting with the cheapest capable model, not the most expensive. Local free models handle extraction. Frontier models handle strategic reasoning. Circuit breakers and fallback chains mean the call site never fails—only individual providers do. Graceful degradation all the way down.
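A cheapest-first fallback chain guarded by circuit breakers can be sketched as below. This is a generic illustration of the pattern, not the Genesis router: the failure threshold, cooldown, provider functions, and names are all assumptions, and this sketch raises when every provider is unavailable, which corresponds to the Cloud "Down" state in the table above.

```python
class CircuitBreaker:
    """Opens after max_failures consecutive errors; allows a trial call after cooldown."""

    def __init__(self, max_failures: int = 3, cooldown: float = 60.0) -> None:
        self.max_failures = max_failures
        self.cooldown = cooldown
        self.failures = 0
        self.opened_at = None

    def allows(self, now: float) -> bool:
        if self.opened_at is None:
            return True
        return now - self.opened_at >= self.cooldown  # half-open: permit one probe

    def record(self, ok: bool, now: float) -> None:
        if ok:
            self.failures, self.opened_at = 0, None
            return
        self.failures += 1
        if self.failures >= self.max_failures:
            self.opened_at = now


def call_with_fallback(chain, breakers, prompt, now):
    """Try providers cheapest-first, skipping any whose breaker is open."""
    for name, provider in chain:
        breaker = breakers[name]
        if not breaker.allows(now):
            continue
        try:
            result = provider(prompt)
        except Exception:
            breaker.record(False, now)
            continue
        breaker.record(True, now)
        return name, result
    raise RuntimeError("all providers unavailable")


def local_model(prompt):     # hypothetical provider that is currently failing
    raise ConnectionError("local model offline")


def frontier_model(prompt):  # hypothetical provider that succeeds
    return f"answer to: {prompt}"


chain = [("local", local_model), ("frontier", frontier_model)]
breakers = {name: CircuitBreaker(max_failures=2) for name, _ in chain}

for tick in range(3):
    print(call_with_fallback(chain, breakers, "summarize", now=float(tick)))
print(breakers["local"].opened_at)  # 1.0
```

After the second failure the local provider's breaker opens, so the third call skips it entirely and goes straight to the next provider in the chain.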

Real-world integration

Genesis operates in the real world through always-on channels:

Email — Two-layer AI triage: a fast model reads and scores every email, a capable model makes final keep/discard decisions on what survives. Relevant findings get stored as searchable intelligence. Your inbox processed by a paralegal and a judge.

Inbox — Drop anything—a markdown file, a URL, a PDF, a voice memo—into your notepad or a watched folder. Genesis evaluates the content, determines your intent, processes it through its full knowledge lens, and sends you a summary via Telegram within minutes. Drop it in your notes. Walk away.

Telegram — Proactive notifications, morning digests, and conversational interaction. Genesis reaches out when it has something worth saying. Not a notification firehose—calibrated outreach based on measured engagement. Voice input works too: speech gets transcribed and routed through the same pipeline as text.

Task executor — Give Genesis a complex multi-step task and walk away. It decomposes the work, plans execution, runs it in isolated git worktrees, verifies results with adversarial self-review, and delivers. You're notified when it's finished or when it genuinely needs you. Each task it completes, it learns from—which means it needs you less each time.

Browser — Genesis maintains persistent browser sessions with saved login state—authenticated scraping, form filling, and web interaction across sessions without re-authenticating every time. A collaborative mode lets you watch what Genesis is doing in the browser in real time via your own window.

Parallel cognition — While you're working on one thing, Genesis can be doing something else entirely: researching, auditing memory, running recon, processing a document you dropped in the inbox. Multiple threads of work, no context bleed between them, results waiting for you when you're ready.

The web as a tool — Genesis treats the web as a searchable, scrapable, structured resource—not just a URL to fetch. Multiple search providers, JS-rendered page extraction, and persistent sessions mean research tasks that would take you hours happen autonomously in the background.

Modules

Genesis has a pluggable capability module system. Any program with an interface can plug into Genesis's cognitive stack—memory, learning, reflection, outreach, compute routing—without touching a line of core code or the module's own code.

When Genesis runs a module, it doesn't just call it. It remembers the results. It learns from the outcomes. It reflects on the patterns. Domain-specific tracking stays isolated, but generalizable lessons automatically cross into core memory. The module gets smarter because Genesis gets smarter.

The /integrate-module skill handles onboarding automatically—discovery, connection mapping, config generation, dashboard setup, verification, and documentation. You don't touch Genesis's code. You just ask.

Included: content pipeline (drafting, publishing, analytics), crypto market monitoring, prediction market analysis.

What's inside

50+ subsystems organized into seven layers:

graph LR
    subgraph "Cognitive"
        ego["ego"]
        awareness
        reflection
        perception
        learning
        calibration
    end

    subgraph "Memory"
        memory
        knowledge
        db["db (60+ tables)"]
        qdrant
    end

    subgraph "Intelligence"
        routing
        resilience
        providers
        CC["CC (Claude Code)"]
    end

    subgraph "Autonomy"
        autonomy
        tasks["task executor"]
        sentinel
        guardian
        skills
    end

    subgraph "Communication"
        outreach
        channels
        content
        inbox
        mail
    end

    subgraph "Operations"
        surplus
        modules
        pipeline
        browser
        research
    end

    subgraph "Infrastructure"
        dashboard
        observability
        recon
        identity
        mcp["mcp (4 servers)"]
        eval
    end

How it got here

V3 was built in 10 phases over seven months: from data schemas to full autonomous cognition, one phase earning the right to build the next. Every architectural choice made under constraint. Every subsystem shaped by what came before it.

docs/journey/ has the full story—ten phase retrospectives, an origin story going back to V1 (a WhatsApp bot with heuristic routing), and honest documentation of what worked and what didn't. If you want to understand why Genesis is built the way it is, not just what it does, start there.

Where it's headed

V3 is the foundation—cognitive architecture complete, unified ego loop live, goal-driven behavior active, running in production. What comes next is where it gets ambitious enough to need a community behind it.

V3.x — Voice interface & ambient awareness

Genesis has memory, learning, and autonomy—but until now, only through text. V3.x makes it ambient: always listening, always available, always aware of what's happening around you.

Real-time voice conversation (live) — talk to Genesis through a speaker the way you'd talk to Alexa, except with full memory, knowledge, and action capabilities behind it. Speech-to-speech model handles the conversation; Genesis handles the thinking. S2S sessions are live now with session management, audio output, and optimized prompting.
Ambient transcription — continuous listening, transcription, and intelligence extraction. Decisions made, action items, names mentioned, topics discussed—captured and stored without explicit commands.
Proactive awareness — Genesis speaks up when it has something valuable to contribute. Hears you mention a broken deployment? It tells you. Detects emotional escalation in a conversation? It can offer to help.
Genesis HTTP Tool API — exposes Genesis's capabilities (memory, health, knowledge, actions) over HTTP, enabling any client—speech models, Home Assistant, external agents—to call Genesis tools directly.

The voice interface is built on Home Assistant as the frontend (wake word, mic, speaker) with Genesis as the cognitive backend. Ambient awareness adds a passive listening mode with a dedicated memory tier, privacy filtering, and a lightweight attention system that decides what deserves Genesis's attention in real time.

V4 — Global workspace & multi-agent coordination

V3.x delivered the unified cognitive loop: signal-driven focus, goal-aware behavior, and coordinated ego action. V4 takes this further—from a single system thinking coherently to multiple agents coordinating across a shared workspace.

The architecture draws from Global Workspace Theory (Baars, 1988) and the LIDA cognitive cycle (Franklin et al.)—frameworks used to model consciousness in cognitive science:

Sense → Perceive → Attend → Broadcast → Propose → Select → Act → Learn

Multi-agent workspace — Genesis instances sharing intent state, memory, and observations. What one agent learns, all agents benefit from. A personal Genesis that can coordinate with team members' Genesis instances without leaking private context.
Signal and drive weight adaptation — evidence-driven calibration of attention and the four drives. The system learns which signals actually predict important outcomes, and adjusts its attention accordingly.
Coordinated action selection across agents — when multiple agents observe the same situation, they negotiate who acts rather than duplicating work or conflicting.
Six measurable GWT markers — if we can't measure whether the architecture is working, we're building in the dark.

V5 — Self-evolution

Autonomous codebase evolution. Genesis proposes changes to its own source code. It searches for developments in AI research, evaluates them against its own architecture, and integrates what makes it better. Not a human reviewing papers—the system itself.

User-adaptive architecture. The underlying codebase shifts to serve each user. Not personalized prompts—structural code changes that reshape how Genesis processes information based on months of learned behavior. The system you're running six months in is architecturally different from the one you started with.

Full autonomy progression (L5-L7):

Level	What it earns	What it means
L5	System configuration	Adjusting its own thresholds, weights, and parameters
L6	Learning modification	Changing its own review schedules and calibration targets
L7	Identity evolution	Proposing changes to its own operating principles

Every change is proposed to the user first. Genesis backs itself up before self-modification, tests in isolation, and rolls back automatically if something breaks.

Nobody else is attempting this. Most agent frameworks are still building prompt chains and calling it intelligence. Giving a system a framework and the capability to autonomously upgrade itself may end up being the pinnacle of an "AGI-like" system, or the downfall of the whole thing—we're bold enough to build it and find out which.


Architecture

The complete design lives in docs/, indexed at docs/INDEX.md.

Subsystem deep-dives — how the internals work, written for contributors:

routing-deep-dive.md — Multi-provider routing, circuit breakers, rate gates, dead-letter recovery
memory-deep-dive.md — 4-layer hybrid retrieval, RRF fusion, activation scoring, graceful degradation
autonomy-deep-dive.md — Bayesian trust model, context ceilings, approval gates, enforcement layers
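
For readers unfamiliar with the RRF fusion mentioned for the memory subsystem, the following is a minimal sketch of standard reciprocal rank fusion: each retriever contributes `1 / (k + rank)` for every item it returns, and items are ordered by the summed score. The constant `k = 60` is the commonly used default, not a value confirmed for Genesis, and `rrf_fuse`, the two hit lists, and the memory IDs are hypothetical. See memory-deep-dive.md for how Genesis actually combines its layers.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def rrf_fuse(rankings: Sequence[Sequence[str]], k: int = 60) -> List[str]:
    """Fuse several ranked ID lists with reciprocal rank fusion."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from a vector search and a keyword search.
vector_hits = ["m3", "m1", "m7"]
keyword_hits = ["m1", "m9", "m3"]
fused = rrf_fuse([vector_hits, keyword_hits])
```

Because only ranks are used, the vector and keyword scores never need to be put on a common scale, and an item that appears in both lists outranks one that appears in only one.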

Case studies — what Genesis does in practice, written for prospective users:

multi-provider-routing.md — Provider outages, zero interruption
hybrid-memory.md — Three months later, it remembers
earned-autonomy.md — Trust that has to be earned
deep-research.md — Outperforming ChatGPT, Perplexity, and Gemini on a live research benchmark

Design foundations:

genesis-v3-vision.md — Core philosophy and identity
genesis-v3-autonomous-behavior-design.md — Primary architecture reference
genesis-v3-build-phases.md — Safety-ordered build plan
genesis-v3-resilience-architecture.md — Resilience layer design

Primitives from the Genesis portfolio

Standalone libraries extracted from Genesis and hardened through production use:

genesis-router — LLM routing with circuit breakers, self-escalation, and failover chains.
genesis-memory — Hybrid AI memory: Qdrant vectors + SQLite FTS5 + multi-factor scoring + MCP server.
genesis-task-executor — Autonomous LLM task executor with formal state machine, 4-layer failure recovery, and adversarial verification.
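
As a rough picture of what routing with circuit breakers and failover chains means, the sketch below skips any provider whose breaker is open and falls through to the next one in the chain. It does not use or reproduce the genesis-router API. The `CircuitBreaker` class, the `route` function, the thresholds, and the provider names are all hypothetical.

```python
import time
from typing import Callable, Optional, Sequence, Tuple


class CircuitBreaker:
    """Opens after `max_failures` consecutive failures, then cools down."""

    def __init__(self, max_failures: int = 3, cooldown_s: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.max_failures = max_failures
        self.cooldown_s = cooldown_s
        self.clock = clock
        self.failures = 0
        self.opened_at: Optional[float] = None

    def allow(self) -> bool:
        if self.opened_at is None:
            return True
        # Once the cooldown has elapsed, calls are allowed again as a trial.
        return self.clock() - self.opened_at >= self.cooldown_s

    def record(self, success: bool) -> None:
        if success:
            self.failures = 0
            self.opened_at = None
            return
        self.failures += 1
        if self.failures >= self.max_failures:
            self.opened_at = self.clock()  # open (or re-open) the breaker


Provider = Tuple[str, Callable[[str], str], CircuitBreaker]


def route(prompt: str, providers: Sequence[Provider]) -> str:
    """Try providers in order, skipping any whose breaker is open."""
    for name, call, breaker in providers:
        if not breaker.allow():
            continue
        try:
            result = call(prompt)
        except Exception:
            breaker.record(False)
            continue
        breaker.record(True)
        return f"{name}: {result}"
    raise RuntimeError("all providers unavailable")
```

The breaker keeps a failing provider from being retried on every request, so an outage costs a few failed calls rather than a failed call per prompt.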

Get involved

V3 was built solo. V4 and V5 are ambitious enough to need a community. If you've read this far and something here resonates—the memory architecture, the autonomy model, or the audacity of building a self-evolving agent—there's work to do.

What contributing looks like: Install it, run it, push it into territory it hasn't been. The cognitive architecture is complete—what it needs now is people who want to help it earn the claim rather than just read about it.

Where to start:

Discord — the hub. Ask questions, share what you're working on
docs/architecture/ — understand the design before diving into code
Issues — filed bugs and feature work

License

MIT License. See LICENSE.

AGI is here. Clone it. Run it. Tell me what's missing.

