rlm-rs

RLMs in Rust using RustPython and gVisor

Development

System requirements

Linux running x86-64 or ARM64 architectures. See instructions for running on AWS EC2 below.

Installation

rustup
prek

prek install

Linux:

sudo apt-get update && sudo apt-get install -y runsc
sudo runsc install
sudo systemctl restart docker

EC2:

aws cli and auth setup

IAM_USER=<iam-user> make aws-setup                          # optionally specify IAM_USER to create access key, then create key pair
ARCH=arm64 INSTANCE_TYPE=t4g.medium ROOT_GB=50 make create  # optionally specify ARCH, INSTANCE_TYPE, ROOT_GB, then create instance
make conn

# in the instance
make ec2-setup

Setup

Create a .env file with the following variables:

OPENAI_API_KEY=<api-key>

Commands

Run make help for the full list of commands.

For both Linux and EC2 instances:

RLM_METHOD=<rlm|lambda_rlm> cargo run
make app METHOD=<rlm|lambda_rlm>
make goose HOST=<host>

Roadmap

port rlm-minimal to Rust and RustPython
unblock event loop
add support for depth > 1
add shared program state
add per-session REPL sandboxing with gVisor
add toggle for λ-RLM paper and code

Details

Sandboxing

Requests within a session remain ordered while different sessions execute concurrently, so one long-running REPL interaction does not create cross-session head-of-line blocking for unrelated traffic. Ingress is bounded and fails fast under saturation instead of queueing indefinitely, and pool ownership is centralized in a single broker to avoid contention around mutable container state.

Async Runtime

The async runtime separates network-facing work from interpreter execution so that blocking Python operations do not starve request handling or model I/O. REPL commands are dispatched through channels to a dedicated worker thread, which isolates synchronous interpreter calls from the async control plane. A persistent REPL worker is used to preserve interpreter-local state across iterative commands and to avoid per-command thread startup costs.

Load Testing

The load test runs 20 simulated users for 5 minutes against /v1/chat/completions.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github/workflows		.github/workflows
assets		assets
crates		crates
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
rust-toolchain.toml		rust-toolchain.toml
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rlm-rs

Development

System requirements

Installation

Setup

Commands

Roadmap

Details

Sandboxing

Async Runtime

Load Testing

Credit

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

rlm-rs

Development

System requirements

Installation

Setup

Commands

Roadmap

Details

Sandboxing

Async Runtime

Load Testing

Credit

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages