add starter sandboxes example#1595

Open
andrewhinh wants to merge 1 commit into main from sandbox-starter

@andrewhinh andrewhinh commented Jun 22, 2026

Contributor

Add a new starter Sandboxes example to demonstrate how Sandboxes work in the SDK and dashboard.

@andrewhinh andrewhinh self-assigned this Jun 22, 2026
devin-ai-integration[bot]

This comment was marked as resolved.

@andrewhinh andrewhinh force-pushed the sandbox-starter branch 2 times, most recently from 4969943 to 4a306b0 Compare June 22, 2026 22:17
devin-ai-integration[bot]

This comment was marked as resolved.

@aaazzam aaazzam left a comment

This is a bit involved for a Sandbox 101. Most other getting_started examples are roughly 20-30 lines of code and sprint towards showing the core primitive. I think this example could be repurposed for something more eval oriented (running evals with harbor or something) but as-is I think this is hard to lead off with. What do you think?

@andrewhinh

Contributor Author

> This is a bit involved for a Sandbox 101. Most other getting_started examples are roughly 20-30 lines of code and sprint towards showing the core primitive. I think this example could be repurposed for something more eval oriented (running evals with harbor or something) but as-is I think this is hard to lead off with. What do you think?

this was basically a first draft response to this:

> Tactically: I like to show the SDK and dashboard via a demo, inference.py and inference_map.py or get_started.py with H100s and map added live, scaling up to 50 or so GPUs. Showing massive scaling tends to impress people and get the vague shape of the value prop in their heads. You can see a recorded version of the get_started demo from January 26, 2025 here. The SDK and dashboard are our primary product surfaces, so you want to use this opportunity to familiarize people with them, even if just vaguely, so that the next impression sticks better. We Should™️ come up with something similar for Sandboxes, probably scaling up to 100 or 1000 containers running a coding agent, then having an LLM judge pick the best outcome? I have an old demo in this direction here, but it’s too complicated to use in most settings.

agree tho it's too long, especially all the test setup. but actually I didn't know about harbor, let me play around with it.

devin-ai-integration[bot]

This comment was marked as resolved.

@devin-ai-integration devin-ai-integration Bot left a comment

Contributor

Devin Review found 1 new potential issue.

Comment thread 01_getting_started/sandboxes.py
@andrewhinh andrewhinh requested a review from aaazzam June 23, 2026 23:14
2 participants