Is the Agents SDK free? Is it open source?

The SDK itself is a free, open-source project under the MIT license (pip install openai-agents), but using OpenAI's models and hosted tools (Code Interpreter, File Search) incurs API charges. Code Interpreter costs $0.03 per session; File Search costs $0.10 per GB/day plus $2.50 per 1,000 calls.

Can I use Claude or Gemini with the Agents SDK?

Yes, but with caveats. The SDK's model inference layer supports connecting to Claude, Gemini, and other models via OpenAI-compatible APIs or third-party adapters like LangDB. However, if you use hosted tools like Threads, Vector Stores, or File Search, those features only work on the OpenAI platform and can't be migrated along with the model.

Which sandbox vendor is cheapest? How do I choose between E2B, Modal, and Daytona?

Use E2B for dev and testing (free $100 credit), Modal for production deployment ($30/month free credit, per-second billing with no minimums), and Daytona for enterprise compliance ($200 free credit, supports self-hosting). For indie developers on a $20-50/month budget, Modal's per-second billing model is the most cost-effective.

How is this different from the old Assistants API? Can I still use the Assistants API?

The Agents SDK is the evolution of the Assistants API. The key differences are the harness/compute separation architecture and open-source controllability. The Assistants API still works, but OpenAI's development focus has shifted entirely to the Agents SDK. New projects should adopt the Agents SDK directly.

I don't know Python. Can I use the Agents SDK with TypeScript?

Yes. The TypeScript Agents SDK (openai-agents-js) now supports sandbox functionality in beta as of May 2026, including isolated filesystem workspaces, shell commands, file editing, and snapshots. However, code mode and subagents are still in development for both Python and TypeScript. If sandbox execution covers your needs, TypeScript works now; for the full feature set, Python remains the more complete option.

OpenAI Agents SDK: The Indie Maker's Practical Guide (May 2026 Update)

You want to build AI Agent side projects, but assembling the infrastructure — tracing, sandboxes, multi-agent orchestration — eats up most of your time? The OpenAI Agents SDK has evolved from its April 2026 architectural overhaul to v0.17.5 in June, continuing to unify these scattered pieces into a single API. Meanwhile, Anthropic launched the Claude Agent SDK and AWS open-sourced Strands, reshaping the competitive landscape. Before you pick a framework, there are a few things worth understanding first.

TL;DR

The Agents SDK is free and open source (MIT license), but hosted tools and model calls cost money — and the cost structure is non-linear
"Model-agnostic" comes with conditions: the inference layer is swappable, but hosted tools lock you into the OpenAI platform
TypeScript sandbox now available (beta): as of v0.17.5, sandbox works in the TypeScript SDK, though code mode and subagents are still in development
The real architectural innovation is harness/compute separation, not sandbox itself
New competitive landscape: Claude Agent SDK (deepest MCP integration), AWS Strands (Bedrock ecosystem) — your choice is no longer just OpenAI
For developers on a $20-50/month budget: use E2B for testing, Modal for deployment, and Manifest with your own storage to avoid vendor lock-in

You Think the Agents SDK Is Model-Agnostic, but That Freedom Has Conditions

OpenAI's official docs claim the Agents SDK supports 100+ LLMs. Technically, that's true. The SDK's model inference layer can connect to Claude, Gemini, DeepSeek, and other models via OpenAI-compatible APIs, and third-party adapters like LangDB have comprehensive tutorials.

But here's the crucial distinction: swapping models is not the same as swapping platforms.

Once you use hosted tools like Threads, Vector Stores, File Search, or Code Interpreter, your agent's data lives on OpenAI's platform. These tools have no universal interface — you can't export a Vector Store index directly to Pinecone, and you can't export Thread conversation history to another framework.

For indie developers, the pragmatic strategy is:

Inference layer: go ahead and start with OpenAI models, knowing you can switch later
Data layer: carefully evaluate each hosted tool — use your own alternatives when possible
Storage layer: mount S3/GCS via Manifest (details below) to keep your data portable

The Real Architectural Innovation Isn't Sandbox — It's Harness/Compute Separation

Most media coverage focused on the "new sandbox feature," but The New Stack's technical analysis identified a deeper design philosophy: the core of this update is the separation of harness (control plane) from compute (execution plane).

Why does this matter? In traditional architectures, your API keys, database passwords, and third-party service tokens all live in the same environment where the agent code executes. If the model gets hit with a prompt injection attack, the attacker could theoretically make the agent leak your credentials.

The harness/compute separation design assumes a fundamental principle: assume threats will occur. Credentials always stay in the harness layer and never enter the sandbox environment where model-generated code runs. Even if the sandbox is compromised, the attacker can't access your keys.

During testing, I ran import os; print(os.environ.get("OPENAI_API_KEY")) inside a Modal sandbox to try reading an API key set in the harness layer. The result was None, confirming that harness-layer isolation works — harness credentials are not injected into sandbox environment variables. For indie developers, this means you no longer need to build your own credential isolation mechanism; the SDK handles it at the architecture level.

This design is also a powerful argument for convincing your company's security team: it's not just "we added a sandbox," but "we fundamentally assume attacks will happen, so sensitive data simply doesn't exist in the execution environment."

Zero to First Agent: The Fastest Path for Indie Makers

Just install it and go. The Agents SDK is currently at v0.17.5 (released June 11, 2026; the default model has switched to gpt-5.4-mini. The SDK is still rapidly evolving, so check the official docs for the latest API). Requires Python 3.10+:

pip install openai-agents

Prerequisites: you already have an OpenAI API key (set as the OPENAI_API_KEY environment variable) and Python 3.10+ installed.

The minimum viable agent takes just a few lines:

from agents import Agent, Runner

agent = Agent(
    name="idea-validator",
    instructions="You are a side project idea validation assistant. Analyze the user's idea and provide a market viability assessment and recommended MVP feature list."
)

result = Runner.run_sync(agent, "I want to build a Slack bot that auto-generates weekly reports using AI")
print(result.final_output)

That's the bare minimum. But what actually saves me time with the Agents SDK isn't the Agent itself — it's the built-in tracing that requires zero configuration. Every Runner.run() call automatically records the complete execution trace, including each tool call's inputs and outputs, token consumption, and latency. You can view it all in the OpenAI Dashboard.

If you've built agents with LangChain before, you know how much time setting up LangSmith tracing takes. The Agents SDK makes it zero-config, which for someone who only has weekends for side projects saves not just setup time but debugging time.

Adding tools is equally intuitive:

from agents import Agent, Runner, function_tool

@function_tool
def check_domain(domain: str) -> str:
    """Check if a domain name is available"""
    # Your checking logic
    return f"{domain} is available"

agent = Agent(
    name="idea-validator",
    instructions="You are a side project idea validation assistant. You can check domain name availability.",
    tools=[check_domain]
)

From installation to running your first agent with tools, it took me under 30 minutes (given an existing API key and Python 3.10+ environment).

If your agent needs to execute code, read/write files, or run shell commands, you need a sandbox. The Agents SDK added a built-in SandboxAgent in v0.14.0 with official support for multiple sandbox vendors. Here's a selection guide for indie developers on a $20-50/month budget:

Criteria	E2B	Modal	Daytona
Free credit	$100 one-time	$30/month	$200 one-time
Billing model	Per-second	Per-second	Per-second
Unit price reference	1 vCPU ~$0.05/hr	CPU from $0.059/hr	Per actual compute
Max session	1 hour (free tier)	No hard limit	Plan-dependent
Best for	Dev/testing, prototyping	Production, sporadic use	Enterprise compliance, self-hosting
Indie dev recommendation	Best for getting started	Best for production	Overkill unless compliance is required

My actual setup: I use E2B's free credits for rapid validation during development, then switch to Modal for deployment once agent behavior is stable. Modal's per-second billing and $30/month free credit make it very economical for side projects that only run a few hours on weekends.

What happens when E2B's free credit runs out: The $100 E2B credit is one-time only; after that, you pay (also per-second, 1 vCPU ~$0.05/hr). Once your dev testing phase is over, switch to Modal rather than continuing to pay E2B — Modal's free credit resets monthly, making it better suited for low-frequency side projects.

The Full Cost Picture: Agent Costs Go Beyond Tokens

Many people assume the Agents SDK's cost is just token fees, but there are actually three dimensions that stack up:

1. Model token costs: the baseline, depending on your chosen model.

2. Hosted tools fixed costs:

Code Interpreter: $0.03/session (20-minute container each)
File Search: $0.10/GB/day (storage) + $2.50/1,000 calls

3. Token inflation from multi-step workflows: each agent turn resends the full context. A 5-step workflow may consume 3-4x the tokens you'd expect.

Cost modeling example (estimates for planning purposes only; actual costs vary by usage pattern):

Suppose you build a code review agent that runs 5 steps per review, averaging 2,000 input tokens + 500 output tokens per step (including context resending), using GPT-4o, and triggering 1 Code Interpreter session:

Model tokens (GPT-4o: $2.50/M input, $10/M output): ~$0.025-0.05/run
Code Interpreter session: $0.03/run
Context resending inflation (multi-step context accumulation): additional $0.02-0.06/run
Total cost per review: ~$0.075-0.14

If you run 20 tests per day, that's $1.5-2.8/day, potentially $45-84/month — and that's before sandbox vendor fees.

Cost guardrail recommendations:

Set a monthly spending cap in the OpenAI Dashboard
Use the max_turns parameter to limit maximum agent execution steps
Use cheaper models (e.g., GPT-4o-mini) during development; switch to more powerful models after confirming the workflow
File Search storage is billed daily — clean up your Vector Store after testing

The Reality for TypeScript Developers: Sandbox Now Available, But Gaps Remain

Good news: as of May 2026, the TypeScript Agents SDK (openai-agents-js) now supports sandbox functionality in beta, including isolated filesystem workspaces, shell command execution, file editing, and snapshots. This is a major improvement from the "everything is Python-only" situation in April.

However, gaps remain: code mode and subagents are still in development for both Python and TypeScript.

If you're a TypeScript developer, your options are much better than a month ago:

Use the TypeScript SDK + sandbox (beta) directly: most indie maker use cases are now covered — the sandbox beta supports file operations, shell commands, and stateful sessions
Use Python when you need full features: if your agent requires code mode or complex subagent orchestration, the Python SDK remains the most feature-complete choice
Hybrid architecture: TypeScript handles frontend/API layers, Python handles core agent logic, communicating via REST API or message queue

Note: sandbox functionality is still in beta — API details, defaults, and supported capabilities may change. Evaluate the risk before using in production.

Multi-Agent Collaboration: What Handoff Can Do Now, Where Subagents Stand

The Agents SDK's multi-agent mechanism currently has two parts — one is ready to use, one is still on the roadmap:

Available Now: Handoff (Sequential Orchestration)

Handoff lets you define transfer logic between agents. For example, a "triage agent" determines user intent and hands the conversation to the appropriate "specialist agent":

# Note: check the latest official docs for import paths; the SDK is still rapidly evolving
from agents import Agent, handoff

billing_agent = Agent(name="billing", instructions="Handle billing-related inquiries")
tech_agent = Agent(name="tech-support", instructions="Handle technical issues")

triage_agent = Agent(
    name="triage",
    instructions="Determine the type of user issue and hand off to the appropriate specialist",
    handoffs=[handoff(billing_agent), handoff(tech_agent)]
)

Handoff is sequential: only one agent runs at a time, passing control when finished. For most indie maker use cases, this is sufficient.

Still on the Roadmap: Subagents (Parallel Task Decomposition)

If you want multiple agents running different tasks simultaneously (e.g., one researching data, one writing code, one running tests), the subagents feature is still on the roadmap.

For now, parallel execution requires managing asyncio yourself:

import asyncio
from agents import Runner

async def parallel_agents():
    results = await asyncio.gather(
        Runner.run(research_agent, "Look up market data"),
        Runner.run(code_agent, "Generate MVP code"),
    )
    return results

It works, but without SDK-level tracing integration or error handling. When designing new project architectures, don't assume subagents are available — avoid needing a rewrite later.

Advanced: Using Manifest to Avoid Vendor Lock-in

If you're concerned about getting locked into the OpenAI ecosystem, Manifest is currently the most practical escape hatch.

Manifest is an abstraction layer in the Agents SDK that lets you define an agent's workspace (filesystem, environment variables, resource mounts) without tying it to a specific compute provider. The key point: you can mount your own cloud storage via Manifest.

Hybrid architecture strategy:

+----------------------------------+
|  Your control boundary           |
|  +-----------+  +--------------+ |
|  | Harness   |  | Your storage | |
|  | + Tracing |  | (S3 / GCS)  | |
|  | (SDK)     |  |              | |
|  +-----+-----+  +------+------+ |
|        |    Manifest    |        |
|        +-------+--------+        |
+-----------------+----------------+
                  |
          +-------+--------+
          |    Sandbox      |
          | (E2B / Modal)   |
          +----------------+

The core idea behind this strategy:

Use the SDK's harness + tracing: these are the Agents SDK's core value propositions and don't involve data lock-in
Use your own S3/GCS for storage: mount via Manifest so sandbox agents read/write to your storage
Avoid data dependencies on hosted tools: replace File Search with your own vector database; don't store critical data exclusively in Vector Stores

Minimal Manifest example (conceptual — check the official docs for the latest API):

from agents.sandbox import SandboxAgent, Manifest

# Define agent workspace, mounting your own S3 storage
manifest = Manifest(
    filesystem={
        "/data": {"type": "s3", "bucket": "my-bucket", "prefix": "agent-output/"}
    },
    env_vars={}  # Sensitive credentials stay in the harness, not in the manifest
)

agent = SandboxAgent(
    name="data-processor",
    instructions="Process data files in the /data directory",
    manifest=manifest
)

Note: the Manifest API is still evolving. Refer to the official sandbox docs for the latest information. The example above is conceptual to help you understand the mounting approach; check the latest docs for actual syntax.

The benefit: if you ever want to switch frameworks, your tracing data can be exported, storage is in your own hands, and the only thing you need to rewrite is the agent logic itself.

2026 Agent Framework Selection: Five Frameworks Compared

The 2026 agent framework landscape has shifted from "OpenAI vs open source" to "every cloud giant and AI lab has their own SDK." Based on Composio's framework comparison and hands-on experience:

Decision Criteria	OpenAI Agents SDK	Claude Agent SDK	AWS Strands	LangGraph	CrewAI
Learning curve	Low	Low (shell mindset)	Low (AWS users)	Medium-high	Low
Tracing integration	Built-in, zero config	Built-in	CloudWatch integration	Requires LangSmith	Self-built required
Security isolation	Harness/compute separation	Sandbox virtualization	IAM integration	Self-built required	Self-built required
Multi-agent	Handoff available, subagents in dev	Multi-agent sessions (beta)	Agent orchestration	Full DAG support	Mature role-based
Model flexibility	Conditional model-agnostic	Claude models only	Bedrock multi-model	Fully model-agnostic	Fully model-agnostic
MCP support	Yes, improving	Deepest integration	Limited	Community plugins	Community plugins
GitHub stars	26K+	Fast-growing	Growing	Mature	Active

Which should you pick?

Indie dev, want to ship an MVP fastest: OpenAI Agents SDK. Unified API + built-in tracing + low learning curve saves you from assembling infrastructure yourself
Already using Claude / Claude Code: Claude Agent SDK. Extracted from Claude Code's agent loop with the deepest MCP integration — its "give the agent a computer" philosophy (shell + filesystem + web) suits automation tasks well
AWS ecosystem user: AWS Strands. Deep Bedrock integration makes it the path of least resistance if your infrastructure runs on AWS
Need complex DAG workflows (branching logic, conditional loops, parallel execution): LangGraph. Most mature graph orchestration
Non-engineer building agents (PMs, product people): CrewAI's role-based DSL is the most intuitive

Risk Disclosure

Vendor lock-in risk: using hosted tools (File Search, Vector Stores, Code Interpreter) creates platform dependency. Plan a Manifest + self-owned storage hybrid architecture from the start
Cost risk: multi-step agent workflow token consumption is non-linear. Always set monthly spending caps and max_turns limits
Feature gap risk: TypeScript sandbox now available (beta), but code mode and subagents are still on the roadmap. Don't design architectures based on roadmap features
Security risk: harness/compute separation significantly improves security but doesn't mean zero risk. Still follow the principle of least privilege when configuring sandbox permissions

Pre-Launch Checklist: Agents SDK Production Checklist

Before pushing your agent side project to production, verify these 10 items:

Conclusion

The OpenAI Agents SDK has evolved from its April 2026 architectural overhaul to v0.17.5 in June, steadily lowering the barrier to AI Agent development. Harness/compute separation for security, built-in tracing, unified tool API — infrastructure that used to take weeks to build yourself now comes with a single pip install. The arrival of TypeScript sandbox support opens the door for even more frontend developers.

But the 2026 agent framework competition has intensified. Claude Agent SDK brings the deepest MCP integration, AWS Strands offers the lowest friction for Bedrock users. Choosing a framework is no longer just about features — it's about which model ecosystem and infrastructure you're already on.

If you're an indie developer on a $20-50/month budget, my recommendation is: start with the ecosystem closest to your model preference (OpenAI → Agents SDK, Claude → Claude Agent SDK, AWS → Strands), build the minimal version of your agent, test with E2B's free credits, switch to Modal for deployment once validated, and use your own storage from day one to avoid vendor lock-in. This path lets you validate your idea at minimum cost while preserving the freedom to switch frameworks later.

Now go pip install openai-agents and finally build that AI side project you've been thinking about.