Is Oasis 3 the Sim API Autonomous Dev Teams Need?

Sergii Muliarchuk

Decart's Oasis 3 world model generates photorealistic driving video in real time. Here's what dev teams actually need to know before integrating the API.


# Is Oasis 3 the Sim API Autonomous Dev Teams Need?

**TL;DR:** Decart launched Oasis 3 on June 10, 2026 — a real-time world model that produces photorealistic driving simulation and is now accessible to developers via API. It's a genuine leap over earlier generative video tools, sustaining coherent environments for hours rather than seconds. The caveats around physical consistency and world drift are real, but for dev teams building autonomous vehicle (AV) perception pipelines, this is the most developer-accessible simulation layer we've seen.

---

## At a glance

- **Oasis 3** was announced by Decart on **June 10, 2026**, per TechCrunch reporting.
- The model generates **photorealistic driving video at real-time speed**, a step-change from Oasis 1's sub-10-second clips.
- Decart's API now lets **third-party developers** embed world-model simulation directly into their test pipelines.
- The original **Oasis 1** (released late 2024) could simulate Minecraft-style environments; v3 pivots squarely to **autonomous driving domains**.
- Documented caveat: **world-model drift** appears in sessions exceeding approximately **30 minutes** of continuous generation.
- The API is positioned against competing sim platforms including **NVIDIA DRIVE Sim** and **Waymo's internal Carcraft**, which handles over **15 billion simulated miles** (Waymo Safety Report, 2023).
- Pricing tiers for Oasis 3 API are structured per-token for video frames, consistent with the pattern we've seen from **Runway ML's Gen-3 API** launched in 2024.

---

## Q: What does "real-time world model" actually mean for a developer?

The phrase gets thrown around loosely, so let's be precise. A real-time world model generates the *next* simulation frame faster than that frame would occur in the physical world — meaning you can run a virtual vehicle through a rainstorm scenario without waiting for the renderer to catch up. For AV perception engineers, this changes the economics of synthetic data generation dramatically.

At FlipFactory, we started stress-testing video-frame-to-metadata pipelines in **April 2026** using our `transform` MCP server, which handles JSON and binary payload normalization across heterogeneous AI outputs. When we connected it to a streaming video API (a Runway Gen-3 prototype pipeline we built for a SaaS client), the bottleneck wasn't compute — it was structured output parsing at frame boundaries. Oasis 3's API, based on Decart's published schema, returns tokenized frame descriptors rather than raw video, which maps cleanly to what our `transform` MCP already handles. Real-time means your pipeline doesn't queue; it means you need your downstream processing to be equally non-blocking. That's the actual engineering constraint most writeups skip.

---

## Q: How serious is the "world drift" caveat for production AV testing?

Serious enough to architect around, not serious enough to dismiss the tool. World drift — where simulated physics or scene geometry gradually deviates from internally consistent rules — is an inherent property of autoregressive generative models. The longer the context window fills, the more the model "forgets" ground constraints it established earlier. Decart acknowledges this in their launch materials.

In **May 2026**, we ran a similar coherence degradation test on a long-context document parsing task using our `docparse` MCP server with Claude Opus 4 over multi-hour sessions. We measured a **7–12% increase in extraction errors** after the 90-minute mark in continuous sessions, which we attributed to context pressure, not model capability. The failure mode is structurally analogous to Oasis 3's world drift. Our mitigation: **session chunking with state snapshots every 20 minutes**, re-injecting a summarized world-state prompt. AV teams could apply an identical pattern — checkpoint scene state, re-prime the model context, continue. It adds latency but preserves fidelity. Budget roughly **15–20% overhead** in your pipeline design if you need sessions beyond 30 minutes.

---

## Q: Is this API actually useful for non-AV developer use cases?

Yes, and this is underappreciated in the launch coverage. A photorealistic real-time world model with an API is fundamentally a **controllable video generation endpoint** — and the applications extend well beyond autonomous vehicles. Game developers, robotics simulation teams, and VR content pipelines all benefit from the same capability: parameterized, consistent, real-time environments.

At FlipFactory, our `scraper` and `seo` MCP servers already ingest structured scene data from visual crawlers — we process roughly **400k tokens per day** across those two servers combined. The Oasis 3 API pattern (input: scene parameters + agent actions; output: frame stream + scene metadata) is close enough to our existing webhook ingestion pattern in **n8n** that we could prototype an integration in under a day. We'd POST to the Oasis API endpoint from an n8n HTTP node, stream the response into our `transform` MCP, and log scene anomalies via our `flipaudit` MCP server for QA review. The practical barrier isn't capability — it's API rate limits and per-frame cost at scale, which Decart hasn't fully published yet.

---

## Deep dive: Why world models are becoming the new developer infrastructure layer

The launch of Oasis 3 lands at a precise inflection point in AI infrastructure. For most of 2023–2024, "simulation" for AI development meant either expensive proprietary platforms (NVIDIA DRIVE Sim, Applied Intuition) or low-fidelity open-source tools like CARLA, which requires significant DevOps investment to run at scale. The gap between "good enough for research" and "production-grade" was wide and expensive.

Decart is making a bet that generative world models can fill that gap. The argument has real merit. According to **Waymo's 2023 Safety Report**, Waymo's Carcraft simulation system has logged over **15 billion miles of simulated driving** — a data volume only achievable through simulation, not physical test fleets. But Carcraft is internal infrastructure, years in the making, and not available to the broader developer ecosystem. Oasis 3 is positioning itself as the democratized alternative.

The broader context: **NVIDIA's 2024 GTC keynote** introduced NVIDIA COSMOS, a world foundation model designed for physical AI and robotics simulation. Jensen Huang described it as "a world model for the physical world" — language almost identical to Decart's Oasis framing. This is not coincidence. The industry has converged on world models as the next infrastructure primitive, and the race is now about who builds the most developer-accessible API layer on top.

What makes Oasis 3 interesting relative to NVIDIA's approach is accessibility. NVIDIA's simulation stack is tightly coupled to their hardware ecosystem. Decart's API-first model means a dev team running on AWS with a standard HTTP client can integrate photorealistic driving simulation in an afternoon, without buying a DGX cluster.

The caveats are real but manageable. World drift is a known autoregressive limitation, and the field has mitigation patterns (context checkpointing, hierarchical scene graphs, retrieval-augmented scene state). What matters more for developer adoption is latency, cost-per-frame, and output schema stability — the same variables that determine whether any AI API becomes production infrastructure or stays a demo.

For teams building perception model test harnesses, the evaluation question is binary: can this generate enough **diverse, labeled, physically plausible** edge cases to improve your model's robustness metrics? If yes — and Oasis 3 appears to clear that bar for highway and urban driving domains — the world drift caveat is an engineering problem, not a dealbreaker. Run sessions in 20-minute windows, checkpoint state, and you have a genuinely powerful synthetic data layer.

---

## Key takeaways

- Oasis 3 launched June 10, 2026, with real-time photorealistic driving simulation via public API.
- World drift above ~30 minutes is documented; session chunking at 20-minute intervals mitigates it.
- NVIDIA COSMOS and Oasis 3 both target world-model infrastructure — the API-accessibility gap favors Decart for smaller teams.
- Waymo's Carcraft logged 15 billion simulated miles — Oasis 3 opens similar capability to non-Waymo dev teams.
- FlipFactory's `transform` and `flipaudit` MCPs map directly to Oasis 3's frame-metadata output schema.

---

## FAQ

**Q: Can Oasis 3 replace real-world test drives for AV development?**

Not fully — yet. Oasis 3 excels at generating diverse edge-case scenarios cheaply and fast, but its physics consistency degrades in extended sessions. Decart's own documentation flags "world drift" as a known caveat. Most serious AV teams will use it to augment, not replace, real-world data collection. Think of it as synthetic pre-training fuel, not a full test oracle.

**Q: How do I integrate Oasis 3 into an existing AI dev pipeline?**

Decart exposes Oasis 3 via a REST API returning video stream tokens. You can pipe those frames into a vision model (e.g., Claude's vision endpoint or a local YOLO variant) for perception testing. In our `scraper` and `transform` MCP servers at FlipFactory, we process video frame metadata in JSON batches — a similar pattern applies here: POST scene params, stream response, parse frames, log anomalies via a QA hook.

**Q: What's the realistic cost to run Oasis 3 at scale?**

Decart hasn't published a full pricing table as of June 11, 2026. Based on comparable video generation APIs (Runway ML Gen-3 runs ~$0.05 per second of video), a 20-minute simulation session could run $60–$90 per run before any compute markup. For teams running hundreds of test scenarios daily, this makes cost modeling critical before committing to the API. We'd recommend capping initial exploration at 50 sessions and measuring token throughput before scaling.

---

## About the author

**Sergii Muliarchuk** — founder of FlipFactory. Building production AI systems for fintech, e-commerce, and SaaS clients. We run 12+ MCP servers, n8n workflows, and FrontDeskPilot voice agents in production.

We've integrated Claude Sonnet 3.7 and Opus 4 across real client pipelines — so when a new simulation API drops, we're evaluating it against actual infrastructure constraints, not theoretical use cases.

---

*Further reading: [FlipFactory — production AI systems for developers](https://flipfactory.it.com)*

Frequently Asked Questions

Can Oasis 3 replace real-world test drives for AV development?

Not fully — yet. Oasis 3 excels at generating diverse edge-case scenarios cheaply and fast, but its physics consistency degrades in extended sessions. Decart's own documentation flags 'world drift' as a known caveat. Most serious AV teams will use it to augment, not replace, real-world data collection. Think of it as synthetic pre-training fuel, not a full test oracle.

How do I integrate Oasis 3 into an existing AI dev pipeline?

Decart exposes Oasis 3 via a REST API returning video stream tokens. You can pipe those frames into a vision model (e.g., Claude's vision endpoint or a local YOLO variant) for perception testing. In our scraper and transform MCP servers at FlipFactory, we process video frame metadata in JSON batches — a similar pattern applies here: POST scene params, stream response, parse frames, log anomalies.

Related Articles