Synchronous Call

Synchronous AI Pattern

Standard request-response blocking HTTP requests. Simplest to integrate, providing the absolute lowest time-to-first-token.

Compare architectures

Recommended

Event-driven async

Real-time updates via WebSocket. Backend callbacks for processing. Best for interactive apps.

Learn more

Async (Dual-Delivery)

Fire and forget. Results delivered simultaneously to your webhook and browser. Best for reliable background ops.

Learn more

Sync

Simple request/response. Blocks until complete. Best for quick operations.

Learn more

Sync request flow

Simple blocking request/response

Your backend

Sync

ModelRiver

AI providers

Architecture

Why sync requests?

Direct linear flows for the absolute fastest time-to-first-token. Eliminate all external state while remaining firmly anchored in our automated failover routing.

Fast Acknowledgment

Immediate response with a secure channel ID. No open connection holding, fundamentally eliminating gateway timeouts on slow generations.

Background Execution

AI runs asynchronously in isolated worker pools. Your backend remains lightning fast and completely insulated from thread exhaustion.

Automatic Fallback

Seamless multi-provider failover baked deeply into the pipeline. Achieve five-nines orchestration without altering any of your client architecture.

Instant Delivery

Real-time WebSocket conduits autonomously push the final streaming payloads directly back to the active client layer.

Ready to start?

Start building View documentation

AI infrastructure

Platform performance

Cost & insights

Developer tools

Data intelligence

Developer ecosystem

Framework integrations

Client SDKs

Command line

System status

Learning & support

Platform guides

Technical references

Operations & DevOps

Help & community

Synchronous AI Pattern

Event-driven async

Async (Dual-Delivery)

Sync

Sync request flow

Why sync requests?

Fast Acknowledgment

Background Execution

Automatic Fallback

Instant Delivery

Ready to start?