
The AI Cloud: A unified platform for AI workloads

Product

The same principles and ease of use you expect from Vercel, now for your agentic applications.

The AI Cloud builds on the same foundation as the Frontend Cloud, extending its capabilities to support agentic workloads.

See what the AI Cloud can do

Hear Guillermo Rauch introduce the AI Cloud at Vercel Ship 2025.


A unified, self-driving platform

app/api/flights/route.ts
import { streamText, tool } from 'ai';
import { z } from 'zod';

export async function POST(req: Request) {
  const { prompt } = await req.json();

  const result = streamText({
    model: 'openai/gpt-4o', // This will access the model via AI Gateway
    prompt,
    tools: {
      weather: tool({
        description: 'Get the weather in a location',
        // AI SDK 5 names this field inputSchema; earlier versions used parameters
        inputSchema: z.object({
          location: z.string(),
        }),
        execute: async ({ location }) => {
          // Encode the location so spaces and punctuation are URL-safe
          const res = await fetch(
            `https://api.weatherapi.com/v1/current.json?q=${encodeURIComponent(location)}`
          );
          const data = await res.json();
          return { location, weather: data };
        },
      }),
    },
  });

  // Stream the generated text back to the client
  return result.toTextStreamResponse();
}

A sample AI API endpoint using the AI SDK and AI Gateway. It is structured like a traditional endpoint: it accepts a prompt from the frontend and streams a response back.
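On the frontend, a response like this can be consumed incrementally with standard web APIs. The following is a minimal sketch, assuming the endpoint streams plain text; the `readTextStream` and `askAgent` helpers and the `/api/flights` path (inferred from the filename above) are illustrative names, not part of the AI SDK.

```typescript
// Collects a streamed text body chunk by chunk, calling onChunk as text arrives.
export async function readTextStream(
  response: Response,
  onChunk: (text: string) => void = () => {},
): Promise<string> {
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  if (!response.body) {
    throw new Error('Response has no body to stream');
  }

  const reader = response.body.getReader();
  const decoder = new TextDecoder();
  let full = '';

  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    // stream: true keeps multi-byte characters intact across chunk boundaries
    const text = decoder.decode(value, { stream: true });
    full += text;
    onChunk(text);
  }

  // Flush any bytes the decoder is still holding
  const tail = decoder.decode();
  if (tail) {
    full += tail;
    onChunk(tail);
  }
  return full;
}

// Hypothetical usage: the route path is an assumption based on the filename above.
export async function askAgent(prompt: string): Promise<string> {
  const res = await fetch('/api/flights', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ prompt }),
  });
  return readTextStream(res);
}
```

Passing an `onChunk` callback lets a UI render partial output as it arrives, while the returned promise resolves with the full text once the stream closes.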

AI Cloud compute

This initial prompt execution and reasoning run on Vercel Functions backed by Fluid compute.

Tool execution

Simplified MCP server support

MCP Server with Next.js

Get started building your first MCP server on Vercel.


Offloading tasks to the background

Tool calls can be thought of as functions that can be performed within the same function invocation as the agent itself, performed remotely on an MCP server, or reliably offloaded to a separate function with Vercel Queues.
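The three execution paths can be sketched as a small dispatcher. This is an illustrative sketch only: the `ToolRoute` type, the `Transports` interface, and its `callMcpTool` and `enqueue` methods are hypothetical stand-ins injected by the caller, not the MCP or Vercel Queues APIs.

```typescript
type ToolArgs = Record<string, unknown>;

// One variant per execution path described above.
type ToolRoute =
  | { mode: 'inline'; run: (args: ToolArgs) => Promise<unknown> }
  | { mode: 'mcp'; serverUrl: string; toolName: string }
  | { mode: 'queue'; topic: string };

// Hypothetical transports, injected so the sketch stays independent of any SDK.
interface Transports {
  callMcpTool(serverUrl: string, toolName: string, args: ToolArgs): Promise<unknown>;
  enqueue(topic: string, payload: ToolArgs): Promise<{ messageId: string }>;
}

export async function dispatchToolCall(
  route: ToolRoute,
  args: ToolArgs,
  transports: Transports,
): Promise<unknown> {
  switch (route.mode) {
    case 'inline':
      // Runs inside the same function invocation as the agent
      return route.run(args);
    case 'mcp':
      // Runs remotely on an MCP server
      return transports.callMcpTool(route.serverUrl, route.toolName, args);
    case 'queue': {
      // Hands the work off; the agent gets an acknowledgement, not the result
      const { messageId } = await transports.enqueue(route.topic, args);
      return { status: 'queued', messageId };
    }
  }
}
```

Modeling the route as a discriminated union keeps the choice of execution path in data, so the same tool can move from inline to background execution without changing the agent loop.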

Secure execution with Vercel Sandbox

import ms from 'ms';
import { Sandbox } from '@vercel/sandbox';

const sandbox = await Sandbox.create({
  source: {
    url: "https://github.com/user/code-repo.git",
    type: "git",
  },
  runtime: "node22",
  timeout: ms("2m"),
});

The Sandbox SDK keeps this code simple, giving you control over initial creation, updates, and termination.
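Because termination is part of the lifecycle, one common pattern is to wrap sandbox use so that cleanup always runs, even when the work inside throws. The sketch below is written against a minimal structural interface rather than the SDK itself; the `withSandbox` helper is hypothetical, and the `stop()` method name is an assumption to check against the Sandbox SDK reference.

```typescript
// Assumed minimal shape: anything that can be shut down asynchronously.
interface Stoppable {
  stop(): Promise<unknown>;
}

// Creates a sandbox, runs the callback, and always stops the sandbox afterwards.
export async function withSandbox<S extends Stoppable, T>(
  create: () => Promise<S>,
  use: (sandbox: S) => Promise<T>,
): Promise<T> {
  const sandbox = await create();
  try {
    return await use(sandbox);
  } finally {
    // Runs on success and on failure, so no sandbox outlives its task
    await sandbox.stop();
  }
}
```

With this shape, `create` would wrap the `Sandbox.create` call shown above, and `use` would hold whatever untrusted code execution the agent needs.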

Observability into agentic workloads

End-to-end visibility into agent behavior, from outputs to human feedback, is key to reducing errors and improving performance and cost.
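As one small piece of that visibility, each tool call can be wrapped so its duration and outcome are recorded. This is a minimal sketch, not Vercel's observability API: the `ToolTrace` shape, the `traced` helper, and the `sink` callback are hypothetical names, and the sink could forward to whatever logging or tracing backend is in use.

```typescript
export interface ToolTrace {
  tool: string;
  durationMs: number;
  ok: boolean;
  error?: string;
}

// Wraps an async tool function and reports one trace record per call.
export function traced<A, R>(
  tool: string,
  fn: (args: A) => Promise<R>,
  sink: (trace: ToolTrace) => void,
): (args: A) => Promise<R> {
  return async (args: A): Promise<R> => {
    const start = Date.now();
    try {
      const result = await fn(args);
      sink({ tool, durationMs: Date.now() - start, ok: true });
      return result;
    } catch (err) {
      const message = err instanceof Error ? err.message : String(err);
      sink({ tool, durationMs: Date.now() - start, ok: false, error: message });
      // Re-throw so the caller's error handling is unchanged
      throw err;
    }
  };
}
```

The wrapper leaves the tool's return value and errors untouched, so it can be added to or removed from a tool without affecting agent behavior.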

Securing high-value, critical routes

Get started with Vercel BotID

Detect and stop advanced bots before they reach your most sensitive routes like login, checkout, AI agents, and APIs. Easy to implement, hard to bypass.


The AI Cloud, powered by Vercel

Agentic applications follow a decades-long web transformation from purely static sites, to highly dynamic, to now agentic. We're in the early stages of this new era.
