> ## Documentation Index > Fetch the complete documentation index at: https://docs.stagehand.dev/llms.txt > Use this file to discover all available pages before exploring further. # agent() > Complete API reference for the agent() method export const V3Banner = () => null; See how to use agent() to create autonomous AI agents for multi-step browser workflows ### Agent Creation ```typescript theme={null} // Create agent instance const agent = stagehand.agent(config?: AgentConfig): AgentInstance ``` **AgentConfig Interface:** ```typescript theme={null} interface AgentConfig { systemPrompt?: string; integrations?: (Client | string)[]; tools?: ToolSet; /** @deprecated Use `mode: "cua"` instead */ cua?: boolean; model?: string | AgentModelConfig; executionModel?: string | AgentModelConfig; stream?: boolean; // Enable streaming mode (experimental) mode?: "dom" | "hybrid" | "cua"; // Tool mode } // AgentModelConfig for advanced configuration type AgentModelConfig = { modelName: TModelName; } & Record; ``` **AgentInstance Interface:** ```typescript theme={null} interface AgentInstance { execute: (instructionOrOptions: string | AgentExecuteOptions) => Promise; } ``` ### Agent Configuration Custom system prompt to provide to the agent. Overrides the default system prompt and defines agent behavior. The model to use for agent functionality. Can be either: * A string in the format `"provider/model"` (e.g., `"openai/computer-use-preview"`, `"anthropic/claude-sonnet-4-6"`) * An object with `modelName` and additional provider-specific options **Available CUA Models:** * `"anthropic/claude-haiku-4-5-20251001"` * `"anthropic/claude-sonnet-4-6"` * `"anthropic/claude-sonnet-4-5-20250929"` * `"anthropic/claude-opus-4-5-20251101"` * `"anthropic/claude-opus-4-6"` * `"anthropic/claude-opus-4-8"` * `"google/gemini-2.5-computer-use-preview-10-2025"` * `"google/gemini-3-flash-preview"` * `"google/gemini-3-pro-preview"` * `"microsoft/fara-7b"` * `"openai/computer-use-preview"` * `"openai/computer-use-preview-2025-03-11"` The model name Additional provider-specific options (e.g., `apiKey`, `baseURL`) The model to use for tool execution (observe/act calls within agent tools). If not specified, inherits from the main model configuration. **Format:** `"provider/model"` (e.g., `"openai/gpt-4o-mini"`, `"google/gemini-2.0-flash-exp"`) **Deprecated:** Use `mode: "cua"` instead. This option will be removed in a future version. Indicates whether Computer Use Agent (CUA) mode is enabled. When false, the agent uses standard tool-based operation instead of computer control. MCP (Model Context Protocol) integrations for external tools and services. **Array of:** MCP server URLs (strings) or connected Client objects Custom tool definitions to extend agent capabilities using the AI SDK ToolSet format. Enable streaming mode for the agent. When `true`, `execute()` returns `AgentStreamResult` with `textStream` for incremental output. When `false` (default), `execute()` returns `AgentResult` after completion. **Default:** `false` **Non-CUA agents only.** Requires `experimental: true`. Not available when `mode: "cua"`. Tool mode for the agent. Determines which set of tools are available to the agent. **Modes:** * `"dom"`: Uses DOM based tools (`act`, `fillForm`) for structured page interactions. Works with any model. * `"hybrid"`: Uses both DOM-based and coordinate-based tools (`act`, `click`, `type`, `dragAndDrop`, `clickAndHold`, `fillForm`) for visual/screenshot-based interactions. Requires models with reliable coordinate-based action capabilities. * `"cua"`: Uses Computer Use Agent (CUA) providers like Anthropic Claude, Google Gemini, or OpenAI for screenshot-based automation. This is the preferred way to enable CUA mode (replaces the deprecated `cua: true` option). **Default:** When unspecified, the mode is auto-resolved based on the model — `"hybrid"` for hybrid-capable models (Claude, GPT-5.4, Gemini 3) and `"dom"` for everything else. **Hybrid Mode Model Requirements:** Only use hybrid mode with models that can reliably perform coordinate-based actions: * **Anthropic:** any Claude model (e.g. `anthropic/claude-sonnet-4-20250514`, `anthropic/claude-sonnet-4-5-20250929`, `anthropic/claude-haiku-4-5-20251001`) * **OpenAI:** `openai/gpt-5.4`, `openai/gpt-5.4-mini` * **Google:** `google/gemini-3-flash-preview`, `google/gemini-3.1-flash-lite-preview`, `google/gemini-3.1-pro-preview` Requires `experimental: true` in Stagehand constructor. ### Execute Method ```typescript theme={null} // String instruction await agent.execute(instruction: string): Promise // With options await agent.execute(options: AgentExecuteOptions): Promise ``` **AgentExecuteOptions Interface:** ```typescript theme={null} interface AgentExecuteOptions { instruction: string; maxSteps?: number; page?: PlaywrightPage | PuppeteerPage | PatchrightPage | Page; highlightCursor?: boolean; messages?: ModelMessage[]; // Continue from previous conversation (experimental) signal?: AbortSignal; // Cancel execution (experimental) excludeTools?: string[]; // Tools to exclude from this execution (experimental) output?: ZodObject; // Zod schema for structured output (experimental) callbacks?: AgentExecuteCallbacks; } interface AgentExecuteCallbacks { prepareStep?: PrepareStepFunction; onStepFinish?: GenerateTextOnStepFinishCallback; } ``` ```typescript theme={null} // With stream: true in AgentConfig await agent.execute(options: AgentStreamExecuteOptions): Promise ``` **AgentStreamExecuteOptions Interface:** ```typescript theme={null} interface AgentStreamExecuteOptions { instruction: string; maxSteps?: number; page?: PlaywrightPage | PuppeteerPage | PatchrightPage | Page; highlightCursor?: boolean; messages?: ModelMessage[]; // Continue from previous conversation (experimental) signal?: AbortSignal; // Cancel execution (experimental) excludeTools?: string[]; // Tools to exclude from this execution (experimental) output?: ZodObject; // Zod schema for structured output (experimental) callbacks?: AgentStreamCallbacks; } interface AgentStreamCallbacks { prepareStep?: PrepareStepFunction; onStepFinish?: StreamTextOnStepFinishCallback; onChunk?: StreamTextOnChunkCallback; onFinish?: StreamTextOnFinishCallback; onError?: StreamTextOnErrorCallback; onAbort?: (event: { steps: Array> }) => void | Promise; } ``` ### Execute Parameters High-level task description in natural language. Maximum number of actions the agent can take before stopping. **Default:** `20` Optional: Specify which page to perform the agent execution on. Supports multiple browser automation libraries: * **Playwright**: Native Playwright Page objects * **Puppeteer**: Puppeteer Page objects * **Patchright**: Patchright Page objects * **Stagehand Page**: Stagehand's wrapped Page object If not specified, defaults to the current "active" page in your Stagehand instance. Whether to show a visual cursor on the page during agent execution. Useful for debugging and demonstrations. **Default:** `false` Previous conversation messages to continue from. Pass the `messages` from a previous `AgentResult` to continue that conversation. **Non-CUA agents only.** Requires `experimental: true`. Not available when `mode: "cua"`. An `AbortSignal` that can be used to cancel the agent execution. When aborted, the agent will stop and throw an `AgentAbortError`. **Non-CUA agents only.** Requires `experimental: true`. Not available when `mode: "cua"`. Tools to exclude from this execution. Pass an array of tool names to prevent the agent from using those tools. **Available tools by mode:** **DOM mode:** `act`, `fillForm`, `ariaTree`, `extract`, `goto`, `scroll`, `keys`, `navback`, `screenshot`, `think`, `wait`, `search` **Hybrid mode:** `click`, `type`, `dragAndDrop`, `clickAndHold`, `fillFormVision`, `act`, `ariaTree`, `extract`, `goto`, `scroll`, `keys`, `navback`, `screenshot`, `think`, `wait`, `search` **Non-CUA agents only.** Requires `experimental: true`. Not available when `cua: true`. A Zod schema defining structured output data to return when the task completes. The agent will populate this data based on the information it gathered during execution. The result will be available in `AgentResult.output`. **Non-CUA agents only.** Requires `experimental: true`. Not available when `mode: "cua"`. ```typescript theme={null} import { z } from "zod"; const result = await agent.execute({ instruction: "Find the cheapest flight from NYC to LA", output: z.object({ price: z.string().describe("The price of the flight"), airline: z.string().describe("The airline name"), departureTime: z.string().describe("Departure time"), }), }); console.log(result.output); // { price: "$199", airline: "Delta", departureTime: "8:00 AM" } ``` Callbacks to hook into the agent's execution lifecycle. The available callbacks depend on whether streaming is enabled. **Non-CUA agents only.** Requires `experimental: true`. Not available when `mode: "cua"`. Called before each step to modify settings. You can change the model, tool choices, active tools, system prompt, and input messages for each step. Called when each step (LLM call) completes. Provides access to tool calls, reasoning, and step results. Called before each step to modify settings. Called when each step completes during streaming. Called for each chunk of the stream. Stream processing will pause until the callback promise resolves. Called when the stream finishes successfully. Called when an error occurs during streaming. Called when the stream is aborted via the `signal` option. ### Response **Returns:** `Promise` (non-streaming) or `Promise` (streaming) **AgentResult Interface:** ```typescript theme={null} interface AgentResult { success: boolean; message: string; actions: AgentAction[]; completed: boolean; metadata?: Record; messages?: ModelMessage[]; // Conversation history for continuation (experimental) output?: Record; // Structured output data (experimental) usage?: { input_tokens: number; output_tokens: number; reasoning_tokens?: number; cached_input_tokens?: number; inference_time_ms: number; }; } // AgentAction can contain various tool-specific fields interface AgentAction { type: string; reasoning?: string; taskCompleted?: boolean; action?: string; timeMs?: number; // wait tool pageText?: string; // ariaTree tool pageUrl?: string; // ariaTree tool instruction?: string; // various tools timestamp?: number; // Action timestamp [key: string]: unknown; // Additional tool-specific fields } ``` **AgentStreamResult Interface:** ```typescript theme={null} interface AgentStreamResult { // Async iterable of text chunks for incremental output textStream: AsyncIterable; // Async iterable of all stream events (tool calls, messages, etc.) fullStream: AsyncIterable; // Promise that resolves to the final AgentResult when streaming completes result: Promise; // Additional properties from StreamTextResult // See Vercel AI SDK documentation for full details } ``` Whether the task was completed successfully. Description of the execution result and status. Array of individual actions taken during execution. Each action contains tool-specific data. Whether the agent believes the task is fully complete. Additional execution metadata and debugging information. The conversation messages from this execution. Pass these to a subsequent `execute()` call via the `messages` option to continue the conversation. **Non-CUA agents only.** Requires `experimental: true`. Custom structured output data extracted based on the `output` Zod schema provided in execute options. Only populated if an `output` schema was provided. **Non-CUA agents only.** Requires `experimental: true`. Token usage and performance metrics. Number of input tokens used Number of output tokens generated Number of reasoning tokens (if supported by the model) Number of cached input tokens (if supported by the model) Total inference time in milliseconds ### Example Response ```json theme={null} { "success": true, "message": "Task completed successfully", "actions": [ { "type": "act", "instruction": "click the submit button", "reasoning": "User requested to submit the form", "taskCompleted": false }, { "type": "observe", "instruction": "check if submission was successful", "taskCompleted": true } ], "completed": true, "metadata": { "steps_taken": 2 }, "output": { "price": "$199", "airline": "Delta", "departureTime": "8:00 AM" }, "usage": { "input_tokens": 1250, "output_tokens": 340, "reasoning_tokens": 42, "cached_input_tokens": 0, "inference_time_ms": 2500 } } ``` ### Code Examples ```typescript theme={null} import { Stagehand } from "@browserbasehq/stagehand"; // Initialize with Browserbase (API key from environment variable) // Set BROWSERBASE_API_KEY in your environment const stagehand = new Stagehand({ env: "BROWSERBASE", model: "anthropic/claude-sonnet-4-6" }); await stagehand.init(); const page = stagehand.context.pages()[0]; // Create agent with default configuration const agent = stagehand.agent(); // Navigate to a page await page.goto("https://www.google.com"); // Execute a task const result = await agent.execute("Search for 'Stagehand automation' and click the first result"); console.log(result.message); console.log(`Completed: ${result.completed}`); console.log(`Actions taken: ${result.actions.length}`); ``` ```typescript theme={null} // Create agent with custom model and system prompt const agent = stagehand.agent({ model: "openai/computer-use-preview", systemPrompt: "You are a helpful assistant that can navigate websites efficiently. Always verify actions before proceeding.", executionModel: "openai/gpt-4o-mini" // Use faster model for tool execution }); const page = stagehand.context.pages()[0]; await page.goto("https://example.com"); const result = await agent.execute({ instruction: "Fill out the contact form with test data", maxSteps: 10, highlightCursor: true }); ``` ```typescript theme={null} // Using AgentModelConfig for advanced configuration const agent = stagehand.agent({ model: { modelName: "anthropic/claude-sonnet-4-6", apiKey: process.env.ANTHROPIC_API_KEY, baseURL: "https://custom-proxy.com/v1" } }); const result = await agent.execute("Complete the checkout process"); ``` ```typescript theme={null} const page1 = stagehand.context.pages()[0]; const page2 = await stagehand.context.newPage(); const agent = stagehand.agent(); // Execute on specific page await page2.goto("https://example.com/dashboard"); const result = await agent.execute({ instruction: "Export the data table", page: page2 }); ``` ```typescript theme={null} const stagehand = new Stagehand({ env: "LOCAL", experimental: true, // Required for streaming }); await stagehand.init(); const page = stagehand.context.pages()[0]; await page.goto("https://amazon.com"); // Create a streaming agent const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-5-20250929", stream: true, }); const streamResult = await agent.execute({ instruction: "Search for headphones and find the best deal", maxSteps: 20, }); // Stream text output incrementally for await (const delta of streamResult.textStream) { process.stdout.write(delta); } // Get the final result const finalResult = await streamResult.result; console.log("Completed:", finalResult.completed); ``` ```typescript theme={null} const stagehand = new Stagehand({ env: "LOCAL", experimental: true, }); await stagehand.init(); const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-5-20250929", }); const page = stagehand.context.pages()[0]; await page.goto("https://example.com"); const result = await agent.execute({ instruction: "Fill out the contact form", maxSteps: 10, callbacks: { prepareStep: async (stepContext) => { console.log(`Starting step ${stepContext.stepNumber}`); return stepContext; }, onStepFinish: async (event) => { console.log(`Step finished: ${event.finishReason}`); if (event.toolCalls) { for (const tc of event.toolCalls) { console.log(`Tool: ${tc.toolName}`, tc.input); } } }, }, }); ``` ```typescript theme={null} const stagehand = new Stagehand({ env: "LOCAL", experimental: true, }); await stagehand.init(); const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-5-20250929", }); const page = stagehand.context.pages()[0]; await page.goto("https://example.com"); const controller = new AbortController(); // Abort after 30 seconds setTimeout(() => controller.abort("Timeout exceeded"), 30000); try { const result = await agent.execute({ instruction: "Complete a complex multi-step workflow", maxSteps: 50, signal: controller.signal, }); } catch (error) { if (error.name === "AgentAbortError") { console.log("Task cancelled:", error.message); } } ``` ```typescript theme={null} const stagehand = new Stagehand({ env: "LOCAL", experimental: true, }); await stagehand.init(); const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-5-20250929", }); const page = stagehand.context.pages()[0]; await page.goto("https://example.com/shop"); // First execution const firstResult = await agent.execute({ instruction: "Search for laptops and list the top 3 options", maxSteps: 10, }); // Continue conversation with context from first run const secondResult = await agent.execute({ instruction: "Filter those results by price under $1000", maxSteps: 10, messages: firstResult.messages, // Pass previous messages }); // Chain further with accumulated context const thirdResult = await agent.execute({ instruction: "Add the best-rated one to cart", maxSteps: 10, messages: secondResult.messages, }); console.log("Final:", thirdResult.message); ``` ```typescript theme={null} import { Client } from "@modelcontextprotocol/sdk/client/index.js"; // Create agent with MCP integrations const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-6", integrations: [ "https://mcp-server.example.com", // MCP server URL mcpClientInstance // Or pre-connected Client object ] }); const result = await agent.execute("Use the external tool to process this data"); ``` ```typescript theme={null} const stagehand = new Stagehand({ env: "LOCAL", experimental: true, // Required for excludeTools }); await stagehand.init(); const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-5-20250929", }); const page = stagehand.context.pages()[0]; await page.goto("https://example.com"); // Exclude specific tools from this execution const result = await agent.execute({ instruction: "Navigate the page and click buttons", maxSteps: 15, excludeTools: ["screenshot", "extract", "search"], }); console.log("Completed:", result.completed); ``` ```typescript theme={null} const stagehand = new Stagehand({ env: "LOCAL", experimental: true, // Required for hybrid mode }); await stagehand.init(); // Create agent with hybrid mode for coordinate-based interactions const agent = stagehand.agent({ mode: "hybrid", model: "google/gemini-3-flash-preview", // Use a model that supports coordinate-based actions }); const page = stagehand.context.pages()[0]; await page.goto("https://example.com/form"); const result = await agent.execute({ instruction: "Fill out the registration form with test data", maxSteps: 15, highlightCursor: true, // Enabled by default in hybrid mode }); console.log("Completed:", result.completed); ``` ```typescript theme={null} import { z } from "zod"; const stagehand = new Stagehand({ env: "LOCAL", experimental: true, // Required for structured output }); await stagehand.init(); const agent = stagehand.agent({ model: "anthropic/claude-sonnet-4-5-20250929", }); const page = stagehand.context.pages()[0]; await page.goto("https://www.google.com/flights"); // Define output schema to receive structured data const result = await agent.execute({ instruction: "Find the cheapest flight from NYC to LA for next week", maxSteps: 20, output: z.object({ price: z.string().describe("The price of the flight"), airline: z.string().describe("The airline name"), departureTime: z.string().describe("Departure time"), arrivalTime: z.string().describe("Arrival time"), flightNumber: z.string().optional().describe("Flight number if available"), }), }); // Access the structured output console.log("Flight found:"); console.log(` Price: ${result.output?.price}`); console.log(` Airline: ${result.output?.airline}`); console.log(` Departure: ${result.output?.departureTime}`); console.log(` Arrival: ${result.output?.arrivalTime}`); ``` ```typescript theme={null} import { tool } from "@browserbasehq/stagehand"; import { z } from "zod"; // Define custom tools using the tool helper from @browserbasehq/stagehand const customTools = { calculateTotal: tool({ description: "Calculate the total of items in cart", parameters: z.object({ items: z.array(z.object({ price: z.number(), quantity: z.number() })) }), execute: async ({ items }) => { const total = items.reduce((sum, item) => sum + (item.price * item.quantity), 0); return { total }; } }) }; const agent = stagehand.agent({ model: "openai/computer-use-preview", tools: customTools }); const result = await agent.execute("Calculate the total cost of items in the shopping cart"); ``` ### Error Types The following errors may be thrown by the `agent()` method: * **StagehandError** - Base class for all Stagehand-specific errors * **StagehandInitError** - Agent was not properly initialized * **MissingLLMConfigurationError** - No LLM API key or client configured * **UnsupportedModelError** - The specified model is not supported for agent functionality * **UnsupportedModelProviderError** - The specified model provider is not supported * **InvalidAISDKModelFormatError** - Model string does not follow the required `provider/model` format * **MCPConnectionError** - Failed to connect to MCP server * **StagehandDefaultError** - General execution error with detailed message * **AgentAbortError** - Thrown when agent execution is cancelled via an `AbortSignal` * **StreamingCallbacksInNonStreamingModeError** - Thrown when streaming-only callbacks (`onChunk`, `onFinish`, `onError`, `onAbort`) are used without `stream: true` * **ExperimentalNotConfiguredError** - Thrown when experimental features (callbacks, signal, messages, streaming) are used without `experimental: true` in Stagehand constructor