Skip to main content

Overview

The Browserbase MCP server provides comprehensive tools for browser automation and session management. These tools allow you to perform actions like navigating pages, capturing screenshots, manipulating cookies, and managing multiple browser sessions simultaneously.

Core Browser Automation Tools

These are the primary tools for modern web automation using natural language commands.
Navigate to any URL in the browser
url
string
required
The URL to navigate to
Perform an action on the web page using natural language
action
string
required
The action to perform (e.g., “click the login button”, “fill form field”)
Extract all text content from the current page (filters out CSS and JavaScript)
No input parameters required
instruction
string
Extracted text content from the current page
Observe and find actionable elements on the web page
instruction
string
required
Specific instruction for observation (e.g., “find the login button”, “locate search form”)
Capture a PNG screenshot of the current page
No input parameters required
image
string
Base-64 encoded PNG data
Get the current URL of the browser page
No input parameters required
url
string
Complete URL including protocol, domain, path, and any query parameters or fragments

Session Management

Manage your browser session lifecycle with create and close operations.
Create or reuse a cloud browser session using Browserbase with fully initialized Stagehand
sessionId
string
Optional session ID to use/reuse. If not provided, creates new session
Close the current Browserbase session, disconnect the browser, and cleanup Stagehand instance
No input parameters required

Resources

Screenshot Resources

The server provides access to screenshot resources with URI-based access.example:
screenshot://screenshot-name-of-the-screenshot

Further Reading

I