Navigate to a specified URL in the browser
Click on an element in the webpage
Fill a form field with text
Take a screenshot of the current page or a specific element
Extract structured data from the webpage
Execute custom JavaScript in the browser context
Perform an action described in natural language (Stagehand MCP)
Browserbase provides powerful browser automation capabilities through the Model Context Protocol (MCP). It combines Browserbase's cloud browser infrastructure with Stagehand's intelligent browser automation to enable LLMs to interact with web pages, take screenshots, extract data, and execute JavaScript in a controlled environment. The server offers two complementary approaches: the Browserbase MCP for direct browser control with features like navigation, data extraction, and console monitoring, and the Stagehand MCP for natural language-based browser automation that supports multiple AI models including GPT-4 and Claude-3.7 Sonnet.
Browserbase MCP Server enables AI models to control web browsers through the Model Context Protocol. This integration allows LLMs to perform complex web automation tasks including navigation, data extraction, form filling, and capturing screenshots.
The repository provides two distinct but complementary approaches:
You can install the Browserbase MCP Server using the following configuration:
git clone https://github.com/browserbase/mcp-server-browserbase.git
cd mcp-server-browserbase
Choose which MCP you want to use:
cd browserbase
cd stagehand
Install dependencies and start the server:
npm install
npm start
The Browserbase MCP provides direct control over browser automation with features like:
To use Browserbase MCP, you'll need to:
Stagehand MCP offers a more natural language approach to browser automation:
act("click the login button")
or extract("find the red shoes")
To use Stagehand MCP:
You can also install Browserbase MCP through Smithery, which provides a simplified setup process.
For support and community discussions: