Hyperbrowser MCP
H

hyperbrowserai

Enables web browsing capabilities through tools for content extraction, link following, and browser automation with customizable parameters for scraping, data collection, and web crawling tasks.

Content

Tools

Hyperbrowser MCP Server

smithery badge

Frame 5

This is Hyperbrowser's Model Context Protocol (MCP) Server. It provides various tools to scrape, extract structured data, and crawl webpages. It also provides easy access to general purpose browser agents like OpenAI's CUA, Anthropic's Claude Computer Use, and Browser Use.

More information about the Hyperbrowser can be found here. The hyperbrowser API supports a superset of features present in the mcp server.

More information about the Model Context Protocol can be found here.

Table of Contents

Installation

Manual Installation

To install the server, run:

npx hyperbrowser-mcp <YOUR-HYPERBROWSER-API-KEY>

Running on Cursor

Add to ~/.cursor/mcp.json like this:

{
  "mcpServers": {
    "hyperbrowser": {
      "command": "npx",
      "args": ["-y", "hyperbrowser-mcp"],
      "env": {
        "HYPERBROWSER_API_KEY": "YOUR-API-KEY"
      }
    }
  }
}

Running on Windsurf

Add to your ./codeium/windsurf/model_config.json like this:

{
  "mcpServers": {
    "hyperbrowser": {
      "command": "npx",
      "args": ["-y", "hyperbrowser-mcp"],
      "env": {
        "HYPERBROWSER_API_KEY": "YOUR-API-KEY"
      }
    }
  }
}

Development

For development purposes, you can run the server directly from the source code.

  1. Clone the repository:

    git clone git@github.com:hyperbrowserai/mcp.git hyperbrowser-mcp
    cd hyperbrowser-mcp
    
  2. Install dependencies:

    npm install # or yarn install
    npm run build
    
  3. Run the server:

    node dist/server.js
    

Claude Desktop app

This is an example config for the Hyperbrowser MCP server for the Claude Desktop client.

{
  "mcpServers": {
    "hyperbrowser": {
      "command": "npx",
      "args": ["--yes", "hyperbrowser-mcp"],
      "env": {
        "HYPERBROWSER_API_KEY": "your-api-key"
      }
    }
  }
}

Tools

  • scrape_webpage - Extract formatted (markdown, screenshot etc) content from any webpage
  • crawl_webpages - Navigate through multiple linked pages and extract LLM-friendly formatted content
  • extract_structured_data - Convert messy HTML into structured JSON
  • search_with_bing - Query the web and get results with Bing search
  • browser_use_agent - Fast, lightweight browser automation with the Browser Use agent
  • openai_computer_use_agent - General-purpose automation using OpenAI’s CUA model
  • claude_computer_use_agent - Complex browser tasks using Claude computer use
  • create_profile - Creates a new persistent Hyperbrowser profile.
  • delete_profile - Deletes an existing persistent Hyperbrowser profile.
  • list_profiles - Lists existing persistent Hyperbrowser profiles.

Installing via Smithery

To install Hyperbrowser MCP Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @hyperbrowserai/mcp --client claude

Resources

The server provides the documentation about hyperbrowser through the resources methods. Any client which can do discovery over resources has access to it.

License

This project is licensed under the MIT License.

scrape_webpage

Scrape a webpage and extract its content in various formats. This tool allows fetching content from a single URL with configurable browser behavior options. Use this for extracting text content, HTML structure, collecting links, or capturing screenshots of webpages.

crawl_webpages

Crawl a website starting from a URL and explore linked pages. This tool allows systematic collection of content from multiple pages within a domain. Use this for larger data collection tasks, content indexing, or site mapping.

extract_structured_data

Extract structured data from a webpage. This tool allows you to extract structured data from a webpage using a schema.

browser_use_agent

This tool employs an open-source browser automation agent optimized specifically for fast, efficient, and cost-effective browser tasks using a cloud browser. It requires explicit, detailed instructions to perform highly specific interactions quickly. Optimal for tasks requiring: - Precise, explicitly defined interactions and actions - Speed and efficiency with clear, unambiguous instructions - Cost-effective automation at scale with straightforward workflows Best suited use cases include: - Explicitly defined registration and login processes - Clearly guided navigation through web apps - Structured, step-by-step web scraping with detailed guidance - Extracting data via explicitly specified browser interactions You must provide extremely detailed step-by-step instructions, including exact elements, actions, and explicit context. Clearly define the desired outcome for optimal results. Returns the completed result or an error message if issues arise. Note: This agent trades off flexibility for significantly faster performance and lower costs compared to Claude and OpenAI agents.

openai_computer_use_agent

This tool utilizes OpenAI's model to autonomously execute general-purpose browser-based tasks with balanced performance and reliability using a cloud browser. It handles complex interactions effectively with practical reasoning and clear execution. Optimal for tasks requiring: - Reliable, general-purpose browser automation - Clear, structured interactions with moderate complexity - Efficient handling of common web tasks and workflows Best suited use cases include: - Standard multi-step registration or form submissions - Navigating typical web applications requiring multiple interactions - Conducting structured web research tasks - Extracting data through interactive web processes Provide a clear step-by-step description, necessary context, and expected outcomes. Returns the completed result or an error message if issues arise.

claude_computer_use_agent

This tool leverages Anthropic's Claude model to autonomously execute complex browser tasks with sophisticated reasoning capabilities using a cloud browser. It specializes in handling intricate, nuanced, or highly context-sensitive web interactions. Optimal for tasks requiring: - Complex reasoning over multiple web pages - Nuanced interpretation and flexible decision-making - Human-like interaction with detailed context awareness Best suited use cases include: - Multi-step processes requiring reasoning (e.g., detailed registrations or onboarding) - Interacting intelligently with advanced web apps - Conducting in-depth research with complex conditions - Extracting information from dynamic or interactive websites Provide detailed task instructions, relevant context, and clearly specify the desired outcome for best results. Returns the completed result or an error message if issues arise.

search_with_bing

Search the web using Bing. This tool allows you to search the web using bing.com

create_profile

Creates a new persistent Hyperbrowser profile.

delete_profile

Deletes an existing persistent Hyperbrowser profile.

list_profiles

Lists existing persistent Hyperbrowser profiles, with optional pagination.

Connect MCP Server