Add Firecrawl to third-party tools (#839)

koverholt · joefernandez · web-flow · commit 9b457dc319b9 · 2025-10-31T09:14:01.000-07:00
* Add Firecrawl tools page

* Add abolute path

* Python code samples

* Update icon and cards

* Update Firecrawl page content

* Update nav

* Fix image link

* Other cleanup

---------

Co-authored-by: Joe Fernandez &lt;joefernandez@users.noreply.github.com&gt;
diff --git a/docs/assets/tools-firecrawl.png b/docs/assets/tools-firecrawl.png
diff --git a/docs/tools/index.md b/docs/tools/index.md
@@ -133,6 +133,16 @@ Check out the following pre-built tools that you can use with ADK agents:
 
 <div class="tool-card-grid">
 
+  <a href="/adk-docs/tools/third-party/firecrawl/" class="tool-card">
+    <div class="tool-card-image-wrapper">
+      <img src="../assets/tools-firecrawl.png" alt="Firecrawl">
+    </div>
+    <div class="tool-card-content">
+      <h3>Firecrawl</h4>
+      <p>Empower your AI apps with clean data from any website</p>
+    </div>
+  </a>
+
   <a href="/adk-docs/tools/third-party/github/" class="tool-card">
     <div class="tool-card-image-wrapper">
       <img src="../assets/tools-github.png" alt="GitHub">
diff --git a/docs/tools/third-party/firecrawl.md b/docs/tools/third-party/firecrawl.md
@@ -0,0 +1,138 @@
+# Firecrawl
+
+The [Firecrawl MCP Server](https://github.com/firecrawl/firecrawl-mcp-server)
+connects your ADK agent to the [Firecrawl](https://www.firecrawl.dev/) API, a
+service that can crawl any website and convert its content into clean,
+structured markdown. This allows your agent to ingest, search, and reason over
+web data from any URL, including all its subpages.
+
+## Features
+
+- **Agent-based Web Research**: Deploy an agent that can take a topic, use the
+  search tool to find relevant URLs, and then use the scrape tool to extract the
+  full content of each page for analysis or summarization.
+
+- **Structured Data Extraction**: Use the extract tool to pull specific,
+  structured information (like product names, prices, or contact info) from a
+  list of URLs, powered by LLM extraction.
+
+- **Large-Scale Content Ingestion**: Automate the scraping of entire websites or
+  large batches of URLs using the batch scrape and crawl tools. This is ideal
+  for populating a vector database for a RAG (Retrieval-Augmented Generation)
+  pipeline.
+
+## Prerequisites
+
+- [Sign up on Firecrawl](https://www.firecrawl.dev/signin) and [get an API key](https://firecrawl.dev/app/api-keys)
+
+## Usage with ADK
+
+=== "Local MCP Server"
+
+    ```python
+    from google.adk.agents.llm_agent import Agent
+    from google.adk.tools.mcp_tool.mcp_session_manager import StdioConnectionParams
+    from google.adk.tools.mcp_tool.mcp_toolset import MCPToolset
+    from mcp import StdioServerParameters
+
+    FIRECRAWL_API_KEY = "YOUR_FIRECRAWL_API_KEY"
+
+    root_agent = Agent(
+        model="gemini-2.5-pro",
+        name="firecrawl_agent",
+        description="A helpful assistant for scraping websites with Firecrawl",
+        instruction="Help the user search for website content",
+        tools=[
+            MCPToolset(
+                connection_params=StdioConnectionParams(
+                    server_params = StdioServerParameters(
+                        command="npx",
+                        args=[
+                            "-y",
+                            "firecrawl-mcp",
+                        ],
+                        env={
+                            "FIRECRAWL_API_KEY": FIRECRAWL_API_KEY,
+                        }
+                    ),
+                    timeout=30,
+                ),
+            )
+        ],
+    )
+    ```
+
+=== "Remote MCP Server"
+
+    ```python
+    from google.adk.agents.llm_agent import Agent
+    from google.adk.tools.mcp_tool.mcp_session_manager import StreamableHTTPServerParams
+    from google.adk.tools.mcp_tool.mcp_toolset import MCPToolset
+
+    FIRECRAWL_API_KEY = "YOUR_FIRECRAWL_API_KEY"
+
+    root_agent = Agent(
+        model="gemini-2.5-pro",
+        name="firecrawl_agent",
+        description="A helpful assistant for scraping websites with Firecrawl",
+        instruction="Help the user search for website content",
+        tools=[
+            MCPToolset(
+                connection_params=StreamableHTTPServerParams(
+                    url=f"https://mcp.firecrawl.dev/{FIRECRAWL_API_KEY}/v2/mcp",
+                ),
+            )
+        ],
+    )
+    ```
+
+## Available tools
+
+This toolset provides a comprehensive suite of functions for web crawling,
+scraping, and searching:
+
+Tool | Name | Description
+---- | ---- | -----------
+Scrape Tool | `firecrawl_scrape` | Scrape content from a single URL with advanced options
+Batch Scrape Tool | `firecrawl_batch_scrape` | Scrape multiple URLs efficiently with built-in rate limiting and parallel processing
+Check Batch Status | `firecrawl_check_batch_status` | Check the status of a batch operation
+Map Tool | `firecrawl_map` | Map a website to discover all indexed URLs on the site
+Search Tool | `firecrawl_search` | Search the web and optionally extract content from search results
+Crawl Tool | `firecrawl_crawl` | Start an asynchronous crawl with advanced options
+Check Crawl Status | `firecrawl_check_crawl_status` | Check the status of a crawl job
+Extract Tool | `firecrawl_extract` | Extract structured information from web pages using LLM capabilities. Supports both cloud AI and self-hosted LLM extraction
+
+## Configuration
+
+The Firecrawl MCP server can be configured using environment variables:
+
+**Required**:
+
+- `FIRECRAWL_API_KEY`: Your Firecrawl API key
+    - Required when using cloud API (default)
+    - Optional when using self-hosted instance with `FIRECRAWL_API_URL`
+
+**Firecrawl API URL (optional)**:
+
+- `FIRECRAWL_API_URL` (Optional): Custom API endpoint for self-hosted instances
+    - Example: `https://firecrawl.your-domain.com`
+    - If not provided, the cloud API will be used (requires API key)
+
+**Retry configuration (optional)**:
+
+- `FIRECRAWL_RETRY_MAX_ATTEMPTS`: Maximum number of retry attempts (default: 3)
+- `FIRECRAWL_RETRY_INITIAL_DELAY`: Initial delay in milliseconds before first retry (default: 1000)
+- `FIRECRAWL_RETRY_MAX_DELAY`: Maximum delay in milliseconds between retries (default: 10000)
+- `FIRECRAWL_RETRY_BACKOFF_FACTOR`: Exponential backoff multiplier (default: 2)
+
+**Credit usage monitoring (optional)**:
+
+- `FIRECRAWL_CREDIT_WARNING_THRESHOLD`: Credit usage warning threshold (default: 1000)
+- `FIRECRAWL_CREDIT_CRITICAL_THRESHOLD`: Credit usage critical threshold (default: 100)
+
+## Additional resources
+
+- [Firecrawl MCP Server Documentation](https://docs.firecrawl.dev/mcp-server)
+- [Firecrawl MCP Server Repository](https://github.com/firecrawl/firecrawl-mcp-server)
+- [Firecrawl Use Cases](https://docs.firecrawl.dev/use-cases/overview)
+- [Firecrawl Advanced Scraping Guide](https://docs.firecrawl.dev/advanced-scraping-guide)
diff --git a/docs/tools/third-party/hugging-face.md b/docs/tools/third-party/hugging-face.md
@@ -29,6 +29,8 @@ your ADK agent to the Hugging Face Hub and thousands of Gradio AI Applications.
     from google.adk.tools.mcp_tool.mcp_toolset import MCPToolset
     from mcp import StdioServerParameters
 
+    HUGGING_FACE_TOKEN = "YOUR_HUGGING_FACE_TOKEN"
+
     root_agent = Agent(
         model="gemini-2.5-pro",
         name="hugging_face_agent",
@@ -43,7 +45,7 @@ your ADK agent to the Hugging Face Hub and thousands of Gradio AI Applications.
                             "@llmindset/hf-mcp-server",
                         ],
                         env={
-                            "HF_TOKEN": "YOUR-HUGGING-FACE-TOKEN",
+                            "HF_TOKEN": HUGGING_FACE_TOKEN,
                         }
                     ),
                     timeout=30,
diff --git a/docs/tools/third-party/index.md b/docs/tools/third-party/index.md
@@ -4,6 +4,16 @@ Check out the following third-party tools that you can use with ADK agents:
 
 <div class="tool-card-grid">
 
+  <a href="/adk-docs/tools/third-party/firecrawl/" class="tool-card">
+    <div class="tool-card-image-wrapper">
+      <img src="../../assets/tools-firecrawl.png" alt="Firecrawl">
+    </div>
+    <div class="tool-card-content">
+      <h3>Firecrawl</h4>
+      <p>Empower your AI apps with clean data from any website</p>
+    </div>
+  </a>
+
   <a href="/adk-docs/tools/third-party/github/" class="tool-card">
     <div class="tool-card-image-wrapper">
       <img src="../../assets/tools-github.png" alt="GitHub">
diff --git a/docs/tutorials/index.md b/docs/tutorials/index.md
@@ -46,12 +46,4 @@ applications with ADK. Explore our collection below and happy building:
 
     [:octicons-arrow-right-24: Discover adk-samples](https://github.com/google/adk-samples){:target="_blank"}
 
--   :material-console-line: **Agentic UI with AG-UI**
-
-    ---
-
-    Build a rich user interface for your agent using the AG-UI protocol and CopilotKit.
-
-    [:octicons-arrow-right-24: Build an agentic UI](ag-ui.md)
-
 </div>
diff --git a/mkdocs.yml b/mkdocs.yml
@@ -163,11 +163,12 @@ nav:
       - Code Execution with Agent Engine: tools/google-cloud/code-exec-agent-engine.md
     - Third-party tools:
       - tools/third-party/index.md
+      - Firecrawl: tools/third-party/firecrawl.md
       - GitHub: tools/third-party/github.md
       - Hugging Face: tools/third-party/hugging-face.md
       - LangChain tools: tools/third-party/langchain.md
       - CrewAI tools: tools/third-party/crewai.md
-      - Agentic UI (AG-UI): tools/third-party/ag-ui.md      
+      - Agentic UI (AG-UI): tools/third-party/ag-ui.md
   - Custom Tools:
     - tools-custom/index.md
     - Function tools: