feat: implement Firecrawl web scraping integration

- Add Firecrawl SDK dependency for web scraping functionality - Create /api/scrape endpoint to handle website content extraction - Replace mock data with real Firecrawl API integration - Add recent sources section showing scraped content excerpts - Update UI to display scraping success/error messages - Fix API response structure handling for Firecrawl integration 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-22 20:01:58 +02:00
parent f31780cbf5
commit d380d68555
8 changed files with 986 additions and 20 deletions
--- a/docs/business/overview.md
+++ b/docs/business/overview.md
@@ -0,0 +1,13 @@
+# Overview
+This is an AI Podcast Generation app that converts a URL into a natural sounding podcast audio file.
+
+# Core Features
+
+## Scrape Website
+The user enters a URL and the app then use Firecrawl to extract the website content.
+
+## Generate Conversation
+The app then uses the AI SDK v5 (OpenAI gpt-5-mini as the model) to create a conversation between 2 hosts discussing the scraped content.
+
+## Generate Podcast Audio
+Use the Elevenlabs SDK to generate the dialog from the conversation text.
--- a/docs/inspiration/notebooklm.png
+++ b/docs/inspiration/notebooklm.png
--- a/docs/technical/firecrawl.md
+++ b/docs/technical/firecrawl.md
@@ -0,0 +1,675 @@
+# Quickstart
+
+> Firecrawl allows you to turn entire websites into LLM-ready markdown
+
+<img className="block" src="https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=4e7b593752a4ff638c1d1dbfddb54a9a" alt="Hero Light" width="1200" height="675" data-path="images/turn-websites-into-llm-ready-data--firecrawl.png" srcset="https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?w=280&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=e80d7c4a694a930342577283249d461a 280w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?w=560&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=1344a5340ddad90ee50ccf66b088b430 560w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?w=840&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=060bf67f6a5b76002c4b416b165d0dce 840w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?w=1100&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=f89c070acdb98f7e2b91f58209f7a628 1100w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?w=1650&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=0486baeea836b83b6c84e28635b20604 1650w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/turn-websites-into-llm-ready-data--firecrawl.png?w=2500&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=1fe62bc4843cf736d0462e93ae0610c8 2500w" data-optimize="true" data-opv="2" />
+
+## Welcome to Firecrawl
+
+[Firecrawl](https://firecrawl.dev?ref=github) is an API service that takes a URL, crawls it, and converts it into clean markdown. We crawl all accessible subpages and give you clean markdown for each. No sitemap required.
+
+## How to use it?
+
+We provide an easy to use API with our hosted version. You can find the playground and documentation [here](https://firecrawl.dev/playground). You can also self host the backend if you'd like.
+
+Check out the following resources to get started:
+
+* [x] **API**: [Documentation](https://docs.firecrawl.dev/api-reference/introduction)
+* [x] **SDKs**: [Python](https://docs.firecrawl.dev/sdks/python), [Node](https://docs.firecrawl.dev/sdks/node)
+* [x] **LLM Frameworks**: [Langchain (python)](https://python.langchain.com/docs/integrations/document_loaders/firecrawl/), [Langchain (js)](https://js.langchain.com/docs/integrations/document_loaders/web_loaders/firecrawl), [Llama Index](https://docs.llamaindex.ai/en/latest/examples/data_connectors/WebPageDemo/#using-firecrawl-reader), [Crew.ai](https://docs.crewai.com/), [Composio](https://composio.dev/tools/firecrawl/all), [PraisonAI](https://docs.praison.ai/firecrawl/), [Superinterface](https://superinterface.ai/docs/assistants/functions/firecrawl), [Vectorize](https://docs.vectorize.io/integrations/source-connectors/firecrawl)
+* [x] **Low-code Frameworks**: [Dify](https://dify.ai/blog/dify-ai-blog-integrated-with-firecrawl), [Langflow](https://docs.langflow.org/), [Flowise AI](https://docs.flowiseai.com/integrations/langchain/document-loaders/firecrawl), [Cargo](https://docs.getcargo.io/integration/firecrawl), [Pipedream](https://pipedream.com/apps/firecrawl/)
+* [x] **Community SDKs**: [Go](https://docs.firecrawl.dev/sdks/go), [Rust](https://docs.firecrawl.dev/sdks/rust) (v1)
+* [x] **Others**: [Zapier](https://zapier.com/apps/firecrawl/integrations), [Pabbly Connect](https://www.pabbly.com/connect/integrations/firecrawl/)
+* [ ] Want an SDK or Integration? Let us know by opening an issue.
+
+**Self-host:** To self-host refer to guide [here](/contributing/self-host).
+
+### API Key
+
+To use the API, you need to sign up on [Firecrawl](https://firecrawl.dev) and get an API key.
+
+### Features
+
+* [**Scrape**](#scraping): scrapes a URL and get its content in LLM-ready format (markdown, summary, structured data via [json mode](#json-mode), screenshot, html)
+* [**Crawl**](#crawling): scrapes all the URLs of a web page and return content in LLM-ready format
+* [**Map**](/features/map): input a website and get all the website urls - extremely fast
+* [**Search**](/features/search): search the web and get full content from results
+* [**Extract**](/features/extract): get structured data from single page, multiple pages or entire websites with AI.
+
+### Powerful Capabilities
+
+* **LLM-ready formats**: markdown, summary, structured data, screenshot, HTML, links, metadata
+* **The hard stuff**: proxies, anti-bot mechanisms, dynamic content (js-rendered), output parsing, orchestration
+* **Lightning fast**: Get results in seconds—built for speed and high-throughput use cases.
+* **Customizability**: exclude tags, crawl behind auth walls with custom headers, max crawl depth, etc...
+* **Media parsing**: pdfs, docx, images.
+* **Reliability first**: designed to get the data you need - no matter how hard it is.
+* **Actions**: click, scroll, input, wait and more before extracting data
+
+You can find all of Firecrawl's capabilities and how to use them in our [documentation](https://docs.firecrawl.dev/api-reference/v2-introduction)
+
+## Installing Firecrawl
+
+<CodeGroup>
+  ```python Python
+  # pip install firecrawl-py
+
+  from firecrawl import Firecrawl
+
+  firecrawl = Firecrawl(api_key="fc-YOUR-API-KEY")
+  ```
+
+  ```js Node
+  # npm install @mendable/firecrawl-js
+
+  import Firecrawl from '@mendable/firecrawl-js';
+
+  const firecrawl = new Firecrawl({ apiKey: "fc-YOUR-API-KEY" });
+  ```
+</CodeGroup>
+
+## Scraping
+
+To scrape a single URL, use the `scrape` method. It takes the URL as a parameter and returns the scraped data as a dictionary.
+
+<CodeGroup>
+  ```python Python
+  from firecrawl import Firecrawl
+
+  firecrawl = Firecrawl(api_key="fc-YOUR-API-KEY")
+
+  # Scrape a website:
+  doc = firecrawl.scrape("https://firecrawl.dev", formats=["markdown", "html"])
+  print(doc)
+  ```
+
+  ```js Node
+  import Firecrawl from '@mendable/firecrawl-js';
+
+  const firecrawl = new Firecrawl({ apiKey: "fc-YOUR-API-KEY" });
+
+  // Scrape a website:
+  const doc = await firecrawl.scrape('https://firecrawl.dev', { formats: ['markdown', 'html'] });
+  console.log(doc);
+  ```
+
+  ```bash cURL
+  curl -s -X POST "https://api.firecrawl.dev/v2/scrape" \
+    -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
+    -H "Content-Type: application/json" \
+    -d '{
+      "url": "https://firecrawl.dev",
+      "formats": ["markdown", "html"]
+    }'
+  ```
+</CodeGroup>
+
+### Response
+
+SDKs will return the data object directly. cURL will return the payload exactly as shown below.
+
+```json
+{
+  "success": true,
+  "data" : {
+    "markdown": "Launch Week I is here! [See our Day 2 Release 🚀](https://www.firecrawl.dev/blog/launch-week-i-day-2-doubled-rate-limits)[💥 Get 2 months free...",
+    "html": "<!DOCTYPE html><html lang=\"en\" class=\"light\" style=\"color-scheme: light;\"><body class=\"__variable_36bd41 __variable_d7dc5d font-inter ...",
+    "metadata": {
+      "title": "Home - Firecrawl",
+      "description": "Firecrawl crawls and converts any website into clean markdown.",
+      "language": "en",
+      "keywords": "Firecrawl,Markdown,Data,Mendable,Langchain",
+      "robots": "follow, index",
+      "ogTitle": "Firecrawl",
+      "ogDescription": "Turn any website into LLM-ready data.",
+      "ogUrl": "https://www.firecrawl.dev/",
+      "ogImage": "https://www.firecrawl.dev/og.png?123",
+      "ogLocaleAlternate": [],
+      "ogSiteName": "Firecrawl",
+      "sourceURL": "https://firecrawl.dev",
+      "statusCode": 200
+    }
+  }
+}
+```
+
+## Crawling
+
+The crawl feature allows you to automatically discover and extract content from a URL and all of its accessible subpages. With our SDKs, simply call the crawl method—this will submit a crawl job, wait for it to finish, and return the complete results for the entire site.
+
+### Usage
+
+<CodeGroup>
+  ```python Python
+  from firecrawl import Firecrawl
+
+  firecrawl = Firecrawl(api_key="fc-YOUR-API-KEY")
+
+  docs = firecrawl.crawl(url="https://docs.firecrawl.dev", limit=10)
+  print(docs)
+  ```
+
+  ```js Node
+  import Firecrawl from '@mendable/firecrawl-js';
+
+  const firecrawl = new Firecrawl({ apiKey: "fc-YOUR-API-KEY" });
+
+  const docs = await firecrawl.crawl('https://docs.firecrawl.dev', { limit: 10 });
+  console.log(docs);
+  ```
+
+  ```bash cURL
+  curl -s -X POST "https://api.firecrawl.dev/v2/crawl" \
+    -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
+    -H "Content-Type: application/json" \
+    -d '{
+      "url": "https://docs.firecrawl.dev",
+      "limit": 10
+    }'
+  ```
+</CodeGroup>
+
+If you're using our API directly, cURL or `start crawl` functions on SDKs, this will return an `ID` where you can use to check the status of the crawl.
+
+```json
+{
+  "success": true,
+  "id": "123-456-789",
+  "url": "https://api.firecrawl.dev/v2/crawl/123-456-789"
+}
+```
+
+### Get Crawl Status
+
+Used to check the status of a crawl job and get its result.
+
+<CodeGroup>
+  ```python Python
+  status = firecrawl.get_crawl_status("<crawl-id>")
+  print(status)
+  ```
+
+  ```js Node
+  const status = await firecrawl.getCrawlStatus("<crawl-id>");
+  console.log(status);
+  ```
+
+  ```bash cURL
+  # After starting a crawl, poll status by jobId
+  curl -s -X GET "https://api.firecrawl.dev/v2/crawl/<jobId>" \
+    -H "Authorization: Bearer $FIRECRAWL_API_KEY"
+  ```
+</CodeGroup>
+
+#### Response
+
+The response will be different depending on the status of the crawl. For not completed or large responses exceeding 10MB, a `next` URL parameter is provided. You must request this URL to retrieve the next 10MB of data. If the `next` parameter is absent, it indicates the end of the crawl data.
+
+<CodeGroup>
+  ```json Scraping
+  {
+    "status": "scraping",
+    "total": 36,
+    "completed": 10,
+    "creditsUsed": 10,
+    "expiresAt": "2024-00-00T00:00:00.000Z",
+    "next": "https://api.firecrawl.dev/v2/crawl/123-456-789?skip=10",
+    "data": [
+      {
+        "markdown": "[Firecrawl Docs home page![light logo](https://mintlify.s3-us-west-1.amazonaws.com/firecrawl/logo/light.svg)!...",
+        "html": "<!DOCTYPE html><html lang=\"en\" class=\"js-focus-visible lg:[--scroll-mt:9.5rem]\" data-js-focus-visible=\"\">...",
+        "metadata": {
+          "title": "Build a 'Chat with website' using Groq Llama 3 | Firecrawl",
+          "language": "en",
+          "sourceURL": "https://docs.firecrawl.dev/learn/rag-llama3",
+          "description": "Learn how to use Firecrawl, Groq Llama 3, and Langchain to build a 'Chat with your website' bot.",
+          "ogLocaleAlternate": [],
+          "statusCode": 200
+        }
+      },
+      ...
+    ]
+  }
+  ```
+
+  ```json Completed
+  {
+    "status": "completed",
+    "total": 36,
+    "completed": 36,
+    "creditsUsed": 36,
+    "expiresAt": "2024-00-00T00:00:00.000Z",
+    "next": "https://api.firecrawl.dev/v2/crawl/123-456-789?skip=26",
+    "data": [
+      {
+        "markdown": "[Firecrawl Docs home page![light logo](https://mintlify.s3-us-west-1.amazonaws.com/firecrawl/logo/light.svg)!...",
+        "html": "<!DOCTYPE html><html lang=\"en\" class=\"js-focus-visible lg:[--scroll-mt:9.5rem]\" data-js-focus-visible=\"\">...",
+        "metadata": {
+          "title": "Build a 'Chat with website' using Groq Llama 3 | Firecrawl",
+          "language": "en",
+          "sourceURL": "https://docs.firecrawl.dev/learn/rag-llama3",
+          "description": "Learn how to use Firecrawl, Groq Llama 3, and Langchain to build a 'Chat with your website' bot.",
+          "ogLocaleAlternate": [],
+          "statusCode": 200
+        }
+      },
+      ...
+    ]
+  }
+  ```
+</CodeGroup>
+
+## JSON mode
+
+With JSON mode, you can easily extract structured data from any URL. We support pydantic schemas to make it easier for you too. Here is how you to use it:
+
+<CodeGroup>
+  ```python Python
+  from firecrawl import Firecrawl
+  from pydantic import BaseModel
+  app = Firecrawl(api_key="fc-YOUR-API-KEY")
+
+  class JsonSchema(BaseModel):
+      company_mission: str
+      supports_sso: bool
+      is_open_source: bool
+      is_in_yc: bool
+
+  result = app.scrape(
+      'https://firecrawl.dev',
+      formats=[{
+        "type": "json",
+        "schema": JsonSchema
+      }],
+      only_main_content=False,
+      timeout=120000
+  )
+
+  print(result)
+  ```
+
+  ```js Node
+  import FirecrawlApp from "@mendable/firecrawl-js";
+  import { z } from "zod";
+
+  const app = new FirecrawlApp({
+    apiKey: "fc-YOUR_API_KEY"
+  });
+
+  // Define schema to extract contents into
+  const schema = z.object({
+    company_mission: z.string(),
+    supports_sso: z.boolean(),
+    is_open_source: z.boolean(),
+    is_in_yc: z.boolean()
+  });
+
+  const result = await app.scrape("https://docs.firecrawl.dev/", {
+    formats: [{
+      type: "json",
+      schema: schema
+    }],
+  });
+
+  console.log(result);
+  ```
+
+  ```bash cURL
+  curl -X POST https://api.firecrawl.dev/v2/scrape \
+      -H 'Content-Type: application/json' \
+      -H 'Authorization: Bearer YOUR_API_KEY' \
+      -d '{
+        "url": "https://docs.firecrawl.dev/",
+        "formats": [ {
+          "type": "json",
+          "schema": {
+            "type": "object",
+            "properties": {
+              "company_mission": {
+                        "type": "string"
+              },
+              "supports_sso": {
+                        "type": "boolean"
+              },
+              "is_open_source": {
+                        "type": "boolean"
+              },
+              "is_in_yc": {
+                        "type": "boolean"
+              }
+            },
+            "required": [
+              "company_mission",
+              "supports_sso",
+              "is_open_source",
+              "is_in_yc"
+            ]
+          }
+        } ]
+      }'
+  ```
+</CodeGroup>
+
+Output:
+
+```json JSON
+{
+    "success": true,
+    "data": {
+      "json": {
+        "company_mission": "AI-powered web scraping and data extraction",
+        "supports_sso": true,
+        "is_open_source": true,
+        "is_in_yc": true
+      },
+      "metadata": {
+        "title": "Firecrawl",
+        "description": "AI-powered web scraping and data extraction",
+        "robots": "follow, index",
+        "ogTitle": "Firecrawl",
+        "ogDescription": "AI-powered web scraping and data extraction",
+        "ogUrl": "https://firecrawl.dev/",
+        "ogImage": "https://firecrawl.dev/og.png",
+        "ogLocaleAlternate": [],
+        "ogSiteName": "Firecrawl",
+        "sourceURL": "https://firecrawl.dev/"
+      },
+    }
+}
+```
+
+## Search
+
+Firecrawl's search API allows you to perform web searches and optionally scrape the search results in one operation.
+
+* Choose specific output formats (markdown, HTML, links, screenshots)
+* Choose specific sources (web, news, images)
+* Search the web with customizable parameters (location, etc.)
+
+For details, see the [Search Endpoint API Reference](/api-reference/endpoint/search).
+
+<CodeGroup>
+  ```python Python
+  from firecrawl import Firecrawl
+
+  firecrawl = Firecrawl(api_key="fc-YOUR-API-KEY")
+
+  results = firecrawl.search(
+      query="firecrawl",
+      limit=3,
+  )
+  print(results)
+  ```
+
+  ```js Node
+  import Firecrawl from '@mendable/firecrawl-js';
+
+  const firecrawl = new Firecrawl({ apiKey: "fc-YOUR-API-KEY" });
+
+  const results = await firecrawl.search('firecrawl', {
+    limit: 3,
+    scrapeOptions: { formats: ['markdown'] }
+  });
+  console.log(results);
+  ```
+
+  ```bash
+  curl -s -X POST "https://api.firecrawl.dev/v2/search" \
+    -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
+    -H "Content-Type: application/json" \
+    -d '{
+      "query": "firecrawl",
+      "limit": 3
+    }'
+  ```
+</CodeGroup>
+
+### Response
+
+SDKs will return the data object directly. cURL will return the complete payload.
+
+```json JSON
+{
+  "success": true,
+  "data": {
+    "web": [
+      {
+        "url": "https://www.firecrawl.dev/",
+        "title": "Firecrawl - The Web Data API for AI",
+        "description": "The web crawling, scraping, and search API for AI. Built for scale. Firecrawl delivers the entire internet to AI agents and builders.",
+        "position": 1
+      },
+      {
+        "url": "https://github.com/mendableai/firecrawl",
+        "title": "mendableai/firecrawl: Turn entire websites into LLM-ready ... - GitHub",
+        "description": "Firecrawl is an API service that takes a URL, crawls it, and converts it into clean markdown or structured data.",
+        "position": 2
+      },
+      ...
+    ],
+    "images": [
+      {
+        "title": "Quickstart | Firecrawl",
+        "imageUrl": "https://mintlify.s3.us-west-1.amazonaws.com/firecrawl/logo/logo.png",
+        "imageWidth": 5814,
+        "imageHeight": 1200,
+        "url": "https://docs.firecrawl.dev/",
+        "position": 1
+      },
+      ...
+    ],
+    "news": [
+      {
+        "title": "Y Combinator startup Firecrawl is ready to pay $1M to hire three AI agents as employees",
+        "url": "https://techcrunch.com/2025/05/17/y-combinator-startup-firecrawl-is-ready-to-pay-1m-to-hire-three-ai-agents-as-employees/",
+        "snippet": "It's now placed three new ads on YC's job board for “AI agents only” and has set aside a $1 million budget total to make it happen.",
+        "date": "3 months ago",
+        "position": 1
+      },
+      ...
+    ]
+  }
+}
+```
+
+### Extracting without schema
+
+You can now extract without a schema by just passing a `prompt` to the endpoint. The llm chooses the structure of the data.
+
+<CodeGroup>
+  ```python Python
+  from firecrawl import Firecrawl
+
+  app = Firecrawl(api_key="fc-YOUR-API-KEY")
+
+  result = app.scrape(
+      'https://firecrawl.dev',
+      formats=[{
+        "type": "json",
+        "prompt": "Extract the company mission from the page."
+      }],
+      only_main_content=False,
+      timeout=120000
+  )
+
+  print(result)
+  ```
+
+  ```js Node
+  import FirecrawlApp from "@mendable/firecrawl-js";
+
+  const app = new FirecrawlApp({
+    apiKey: "fc-YOUR_API_KEY"
+  });
+
+  const result = await app.scrape("https://docs.firecrawl.dev/", {
+    formats: [{
+      type: "json",
+      prompt: "Extract the company mission from the page."
+    }]
+  });
+
+  console.log(result);
+  ```
+
+  ```bash cURL
+  curl -X POST https://api.firecrawl.dev/v2/scrape \
+      -H 'Content-Type: application/json' \
+      -H 'Authorization: Bearer YOUR_API_KEY' \
+      -d '{
+        "url": "https://docs.firecrawl.dev/",
+        "formats": [{
+          "type": "json",
+          "prompt": "Extract the company mission from the page."
+        }]
+      }'
+  ```
+</CodeGroup>
+
+Output:
+
+```json JSON
+{
+    "success": true,
+    "data": {
+      "json": {
+        "company_mission": "AI-powered web scraping and data extraction",
+      },
+      "metadata": {
+        "title": "Firecrawl",
+        "description": "AI-powered web scraping and data extraction",
+        "robots": "follow, index",
+        "ogTitle": "Firecrawl",
+        "ogDescription": "AI-powered web scraping and data extraction",
+        "ogUrl": "https://firecrawl.dev/",
+        "ogImage": "https://firecrawl.dev/og.png",
+        "ogLocaleAlternate": [],
+        "ogSiteName": "Firecrawl",
+        "sourceURL": "https://firecrawl.dev/"
+      },
+    }
+}
+```
+
+## Interacting with the page with Actions
+
+Firecrawl allows you to perform various actions on a web page before scraping its content. This is particularly useful for interacting with dynamic content, navigating through pages, or accessing content that requires user interaction.
+
+Here is an example of how to use actions to navigate to google.com, search for Firecrawl, click on the first result, and take a screenshot.
+
+It is important to almost always use the `wait` action before/after executing other actions to give enough time for the page to load.
+
+### Example
+
+<CodeGroup>
+  ```python Python
+  from firecrawl import Firecrawl
+
+  firecrawl = Firecrawl(api_key="fc-YOUR-API-KEY")
+
+  doc = firecrawl.scrape('https://example.com/login', {
+    formats=['markdown'],
+    actions=[
+      { type: 'write', text: 'john@example.com' },
+      { type: 'press', key: 'Tab' },
+      { type: 'write', text: 'secret' },
+      { type: 'click', selector: 'button[type="submit"]' },
+      { type: 'wait', milliseconds: 1500 },
+      { type: 'screenshot', fullPage: true },
+    ],
+  });
+
+  print(doc.markdown, doc.screenshot);
+  ```
+
+  ```js Node
+  import Firecrawl from '@mendable/firecrawl-js';
+
+  const firecrawl = new Firecrawl({ apiKey: "fc-YOUR-API-KEY" });
+
+  const doc = await firecrawl.scrape('https://example.com/login', {
+    formats: ['markdown'],
+    actions: [
+      { type: 'write', text: 'john@example.com' },
+      { type: 'press', key: 'Tab' },
+      { type: 'write', text: 'secret' },
+      { type: 'click', selector: 'button[type="submit"]' },
+      { type: 'wait', milliseconds: 1500 },
+      { type: 'screenshot', fullPage: true },
+    ],
+  });
+
+  console.log(doc.markdown, doc.screenshot);
+  ```
+
+  ```bash cURL
+  curl -X POST https://api.firecrawl.dev/v2/scrape \
+      -H 'Content-Type: application/json' \
+      -H 'Authorization: Bearer YOUR_API_KEY' \
+      -d '{
+        "url": "https://example.com/login",
+        "formats": ["markdown"],
+        "actions": [
+          { "type": "write", "text": "john@example.com" },
+          { "type": "press", "key": "Tab" },
+          { "type": "write", "text": "secret" },
+          { "type": "click", "selector": "button[type=\"submit\"]" },
+          { "type": "wait", "milliseconds": 1500 },
+          { "type": "screenshot", "fullPage": true },
+        ],
+    }'
+  ```
+</CodeGroup>
+
+### Output
+
+<CodeGroup>
+  ```json JSON
+  {
+    "success": true,
+    "data": {
+      "markdown": "Our first Launch Week is over! [See the recap 🚀](blog/firecrawl-launch-week-1-recap)...",
+      "actions": {
+        "screenshots": [
+          "https://alttmdsdujxrfnakrkyi.supabase.co/storage/v1/object/public/media/screenshot-75ef2d87-31e0-4349-a478-fb432a29e241.png"
+        ],
+        "scrapes": [
+          {
+            "url": "https://www.firecrawl.dev/",
+            "html": "<html><body><h1>Firecrawl</h1></body></html>"
+          }
+        ]
+      },
+      "metadata": {
+        "title": "Home - Firecrawl",
+        "description": "Firecrawl crawls and converts any website into clean markdown.",
+        "language": "en",
+        "keywords": "Firecrawl,Markdown,Data,Mendable,Langchain",
+        "robots": "follow, index",
+        "ogTitle": "Firecrawl",
+        "ogDescription": "Turn any website into LLM-ready data.",
+        "ogUrl": "https://www.firecrawl.dev/",
+        "ogImage": "https://www.firecrawl.dev/og.png?123",
+        "ogLocaleAlternate": [],
+        "ogSiteName": "Firecrawl",
+        "sourceURL": "http://google.com",
+        "statusCode": 200
+      }
+    }
+  }
+  ```
+</CodeGroup>
+
+## Open Source vs Cloud
+
+Firecrawl is open source available under the [AGPL-3.0 license](https://github.com/mendableai/firecrawl/blob/main/LICENSE).
+
+To deliver the best possible product, we offer a hosted version of Firecrawl alongside our open-source offering. The cloud solution allows us to continuously innovate and maintain a high-quality, sustainable service for all users.
+
+Firecrawl Cloud is available at [firecrawl.dev](https://firecrawl.dev) and offers a range of features that are not available in the open source version:
+
+<img src="https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=763a6e92c8605d06294ed7ed45df85d0" alt="Firecrawl Cloud vs Open Source" width="2808" height="856" data-path="images/open-source-cloud.png" srcset="https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?w=280&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=2e9112d82aec51ca204ceee026b6bad3 280w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?w=560&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=9fabc257f1caa297b1b8ec68fb13eddc 560w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?w=840&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=e766290156ea4226df484ee815f5036f 840w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?w=1100&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=ed02646081bce28427156ba1d8bf4fa2 1100w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?w=1650&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=41d72e1c116d48ebc0cfa1a3499b3e9e 1650w, https://mintcdn.com/firecrawl/vlKm1oZYK3oSRVTM/images/open-source-cloud.png?w=2500&fit=max&auto=format&n=vlKm1oZYK3oSRVTM&q=85&s=0f6f34e97633cabdc17cbc28d7af2bb9 2500w" data-optimize="true" data-opv="2" />
+
+## Contributing
+
+We love contributions! Please read our [contributing guide](https://github.com/mendableai/firecrawl/blob/main/CONTRIBUTING.md) before submitting a pull request.