firecrawl
Firecrawl is an AI-powered web scraping API that converts web pages into clean Markdown or structured data, optimized for use with large language models (LLMs) and retrieval-augmented generation (RAG) pipelines. It handles JavaScript rendering, anti-bot bypass, and content extraction automatically.
Firecrawl offers multiple modes:
- Scrape: Convert a single URL into clean Markdown, HTML, or structured data. Handles JavaScript rendering and anti-bot protections automatically.
- Crawl: Crawl an entire website starting from a URL, with configurable depth, URL patterns, and page limits. Returns all pages as clean Markdown.
- Map: Quickly discover all URLs on a website without fully scraping each page. Useful for sitemap generation and crawl planning.
- Extract: Use LLMs to extract specific structured data from pages based on a schema definition.
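Each mode is also exposed as a plain REST endpoint, so no SDK is strictly required. As a minimal sketch, assuming the v1 endpoint path and response shape (verify both against the current API docs), scraping a page over raw HTTP looks roughly like this:

```python
import requests

API_KEY = "YOUR_API_KEY"  # replace with your Firecrawl key

# Sketch of a raw call to the v1 scrape endpoint (no SDK).
# Endpoint path and response shape follow the v1 API docs at the time
# of writing; check the current documentation before relying on them.
resp = requests.post(
    "https://api.firecrawl.dev/v1/scrape",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={"url": "https://example.com/blog/article", "formats": ["markdown"]},
    timeout=60,
)
resp.raise_for_status()
data = resp.json()["data"]
print(data["markdown"][:200])  # first 200 chars of the cleaned Markdown
```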
Key features:
- Clean Markdown output ideal for LLM context windows
- Automatic JavaScript rendering with headless browsers
- Built-in anti-bot bypass for protected websites
- Structured extraction with JSON schemas
- Batch crawling with webhook notifications (see the sketch after this list)
- Python and JavaScript SDKs
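Webhook notifications let you start a large crawl and receive pages as they complete instead of polling for the result. A minimal sketch, assuming the v1 crawl endpoint's `webhook` parameter and a hypothetical receiver URL of your own (consult the docs for the exact event payload format):

```python
import requests

API_KEY = "YOUR_API_KEY"                       # your Firecrawl key
WEBHOOK_URL = "https://your-app.example/hook"  # hypothetical receiver endpoint

# Start an asynchronous crawl; Firecrawl POSTs progress and page events to
# the webhook as the job runs. Parameter names follow the v1 API docs.
resp = requests.post(
    "https://api.firecrawl.dev/v1/crawl",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "url": "https://example.com",
        "limit": 100,
        "scrapeOptions": {"formats": ["markdown"]},
        "webhook": WEBHOOK_URL,
    },
    timeout=60,
)
resp.raise_for_status()
print("Crawl job id:", resp.json()["id"])  # poll this id or wait for webhook events
```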
Firecrawl is a commercial API service (an API key is required; a free tier is available) built by a Y Combinator-backed company. It has become one of the most popular tools for feeding web content into AI applications and is widely used in the LLM/RAG ecosystem.
Note: while the primary service is an API, the core is open source and can be self-hosted.
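When self-hosting, the Python SDK can be pointed at your own instance instead of the hosted API. A minimal sketch, assuming the SDK's `api_url` constructor parameter and the default self-hosted port of 3002 (verify both against the self-hosting guide):

```python
from firecrawl import FirecrawlApp

# Point the SDK at a self-hosted Firecrawl instance rather than
# api.firecrawl.dev. The api_url parameter and default port 3002 are
# taken from the self-hosting guide; adjust to match your deployment.
app = FirecrawlApp(api_key="self-hosted-key", api_url="http://localhost:3002")

result = app.scrape_url("https://example.com")
print(result["markdown"][:200])
```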
Example Use
```python
from firecrawl import FirecrawlApp

app = FirecrawlApp(api_key="YOUR_API_KEY")

# Scrape a single page - get clean markdown
result = app.scrape_url("https://example.com/blog/article")
print(result["markdown"])  # clean markdown content

# Extract structured data with a schema
result = app.scrape_url(
    "https://example.com/product/123",
    params={
        "formats": ["extract"],
        "extract": {
            "schema": {
                "type": "object",
                "properties": {
                    "name": {"type": "string"},
                    "price": {"type": "number"},
                    "description": {"type": "string"},
                },
            }
        },
    },
)
print(result["extract"])  # {"name": "...", "price": 29.99, ...}

# Crawl an entire website
crawl_result = app.crawl_url(
    "https://example.com",
    params={"limit": 100, "scrapeOptions": {"formats": ["markdown"]}},
)
for page in crawl_result["data"]:
    print(page["metadata"]["title"], page["markdown"][:100])

# Map all URLs on a site
map_result = app.map_url("https://example.com")
print(f"Found {len(map_result['links'])} URLs")
```