katana
Katana is a next-generation web crawling and spidering framework written in Go by ProjectDiscovery. It is designed for fast, comprehensive endpoint and asset discovery and is widely used in the security research and bug bounty communities.
Katana offers multiple crawling modes:
- Standard mode: Fast HTTP-based crawling without a browser. Parses HTML, JavaScript files, and other resources to discover endpoints and links.
- Headless mode: Uses a headless Chrome browser to crawl JavaScript-rendered pages and single-page applications (SPAs); see the engine-selection sketch after this list.
- Passive mode: Discovers URLs from external sources (Wayback Machine, Common Crawl, etc.) without actively visiting the target.
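In the Go API, the crawl mode corresponds to the engine package you instantiate. The sketch below is illustrative rather than definitive: it assumes the hybrid (headless) engine is constructed the same way as the standard engine shown in the full example further down, and that the `Headless` option field gates browser-based crawling.

```go
package main

import (
	"github.com/projectdiscovery/katana/pkg/engine/hybrid"
	"github.com/projectdiscovery/katana/pkg/output"
	"github.com/projectdiscovery/katana/pkg/types"
)

func main() {
	options := &types.Options{
		MaxDepth: 2,
		Headless: true, // assumption: enables headless Chrome crawling
		OnResult: func(result output.Result) {
			println(result.Request.URL) // includes JS-rendered endpoints
		},
	}

	crawlerOptions, err := types.NewCrawlerOptions(options)
	if err != nil {
		panic(err)
	}
	defer crawlerOptions.Close()

	// hybrid.New drives a headless browser; standard.New (see the
	// full example below) performs plain HTTP crawling instead.
	crawler, err := hybrid.New(crawlerOptions)
	if err != nil {
		panic(err)
	}
	defer crawler.Close()

	if err := crawler.Crawl("https://example.com"); err != nil {
		panic(err)
	}
}
```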
Key features include:
- Scope control: Configurable crawl scope with regex patterns for including/excluding URLs, domains, and file extensions (a combined configuration sketch follows this list).
- JavaScript parsing: Extracts endpoints from JavaScript files, inline scripts, and AJAX requests, even in standard (non-headless) mode.
- Customizable output: Filter and format output with field selection, JSON output, and custom templates.
- Rate limiting: Built-in rate limiting and concurrency control to avoid overwhelming targets.
- Proxy support: HTTP and SOCKS5 proxy support with rotation.
- Form filling: Detects and auto-fills forms to discover endpoints behind form submissions.
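Several of these features map onto fields of `types.Options`. Here is a minimal sketch focused on scoping and politeness, reusing the option names from the full example below; the `Proxy` field is an assumption based on katana's `-proxy` CLI flag and may differ by version.

```go
package main

import (
	"github.com/projectdiscovery/katana/pkg/engine/standard"
	"github.com/projectdiscovery/katana/pkg/output"
	"github.com/projectdiscovery/katana/pkg/types"
)

func main() {
	options := &types.Options{
		FieldScope:  "rdn", // stay on the root domain name
		MaxDepth:    2,
		Timeout:     10, // per-request timeout in seconds
		Concurrency: 5,  // concurrent crawling goroutines
		Parallelism: 5,  // URL-processing goroutines
		Delay:       1,  // seconds between requests
		RateLimit:   20, // hard cap on requests per second
		// Assumption: mirrors the -proxy CLI flag; unverified field name.
		Proxy: "socks5://127.0.0.1:1080",
		OnResult: func(result output.Result) {
			println(result.Request.URL)
		},
	}

	crawlerOptions, err := types.NewCrawlerOptions(options)
	if err != nil {
		panic(err)
	}
	defer crawlerOptions.Close()

	crawler, err := standard.New(crawlerOptions)
	if err != nil {
		panic(err)
	}
	defer crawler.Close()

	if err := crawler.Crawl("https://example.com"); err != nil {
		panic(err)
	}
}
```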
While Katana was designed for security research and reconnaissance, its fast crawling and JavaScript parsing make it equally useful for endpoint discovery in web scraping and for sitemap generation.
Example Use
```go
package main

import (
	"math"

	"github.com/projectdiscovery/katana/pkg/engine/standard"
	"github.com/projectdiscovery/katana/pkg/output"
	"github.com/projectdiscovery/katana/pkg/types"
)

func main() {
	// Configure crawl options
	options := &types.Options{
		MaxDepth:     3,             // maximum crawl depth
		FieldScope:   "rdn",         // restrict to the root domain name
		BodyReadSize: math.MaxInt,   // maximum response body size to read
		Timeout:      10,            // request timeout in seconds
		Concurrency:  10,            // concurrent crawling goroutines
		Parallelism:  10,            // URL-processing goroutines
		Delay:        0,             // delay between requests in seconds
		RateLimit:    150,           // maximum requests per second
		Strategy:     "depth-first", // or "breadth-first"
		OnResult: func(result output.Result) {
			// Process each discovered URL
			println(result.Request.URL)
		},
	}

	// Create and run the crawler
	crawlerOptions, err := types.NewCrawlerOptions(options)
	if err != nil {
		panic(err)
	}
	defer crawlerOptions.Close()

	crawler, err := standard.New(crawlerOptions)
	if err != nil {
		panic(err)
	}
	defer crawler.Close()

	// Start crawling
	if err := crawler.Crawl("https://example.com"); err != nil {
		panic(err)
	}
}
```
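Building on the same API, here is a sketch of the sitemap-generation use case mentioned earlier: collect every discovered URL and print a plain-text sitemap once the crawl finishes. Guarding the slice with a mutex is a defensive assumption, since the documentation does not guarantee that `OnResult` is invoked from a single goroutine.

```go
package main

import (
	"fmt"
	"sync"

	"github.com/projectdiscovery/katana/pkg/engine/standard"
	"github.com/projectdiscovery/katana/pkg/output"
	"github.com/projectdiscovery/katana/pkg/types"
)

func main() {
	var (
		mu   sync.Mutex
		urls []string
	)

	options := &types.Options{
		MaxDepth:    2,
		FieldScope:  "rdn",
		Timeout:     10,
		Concurrency: 10,
		Parallelism: 10,
		RateLimit:   150,
		Strategy:    "breadth-first", // shallow pages first suits sitemaps
		OnResult: func(result output.Result) {
			// Defensive locking: callback concurrency is not guaranteed
			// to be single-threaded.
			mu.Lock()
			urls = append(urls, result.Request.URL)
			mu.Unlock()
		},
	}

	crawlerOptions, err := types.NewCrawlerOptions(options)
	if err != nil {
		panic(err)
	}
	defer crawlerOptions.Close()

	crawler, err := standard.New(crawlerOptions)
	if err != nil {
		panic(err)
	}
	defer crawler.Close()

	if err := crawler.Crawl("https://example.com"); err != nil {
		panic(err)
	}

	// Emit a minimal plain-text sitemap of everything discovered.
	for _, u := range urls {
		fmt.Println(u)
	}
}
```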