soup

2,227 1 22 MIT

v1.2.5 (16 Jan 2022) Apr 29 2017 58.1 thousand (month)

soup is a Go library for parsing and querying HTML documents.

It provides a simple and intuitive interface for extracting information from HTML pages. It's inspired by popular Python web scraping library BeautifulSoup and shares similar use API implementing functions like Find and FindAll.

soup can also use go's built-in http client to download HTML content.

Note that unlike beautifulsoup, soup does not support CSS selectors or XPath.

Example Use

```go package main

import ( "fmt" "log"

"github.com/anaskhan96/soup" )

func main() {

url := "https://www.bing.com/search?q=weather+Toronto"

# soup has basic HTTP client though it's not recommended for scraping: resp, err := soup.Get(url) if err != nil { log.Fatal(err) }

# create soup object from HTML doc := soup.HTMLParse(resp)

# html elements can be found using Find or FindStrict methods: # in this case find

elements where "class" attribute matches some values: grid := doc.FindStrict("div", "class", "b_antiTopBleed b_antiSideBleed b_antiBottomBleed") # note: to find all elements FindAll() method can be used the same way

# elements can be further searched for descendents: heading := grid.Find("div", "class", "wtr_titleCtrn").Find("div").Text() conditions := grid.Find("div", "class", "wtr_condition") primaryCondition := conditions.Find("div") secondaryCondition := primaryCondition.FindNextElementSibling() temp := primaryCondition.Find("div", "class", "wtr_condiTemp").Find("div").Text() others := primaryCondition.Find("div", "class", "wtr_condiAttribs").FindAll("div") caption := secondaryCondition.Find("div").Text()

fmt.Println("City Name : " + heading) fmt.Println("Temperature : " + temp + "˚C") for _, i := range others { fmt.Println(i.Text()) } fmt.Println(caption) } ```

Alternatives / Similar

goquery

14,926 v1.12.0 (2026-03-15 16:28:52 ago) Aug 29 2016 compare

cascadia

754 Start (2018-02-20 18:47:44 ago) Feb 20 2018 compare

htmlquery

781 v1.3.6 (2026-03-06 04:46:15 ago) Feb 07 2019 compare

xpath

739 v1.3.6 (2026-02-23 07:10:29 ago) Jun 08 2019 compare

Other Languages

parse5

3,886 8.0.0 (2026-02-21 19:30:52 ago) Jul 03 2013 compare

sax-js

1,153 1.6.0 (2026-03-17 01:32:31 ago) Feb 09 2011 compare

htmlparser2

4,789 12.0.0 (2026-03-20 23:08:40 ago) Aug 28 2011 compare

lxml

3,010 6.0.3 (2026-04-09 14:33:38 ago) Dec 13 2022 compare

beautifulsoup

- 4.14.3 (2025-11-30 15:08:24 ago) Jul 26 2019 compare

jsdom new

21,552 29.0.2 (2026-04-07 03:38:38 ago) Nov 21 2011 compare

xmltodict

5,734 1.0.4 (2026-02-22 02:21:21 ago) Jul 30 2007 compare

cheerio

30,265 1.2.0 (2026-02-21 19:30:40 ago) Oct 08 2011 compare

html5lib

1,220 1.1 (2020-06-22 23:32:36 ago) Jul 30 2007 compare

cssselect

309 1.4.0 (2026-01-29 07:00:24 ago) Apr 14 2012 compare

feedparser

2,351 6.0.12 (2025-09-10 13:33:58 ago) Jun 15 2007 compare

nokogiri

6,248 1.19.2 (2026-03-19 21:12:43 ago) Jul 25 2009 compare

parsel

1,324 1.11.0 (2026-01-29 07:19:22 ago) Jul 26 2019 compare

selectolax

1,607 0.4.7 (2026-03-06 09:23:35 ago) Mar 01 2018 compare

pyquery

2,381 2.0.1 (2024-08-30 08:12:22 ago) Dec 05 2008 compare

xml2

223 1.5.2 (2025-12-01 15:40:00 ago) Apr 20 2015 compare

requests-html

13,863 0.10.0 (2019-02-17 20:14:17 ago) Feb 25 2018 compare

rvest

1,517 1.0.5 (2024-02-12 21:10:00 ago) Nov 22 2014 compare

untangle

632 1.2.1 (2022-07-02 14:09:28 ago) Jun 09 2011 compare

scrapling new

36,206 0.4.5 (2026-04-07 04:22:27 ago) Aug 01 2024 compare

html5-php

1,772 2.10.0 (2025-07-25 09:04:22 ago) Jun 01 2013 compare

domcrawler

4,038 v8.0.8 (2026-03-30 15:14:47 ago) Sep 26 2011 compare

chompjs

218 1.4.0 (2025-08-04 21:07:54 ago) Jul 30 2007 compare

html5-parser

700 0.4.12 (2023-11-19 15:09:54 ago) Jun 03 2007 compare

gazpacho

768 1.1 (2020-10-09 12:50:18 ago) Dec 28 2012 compare

embed

2,103 v4.4.15 (2025-01-02 16:53:09 ago) Oct 26 2013 compare

chopper

23 0.6.0 (2023-04-26 10:16:25 ago) Jul 24 2014 compare

simple-html-dom new

- 2.0-RC2 (2019-11-09 15:42:50 ago) Nov 09 2019 compare

ralger

165 2.3.0 (2021-03-18 00:10:00 ago) Dec 22 2019 compare