curl-cffivshrequests

MIT 34 2 1,751

594.9 thousand (month) Feb 23 2022 0.7.1(2024-07-13 09:07:25 ago)

1,001 1 51 MIT

Feb 23 2022 33.3 thousand (month) 0.9.2(2024-12-01 02:55:27 ago)

Curl-cffi is a Python library for implementing curl-impersonate which is a HTTP client that appears as one of popular web browsers like: - Google Chrome - Microsoft Edge - Safari - Firefox Unlike requests and httpx which are native Python libraries, curl-cffi uses cURL and inherits it's powerful features like extensive HTTP protocol support and detection patches for TLS and HTTP fingerprinting.

Using curl-cffi web scrapers can bypass TLS and HTTP fingerprinting.

hrequests is a feature rich modern replacement for a famous requests library for Python. It provides a feature rich HTTP client capable of resisting popular scraper identification techniques: - Seamless transition between headless browser and http client based requests - Integrated HTML parser - Mimicking of real browser TLS fingerprints - Javascript rendering - HTTP2 support - Realistic browser headers

Highlights

bypasshttp2tls-fingerprinthttp-fingerprintsyncasync

Example Use

curl-cffi can be accessed as low-level curl client as well as an easy high-level HTTP client: ```python from curl_cffi import requests response = requests.get('https://httpbin.org/json') print(response.json()) # or using sessions session = requests.Session() response = session.get('https://httpbin.org/json') # also supports async requests using asyncio import asyncio from curl_cffi.requests import AsyncSession urls = [ "http://httpbin.org/html", "http://httpbin.org/html", "http://httpbin.org/html", ] async with AsyncSession() as s: tasks = [] for url in urls: task = s.get(url) tasks.append(task) # scrape concurrently: responses = await asyncio.gather(*tasks) # also supports websocket connections from curl_cffi.requests import Session, WebSocket def on_message(ws: WebSocket, message): print(message) with Session() as s: ws = s.ws_connect( "wss://api.gemini.com/v1/marketdata/BTCUSD", on_message=on_message, ) ws.run_forever() ```

hrequests has almost identical API and UX as requests and here's a quick overview: ```python import hrequests # perform HTTP client requests resp = hrequests.get('https://httpbin.org/html') print(resp.status_code) # 200 # use headless browsers and sessions: session = hrequests.Session('chrome', version=122, os="mac") # supports asyncio and easy concurrency requests = [ hrequests.async_get('https://www.google.com/', browser='firefox'), hrequests.async_get('https://www.duckduckgo.com/'), hrequests.async_get('https://www.yahoo.com/'), hrequests.async_get('https://www.httpbin.org/'), ] responses = hrequests.map(requests, size=3) # max 3 conccurency ```

Alternatives / Similar

curl-impersonate

5,944 compare

primp new

504 compare

hrequests

1,001 compare

requests

53,883 compare

node-fetch

8,860 compare

httpx

15,183 compare

aiohttp

16,395 compare

axios

108,987 compare

got

14,897 compare

needle

1,631 compare

superagent

16,610 compare

faraday

5,927 compare

pycurl

1,147 compare

puppeteer-stealth

94,086 compare

excon

1,172 compare

httpclient

707 compare

httparty

5,889 compare

puppeteer-extra new

7,293 compare

typhoeus

4,131 compare

httr

985 compare

undetected-chromedriver

12,519 compare

camoufox new

7,089 compare

guzzle

23,447 compare

rvest

1,517 compare

nodriver new

4,003 compare

wreck

378 compare

em-http-request

1,219 compare

symfony-http

2,033 compare

mechanize new

4,440 compare

treq

605 compare

http-2

908 compare

req

4,781 compare

resty

11,632 compare

nestful

505 compare

botasaurus new

4,321 compare

crul

107 compare

requests

3,577 compare

buzz

1,924 compare

selenium-driverless

718 compare

httpful

1,803 compare

ralger

165 compare

http.rb

3,104 compare

pydoll new

- compare

curl-impersonate

5,944 compare

primp new

504 compare

curl-cffi

1,751 compare

requests

53,883 compare

node-fetch

8,860 compare

httpx

15,183 compare

aiohttp

16,395 compare

axios

108,987 compare

got

14,897 compare

needle

1,631 compare

superagent

16,610 compare

faraday

5,927 compare

pycurl

1,147 compare

puppeteer-stealth

94,086 compare

excon

1,172 compare

httpclient

707 compare

httparty

5,889 compare

puppeteer-extra new

7,293 compare

typhoeus

4,131 compare

httr

985 compare

undetected-chromedriver

12,519 compare

camoufox new

7,089 compare

guzzle

23,447 compare

rvest

1,517 compare

nodriver new

4,003 compare

wreck

378 compare

em-http-request

1,219 compare

symfony-http

2,033 compare

mechanize new

4,440 compare

treq

605 compare

http-2

908 compare

req

4,781 compare

resty

11,632 compare

nestful

505 compare

botasaurus new

4,321 compare

crul

107 compare

requests

3,577 compare

buzz

1,924 compare

selenium-driverless

718 compare

httpful

1,803 compare

ralger

165 compare

http.rb

3,104 compare

pydoll new

- compare