requestsvspycurl

Apache-2.0 267 30 52,519

573.0 million (month) Feb 14 2011 2.32.3(1 year, 2 months ago)

1,094 9 9 LGPL-2.1

Feb 25 2003 1.7 million (month) 7.45.4(8 months ago)

The requests package is a popular library for making HTTP requests in Python. It provides a simple, easy-to-use API for sending HTTP/1.1 requests, and it abstracts away many of the low-level details of working with HTTP. One of the key features of requests is its simple API. You can send a GET request with a single line of code:

import requests
response = requests.get('https://webscraping.fyi/lib/requests/')

requests makes it easy to send data along with your requests, including JSON data and files. It also automatically handles redirects and cookies, and it can handle both basic and digest authentication. Additionally, it's also providing powerful functionality for handling exceptions, managing timeouts and session, also handling a wide range of well-known content-encoding types. One thing to keep in mind is that requests is a synchronous library, which means that your program will block (stop execution) while waiting for a response. In some situations, this may not be desirable, and you may want to use an asynchronous library like httpx or aiohttp. You can install requests package via pip package manager:

pip install requests

requests is a very popular library and has a large and active community, which means that there are many third-party libraries that build on top of it, and it has a wide range of usage.

PycURL is a Python interface to libcurl, a multi-protocol file transfer library written in C. PycURL allows developers to use a variety of network protocols in their Python programs, including HTTP, FTP, SMTP, POP3, and many more.

PycURL is often used in web scraping, data analysis, and automation tasks, as it allows developers to send and receive data over the internet. It can be used to perform various types of requests, such as GET, POST, PUT, and DELETE, and can also handle file uploads and downloads, cookies, and redirects.

One of the key features of PycURL is its support for SSL and proxy servers, which allows developers to securely transfer data over the internet and work around any network restrictions. PycURL also supports a wide range of authentication methods, such as Basic, Digest, and NTLM, and allows developers to easily set custom headers and query parameters.

Just like cURL itself, PycURL is also highly configurable and allows for fine-grained control over various aspects of the transfer, such as timeouts, retries, buffer sizes, and verbosity levels. Additionally, PycURL also provides easy access to the underlying libcurl library, which allows developers to access advanced functionality that is not exposed by the PycURL API.

It's important to note that PycURL is a wrapper around the libcurl library and therefore provides the same functionality and performance as libcurl.

Main strengths of PycURL is that it uses cURL which is one of the most feature rich low-level http clients. The downside is that it's a very low-level client (see the examples below) with complex API making use in web scraping very difficult and niche.

Highlights

syncease-of-useno-http2no-asyncpopular

uses-curlhttp2multi-partresponse-streaminghttp-proxy

Example Use

import requests

# get request:
response = requests.get("http://webscraping.fyi/")
response.status_code
200
response.text
"text"
response.content
b"bytes"

# requests can automatically convert json responses to Python dictionaries:
response = requests.get("http://httpbin.org/json")
print(response.json())
{'slideshow': {'author': 'Yours Truly', 'date': 'date of publication', 'slides': [{'title': 'Wake up to WonderWidgets!', 'type': 'all'}, {'items': ['Why <em>WonderWidgets</em> are great', 'Who <em>buys</em> WonderWidgets'], 'title': 'Overview', 'type': 'all'}], 'title': 'Sample Slide Show'}}

# for POST request it can ingest Python's dictionaries as JSON:
response = requests.post("http://httpbin.org/post", json={"query": "hello world"})
# or form data:
response = requests.post("http://httpbin.org/post", data={"query": "hello world"})

# Session object can be used to automatically keep track of cookies and set defaults:
from requests import Session
s = Session()
s.headers = {"User-Agent": "webscraping.fyi"}
s.get('http://httpbin.org/cookies/set/foo/bar')
print(s.cookies['foo'])
'bar'
print(s.get('http://httpbin.org/cookies').json())
{'cookies': {'foo': 'bar'}}

import pycurl
from io import BytesIO

buf = BytesIO()
headers = BytesIO()

c = pycurl.Curl()
c.setopt(c.HTTP_VERSION, c.CURL_HTTP_VERSION_2_0)  # set to use http2
# set proxy
c.setopt(c.PROXY, 'http://proxy.example.com:8080') 
c.setopt(c.PROXYUSERNAME, 'username')
c.setopt(c.PROXYPASSWORD, 'password')

# make a request
c.setopt(c.URL, 'https://httpbin.org/get')
c.setopt(c.WRITEFUNCTION, buf.write)  # where to save response body
c.setopt(c.HEADERFUNCTION, headers.write)  # where to save response headers
# to make post request enable POST option:
# c.setopt(c.POST, 1)
# c.setopt(c.POSTFIELDS, 'key1=value1&key2=value2')
c.perform()  # send request

# read response
data = buf.getvalue().decode()
headers = headers.getvalue().decode()  # headers as a string
headers = dict([h.split(': ') for h in headers.splitlines() if ': ' in h])  # headers as a dict
c.close()

# multiple concurrent requests can be made using CurlMulti object:
# Create a CurlMulti object
multi = pycurl.CurlMulti()
# Set the number of maximum connections
multi.setopt(pycurl.MAXCONNECTS, 5)

# Create a list to store the Curl objects
curls = []

# Add the first request
c1 = pycurl.Curl()
c1.setopt(c1.URL, 'https://httpbin.org/get')
c1.setopt(c1.WRITEFUNCTION, BytesIO().write)
multi.add_handle(c1)
curls.append(c1)

# Add the second request
c2 = pycurl.Curl()
c2.setopt(c2.URL, 'https://httpbin.org/')
c2.setopt(c2.WRITEFUNCTION, BytesIO().write)
multi.add_handle(c2)
curls.append(c2)

# Start the requests
while True:
    ret, _ = multi.perform()
    if ret != pycurl.E_CALL_MULTI_PERFORM:
        break

# Close the connections
for c in curls:
    multi.remove_handle(c)
    c.close()

Alternatives / Similar

node-fetch

8,825 compare

axios

106,345 compare

aiohttp

15,425 compare

httpx

13,703 compare

got

14,454 compare

curl-impersonate

4,221 compare

superagent

16,610 compare

needle

1,637 compare

faraday

5,785 compare

httpclient

703 compare

excon

1,163 compare

httparty

5,837 compare

pycurl

1,094 compare

typhoeus

4,084 compare

httr

988 compare

rvest

1,498 compare

curl-cffi

1,751 compare

guzzle

23,055 compare

em-http-request

1,217 compare

symfony-http

1,976 compare

wreck

381 compare

http-2

898 compare

treq

590 compare

resty

10,341 compare

req

4,374 compare

nestful

505 compare

crul

107 compare

requests

3,576 compare

hrequests

780 compare

buzz

1,913 compare

httpful

1,741 compare

ralger

156 compare

http.rb

3,013 compare

requests

52,519 compare

node-fetch

8,825 compare

axios

106,345 compare

aiohttp

15,425 compare

httpx

13,703 compare

got

14,454 compare

curl-impersonate

4,221 compare

superagent

16,610 compare

needle

1,637 compare

faraday

5,785 compare

httpclient

703 compare

excon

1,163 compare

httparty

5,837 compare

typhoeus

4,084 compare

httr

988 compare

rvest

1,498 compare

curl-cffi

1,751 compare

guzzle

23,055 compare

em-http-request

1,217 compare

symfony-http

1,976 compare

wreck

381 compare

http-2

898 compare

treq

590 compare

resty

10,341 compare

req

4,374 compare

nestful

505 compare

crul

107 compare

requests

3,576 compare

hrequests

780 compare

buzz

1,913 compare

httpful

1,741 compare

ralger

156 compare

http.rb

3,013 compare