
newspaper

14,204 6 501 MIT
0.2.8 (28 Sep 2018) Dec 28 2012 609.2 thousand (month)

newspaper is a Python package that allows developers to easily extract text, images, and videos from articles on the web.

It is designed to be fast, easy to use, and compatible with a wide variety of websites. It extracts the article body along with metadata such as the title, authors, and publication date, and it supports several languages.

newspaper includes an HTTP client, or it can ingest pre-scraped HTML documents.
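As a rough illustration of the pre-scraped path, the sketch below hands markup that was fetched elsewhere to an Article instead of letting the library download it. It assumes the newspaper3k release for Python 3, where `Article.download()` accepts an `input_html` argument; the helper name `parse_prescraped`, the `SAMPLE_HTML` string, and the example URL are hypothetical and exist only for this example.

```python
# Hypothetical sample markup standing in for a page scraped by another tool.
SAMPLE_HTML = """
<html>
  <head><title>A sample article headline</title></head>
  <body>
    <h1>A sample article headline</h1>
    <p>This paragraph stands in for the body of an article that was
    fetched by some other tool before being handed to newspaper.</p>
  </body>
</html>
"""


def parse_prescraped(html, url="https://www.example.com/article"):
    """Parse already-fetched HTML without making a network request."""
    # Imported inside the helper so it can be defined even where the
    # package is not installed (assumption: newspaper3k provides Article).
    from newspaper import Article

    article = Article(url)
    # With input_html supplied, download() uses the given markup
    # instead of fetching the URL.
    article.download(input_html=html)
    article.parse()
    return article


if __name__ == "__main__":
    article = parse_prescraped(SAMPLE_HTML)
    print(article.title)
```

The URL is still passed to the constructor because the Article object expects one, but no request is made when the HTML is supplied directly.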

Example Use


from newspaper import Article

# Create a new article object
article = Article('https://www.example.com/article')

# Download the article
article.download()

# Parse the article
article.parse()

# Print the article text
print(article.text)

# Print the article title
print(article.title)

# Print the article authors
print(article.authors)

# Print the article publication date
print(article.publish_date)

Alternatives / Similar


1,860 2024.2.26 (9 months ago) Dec 14 2008
3,703 2.0.0 (6 days ago) Jul 17 2019
859 0.18.0 (a month ago) Oct 27 2015
2,675 0.8.1 (4 years ago) Jun 30 2011
3,533 0.11.0 (2 years ago) Oct 20 2013
246 2.0.7 (2 years ago) Dec 11 2020
11,112 1.1.9 (6 years ago) Aug 24 2018

Other Languages

2,565 v1.3.0 (9 months ago) Apr 20 2016