
newspaper

15,018 6 513 MIT
0.2.8 (28 Sep 2018) Dec 28 2012 1.0 million (month)

newspaper is a Python package that allows developers to easily extract text, images, and videos from articles on the web.

It is designed to be fast, easy to use, and compatible with a wide variety of websites. It uses advanced algorithms to extract relevant information and metadata from articles, and it also supports several languages.

newspaper includes an HTTP client for downloading pages, or it can ingest pre-scraped HTML documents.

Example Use


```python
from newspaper import Article

# Create a new article object
article = Article('https://www.example.com/article')

# Download the article
article.download()

# Parse the article
article.parse()

# Print the article text
print(article.text)

# Print the article title
print(article.title)

# Print the article authors
print(article.authors)

# Print the article publication date
print(article.publish_date)
```

Alternatives / Similar


2,140 2025.4.15 (2025-04-15 04:02:28) Dec 14 2008
5,650 2.0.0 (2024-12-03 15:23:21) Jul 17 2019
2,894 0.8.4.1 (2025-05-03 21:11:43) Jun 30 2011
961 0.18.0 (2024-11-08 14:59:22) Oct 27 2015
3,670 0.12.0 (2026-02-14 21:00:12) Oct 20 2013
12,807 1.1.9 (2018-10-21 03:39:17) Aug 24 2018
297 2.0.7 (2022-11-06 07:33:14) Dec 11 2020

Other Languages

2,824 v1.3.0 (2024-03-01 03:34:34) Apr 20 2016