We’ve made a change. Scrapinghub is now Zyte! 

Zyte Developers Community newsletter issue #10

time to read
< 1
Mins
By the one and only
August 19, 2021

Hi there,

If you are not signed up already for the Zyte Developers Community newsletter, you can sign up here.

In this issue:

  • Scrape news headlines in <10 lines of code
  • Automate the Wiki-Link Game with Python
  • Web scraping with Cheerio
  • How Scrapy makes web crawling easy
  • Web Data Extraction Summit 2021

Scrape news headlines in <10 lines of code

Code_Jedi shares a way to scrape news headlines from BBC news in under 10 lines of code using BeautifulSoup. Check it out!

Automate the Wiki-Link Game with Python

Have you played the Wiki-link game before? You may have noticed that continuously following the first link of any Wikipedia article will eventually lead to “Philosophy". Does this actually happen because you’re clicking the first link or is this based on Wikipedia’s structure? Read all about it in this article by Timo Kats. He extracts the information using the Wikipedia API.

Web scraping with Cheerio

Julian Rosenthal is very impressed with Cheerio, a popular node package for scraping and parsing web content. In this article, he explores cheerio by creating a scraper to get a list of all US presidents from Wikipedia.

How Scrapy makes web crawling easy

For everyone who’s just starting or wanting to start with Scrapy, this article will make you fall in love with the framework. It is a quick read covering the fundamentals of Scrapy and how to set it up to extract data easily and without getting blocked.

Web Data Extraction Summit 2021

Extract Summit 2021 is a great platform to learn from the experts in the web data extraction industry and meet hundreds of like-minded developers. Grab your free ticket now!

Written by Zyte team
Sign up to the blog