Zyte Developers Community newsletter issue #10
If you are not signed up already for the Zyte Developers Community newsletter, you can sign up here.
In this issue:
- Scrape news headlines in <10 lines of code
- Automate the Wiki-Link Game with Python
- Web scraping with Cheerio
- How Scrapy makes web crawling easy
- Web Data Extraction Summit 2021
Scrape news headlines in <10 lines of code
Code_Jedi shares a way to scrape news headlines from BBC news in under 10 lines of code using BeautifulSoup. Check it out!
Automate the Wiki-Link Game with Python
Have you played the Wiki-link game before? You may have noticed that continuously following the first link of any Wikipedia article will eventually lead to “Philosophy". Does this actually happen because you’re clicking the first link or is this based on Wikipedia’s structure? Read all about it in this article by Timo Kats. He extracts the information using the Wikipedia API.
Web scraping with Cheerio
Julian Rosenthal is very impressed with Cheerio, a popular node package for scraping and parsing web content. In this article, he explores cheerio by creating a scraper to get a list of all US presidents from Wikipedia.
How Scrapy makes web crawling easy
For everyone who’s just starting or wanting to start with Scrapy, this article will make you fall in love with the framework. It is a quick read covering the fundamentals of Scrapy and how to set it up to extract data easily and without getting blocked.
Web Data Extraction Summit 2021
Extract Summit 2021 is a great platform to learn from the experts in the web data extraction industry and meet hundreds of like-minded developers.