If you are not signed up already for the Zyte Developers Community newsletter, you can sign up here.
Code_Jedi shares a way to scrape news headlines from BBC news in under 10 lines of code using BeautifulSoup. Check it out!
Have you played the Wiki-link game before? You may have noticed that continuously following the first link of any Wikipedia article will eventually lead to “Philosophy". Does this actually happen because you’re clicking the first link or is this based on Wikipedia’s structure? Read all about it in this article by Timo Kats. He extracts the information using the Wikipedia API.
Julian Rosenthal is very impressed with Cheerio, a popular node package for scraping and parsing web content. In this article, he explores cheerio by creating a scraper to get a list of all US presidents from Wikipedia.
For everyone who’s just starting or wanting to start with Scrapy, this article will make you fall in love with the framework. It is a quick read covering the fundamentals of Scrapy and how to set it up to extract data easily and without getting blocked.
Extract Summit 2021 is a great platform to learn from the experts in the web data extraction industry and meet hundreds of like-minded developers.