Developed by Pablo Hoffman and Shane Evans, Scrapy is an open-source Python framework built specifically for web data extraction. With Scrapy spiders, you can download HTML, parse and process the data, and save it in CSV, JSON, or XML format.
In our first tutorials, we focused on how to extract data from a website. In this video, we'll show you how to crawl a website using Scrapy.
After watching this, you will know:
Watch the third part of the Scrapy tutorial course.