Articles from the Zyte blog about Open-source.

In our interview, a QA expert warns: before you delegate web scraping quality assurance to AI, make sure you can describe what ‘good’ looks like for yourself.

What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?

See what 10 years of Scrapy 1.0 has built, in milestones and metrics.

The story of Scrapy reflects the broader evolution of the web itself and the ongoing quest to harness its ever-expanding ocean of information.

Discover how Zyte’s open-source libraries like ClearHTML, Extruct, Chomp.js, and more simplify web data extraction and processing.

Here are four essential Scrapy plugins we use to build efficient web crawlers for our customers.

Learn about the scraper's system for analyzing websites: Explorer’s Compass.

Get the best value for your web crawling project by using Scrapy, a framework worth learning and incorporating for easy and accurate web crawling.

Meet Dateparser, a potent date parsing library simplifying date extraction from HTML pages. Ideal for various applications like command-line tools, chatbots, and more.

Understand which Scrapy settings help you honor these limits and how to achieve better performance during broad crawls in the presence of these limits.

We are introducing a new open-source project, Scrapy-GUI. It provides a GUI for Scrapy Shell and makes it easier to write spiders.

Zyte Smart Proxy Manager is specifically designed for web scraping. In this article, learn how to use Zyte Smart Proxy Manager inside your Scrapy spider.