Explore resources by topic or category
Browse by Category
Blog
The future of Scrapy: Smarter, faster and ready for AI-powered scraping
Robert Andrews
6 min
June 23, 2025
What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?
Blog
Rise of the Data Vendor: How Outsourcing is Transforming Supply and Fuelling Businesses
Robert Andrews
6 min
June 20, 2025
With the emergence of managed data extraction vendors, businesses no longer need to gather web data themselves.
Blog
Quality, focus and scale: Three ways data outsourcing benefits businesses
Theresia Tanzil
8 min
June 11, 2025
The Strategic Case for Buying Web Data: Quality, Focus, and Scale
Blog
Ten years since Scrapy 1.0: The stats and stories behind your favorite framework
Cleber Alexandre
5 mins
June 5, 2025
See what 10 years of Scrapy 1.0 has produced — in milestones and metrics - as it became the most-used open source web scraping framework in the world.
Blog
A Deep Dive into Zyte's Open-Source Libraries
Neha Setia Nagpal
1 mins
December 19, 2024
Discover how Zyte’s open-source libraries like ClearHTML, Extruct, Chomp.js, and more simplify web data extraction and processing.
Blog
Selenium, Puppeteer, Playwright: Which tool is right for web scraping at scale?
Neha Setia Nagpal
1 mins
October 7, 2024
Discover the strengths and limitations of Selenium, Puppeteer, and Playwright for web scraping at scale.
Blog
4 essential Scrapy plugins for building efficient and effective spiders
Neha Setia Nagpal
1 mins
August 15, 2024
Here are four essential Scrapy plugins we use to build efficient web crawlers for our customers.
Blog
Choosing Between Puppeteer vs. Selenium for Web Scraping
Karlo Jedud
8 mins
July 10, 2024
Web scraping tools save hours of work by automating data extraction, testing web applications, and performing repetitive tasks.
Blog
The Scraper’s System Part 2: Explorer’s Compass to analyze websites
Neha Setia Nagpal
8 min
February 16, 2024
In the first part, we discussed a template to define the clear purpose of your web scraping system that can help you design your crawlers better and prepare you for the uncertainty involved in a large scale web scraping project.