Field notes from the world of data extraction.

Articles, interviews and analysis on how data is gathered, used and fought over — written by the people closest to it.

⌕

AI and the web: What 2025 changed and what comes next

Web scraping as social practice: Ethics and efficiency in a data-hungry world

Web scraping is more than a technical act, it’s a social practice. Explore ethical scraping principles, responsible data collection, and how to balance efficiency with respect for data and people.

Rodrigo Silva Ferreira10 min readOctober 27, 2025

Buy or Build? The Four Roads to Acquiring Web Data

AI-assisted data extraction

Extract clean content automatically with Zyte API’s new pageContent data type

Discover how Zyte API’s new PageContent data type makes content extraction effortless — delivering clean, structured data from any web page automatically.

Daniel Cave10 min readOctober 20, 2025

Web data collection legality

Balancing innovation and regulation in data scraping

Weighing your options from full control to full service.

Sanaea Daruwalla10 min readOctober 14, 2025

Scraping strategy

The new economics of web data: Smaller scraping just got cheaper

Discover how AI and new scraping tech are lowering costs, removing the “scale tax,” and making web data access affordable for smaller teams.

Theresia Tanzil2 min readOctober 6, 2025

Why your agent deserves a wallet

Acting autonomously on tasks like data gathering, financially-empowered agents could change the economics of the web.

Jan Seidler2 min readSeptember 29, 2025

Developer interest

Memo to CTOs: Don’t build the product

Zyte’s CTO says great tech leaders don’t work on software, they work systems that make software.

Jan Seidler2 min readSeptember 23, 2025

Web data collection

How to Plan Your Web Scraping Project Like a Product Manager

Most web scraping projects that collapse don't fail because of technical incompetence. They fail because teams treat data extraction like a coding sprint rather than a product launch.

Theresia Tanzil2 min readSeptember 15, 2025

AI-assisted data extraction

Agentic web scraping: Hype, reality and what happens next

From broken parsers to context limits, today’s AI agents have real challenges—but with the right tools and orchestration, they could reshape how we extract web data.

Konstantin Lopukhin2 min readSeptember 9, 2025

AI-assisted data extraction

AI Web Scraping as the Future of Scalable Data Collection

AI-powered web scraping is transforming data collection by making it faster, smarter, and highly scalable.

Karlo Jedud5 min readSeptember 4, 2025

Scraping strategy

How Zyte’s extraction experts guarantee data quality

Ensuring web data quality at scale means moving beyond fragile scripts and spot checks to robust validation that keeps business decisions accurate and reliable.

Artur Sadurski2 min readSeptember 1, 2025

Web scraping APIs

Death of the Proxy? There’s An API for That

The proxy era is ending as web scraping shifts from managing IP pools to smarter, API-driven solutions.

Robert Andrews2 min readAugust 26, 2025

Web scraping APIs

Why the Best Engineers Are Actually Lazy

There’s a popular archetype in web scraping circles: the heroic engineer who fights CAPTCHAs at 3 a.m., hand-tunes a proxy farm before breakfast, then rewrites four spiders after lunch because the target sites pushed new JavaScript.

Robert Andrews2 min readAugust 18, 2025

Get the latest posts straight to your inbox

No matter what data type you're looking for, we've got you