Field notes from the world of data extraction.

Articles, interviews and analysis on how data is gathered, used and fought over — written by the people closest to it.

⌕

No-code web scraping workflows are here: Introducing the Zyte integration for Zapier

Discover Zyte’s Zapier integration to automate web data extraction and connect it to 10,000+ apps. Build powerful workflows without coding and turn web data into actionable insights instantly.

Robert Andrews10 min readMarch 24, 2026

Product Update

The Scrapy whisperer: Adrian Chaves on Web Scraping Copilot

An interview with Scrapy maintainer Adrian Chaves on Zyte’s Web Scraping Copilot, AI-generated parsing code, and building reliable scraping workflows.

Neha Setia Nagpal10 min readMarch 23, 2026

Code is cheap, show me the talk: How copilots are re-engineering developers

Discover how AI copilots like Zyte’s Web Scraping Copilot are transforming developer workflows—making code a commodity and shifting value to problem-solving and prompting skills.

Ayan Pahwa10 min readMarch 20, 2026

Product Update

Web scraping finally has a home in the IDE

Discover how web scraping is moving into the IDE. Learn how tools like VS Code and AI-assisted extensions are streamlining scraper development, testing, and maintenance.

Mitch Holt10 min readMarch 20, 2026

Announcement

Introducing Web Scraping Copilot 1.0: AI-accelerated web scraping inside VS Code

Discover Web Scraping Copilot 1.0, Zyte’s VS Code extension that uses AI to generate, test, and deploy production-ready Scrapy spiders faster while maintaining full developer control.

Mitch Holt10 min readMarch 18, 2026

How To

Build your own MCP server: LLMs meets web data with Zyte API

Learn how to build your own Model Context Protocol (MCP) server to connect LLMs with real-time web data using Zyte API, FastMCP, and the Docker MCP toolkit.

Ayan Pahwa10 min readMarch 16, 2026

Open Source

More data, more trouble: How a perfect corpus corrupted my AI dream

A failed AI experiment reveals why adding more data doesn’t always improve LLM outputs. Learn when web scraping, RAG, and curated datasets actually make AI better.

Neha Setia Nagpal10 min readMarch 13, 2026

Open Source

Claude skills, MCP or Web Scraping Copilot: Which should you choose?

Compare Claude skills, MCP servers, and Web Scraping Copilot to understand when to use each for AI-powered web scraping, data extraction, and production pipelines with Zyte API.

Neha Setia Nagpal10 min readMarch 11, 2026

Open Source

Supercharging web scraping with Claude skills

Learn how Claude skills can automate HTML fetching, AI parsing, selector generation, and structured data extraction to build faster, smarter web scraping workflows.

John Rooney10 min readMarch 11, 2026

Leadership

Is your AI breaking the law? Legal experts’ advice for web scrapers

Legal experts discuss how AI, web scraping, copyright law, and the EU AI Act intersect—covering fair use, data provenance, and compliance risks for businesses.

Robert Andrews10 min readMarch 10, 2026

Use case

Brewing a bot: RAG and web data fuel the perfect coffee recommendation

Learn how to build a real-time AI chatbot using RAG, web scraping, Zyte API, LangChain, and OpenAI. Scrape JavaScript-heavy websites, store data in a vector database, and generate accurate answers from fresh web data.

Ayan Pahwa10 min readMarch 5, 2026