The latest from Shane Evans

Leadership

The future I dreamed of is dawning

January 21, 2026
Announcement

Reflecting on the 2022 Web Data Extraction Summit: A Memorable Experience

October 25, 2022
Announcement

5 Reasons to Attend Extract Summit 2022: A Must-Attend Event

June 14, 2022
Leadership

Measuring Web product data quality for accurate decisions

August 24, 2021
Announcement

Scrapinghub is Now Zyte: Embracing a New Identity

February 2, 2021
Leadership

Transitioning to Remote Working as a Company: Lessons Learned

March 25, 2020
Use case

Zyte Crawls The Deep Web

February 24, 2015
Open Source

Portia: The Open-Source Visual Web Scraper

April 1, 2014
Announcement

Looking Back At 2013

December 31, 2013
Product Update

Introducing Dash

July 27, 2013
Leadership

Why MongoDB Is A Bad Choice For Storing Scraped Data

May 13, 2013
How To

Finding Similar Items

July 23, 2012
Open Source

Autoscraping Casts A Wider Net

February 27, 2012


© Zyte Group Limited 2026
Zyte's CEO Shane Evans shares a 15-year vision for effortless, AI-driven web data extraction and introduces the 2026 Web Scraping Industry Report with 26 actionable insights.
We held the 2022 Web Data Extraction Summit three weeks ago. I wanted to extend a huge thank you to everyone who came, especially our guest speakers, who shared some great insights throughout the day.
We are delighted to announce that Extract Summit 2022 will be returning to an in-person format after two years of being virtual. This time, it’s going to be in London!
We put Zyte’s own Automatic Extraction API head-to-head with a commercial rival - and an open-source alternative - to find out who’s product extraction top dog.
Zyte is participating in Memex, an ambitious DARPA project that tackles the huge challenge of crawling, indexing, and making sense of areas of the Deep Web, that is, web content not being indexed by traditional search engines such as Google, Bing and others.
We’re proud to announce the developer release of Portia, our new open source visual scraping tool based on Scrapy. Check out this video!
This time last year Pablo and I were chatting about the previous year and what to expect in 2013. I noticed that our team had almost doubled in size in the previous year and we wondered could that possibly continue in 2013?
We're excited to introduce Dash, a major update to our scraping platform.
This release is the final step in migrating to our new storage back end and contains improvements to almost every part of our infrastructure. In this post I'd like to introduce some of the highlights.
MongoDB was used early on at Zyte to store scraped data because it's convenient. Scraped data is represented as (possibly nested) records which can be serialized to JSON.
We have recently started letting more users into the private beta for our Automatic Extraction. We're receiving a lot of applications following the shutdown of Needlebase and we're increasing our capacity to accommodate these users.
Shane Evans