Explore resources by topic or category

Blog

The trade-offs in crawling infrastructure in the modern anti-bot landscape

Daniel Cave
8 mins
April 10, 2024
In this article, I’ll explain the problem of anti-bot technology for web scraping developers through the lens of the anti-bot distribution curve (a view of the top 250,000 websites and the relative complexity of their anti-bot tech) and the landscape of anti-bot tech across the web.

Blog

Compliant Web Scraping with AI

Callum Henry
6 mins
March 15, 2024
Zyte’s flagship product, Zyte API, now includes built-in features that automate crawling using spider templates, and our patented AI-powered automated extraction, which gives you quality structured data quickly without writing custom parsing code.

Blog

AI Scraping now available in Zyte API

Mitch Holt
5 mins
March 4, 2024
We’re thrilled to announce to our global developer community that Zyte API now comes out of the box with all the features that power our complete solution for scraping with AI, enabling developers to build and launch spiders, unblock websites and extract data from a single UI three times faster than legacy scraping vendors and proxy APIs.

Blog

Web Scraping vs Data Mining | What's the Difference?

Sarah Lang
5 mins
February 17, 2024
Data mining and web scraping sound like two buzzwords meaning the same thing. Quite often data mining is misunderstood as the process of obtaining information from a website;

Blog

The Scraper’s System Part 2: Explorer’s Compass to analyze websites

Neha Setia Nagpal
8 mins
February 16, 2024
In the first part, we discussed a template to define the clear purpose of your web scraping system that can help you design your crawlers better and prepare you for the uncertainty involved in a large scale web scraping project.

Blog

The challenges e-commerce retailers face managing their web scraping proxies

Ian Kerins
7 mins
February 16, 2024
In this article, we discuss some of the main challenges that e-commerce retailers face on a daily basis due to the amount of web data they need, and how to solve them.

Blog

Zyte API is the Successor to Smart Proxy Manager

Daniel Cave
4 mins
February 15, 2024
After a decade of dedicated service, Smart Proxy Manager (SPM) is being retired for new customers. We’re excited to introduce our flagship Web Scraping API, Zyte API, as its worthy successor — the best and only way to access ethical and industry-leading managed proxy solutions.

Blog

Introducing Zyte API Proxy Mode

Adrian Chaves
3 mins
February 6, 2024
Zyte API is the next iteration of Zyte’s best-in-class proxy and website unblocking technology. We built it as an HTTP API. This was a conscious design decision: an API to support all your web scraping needs would not work well with the limitations of a proxy API.

Blog

Court Rules Meta's Terms Do Not Prohibit Scraping of Public Data

Sanaea Daruwalla
7 mins
January 30, 2024
In 2023, Meta sued Bright Data for scraping data from Facebook and Instagram, alleging that the scraping violated Facebook and Instagram’s terms of service and was thus a breach of contract.

Blog

Celebrating Ethics in Web Scraping With the EWDCI Certification

Sanaea Daruwalla
4 mins
November 21, 2023
The reputation of web scraping hasn’t always been the best. Unsavory actors have cast a shadow over the reputable parts of the web scraping industry at large, and it has to stop.

Blog

Simplify Your Web Scraping Project with Zyte API

Daniel Cave
5 mins
November 15, 2023
Web scraping developers often find themselves in a struggle to manage bans and blocks. Every time they resolve a ban, it's only a matter of time before their scrapers encounter the same issue again.

Blog

Zyte API Aced the Proxyway Test of Web Unblocking APIs

Iain Lennon
7 mins
November 9, 2023
Proxyway, one of the most respected proxy provider research blogs, released its in-depth review of five leading proxy APIs on the market. The report, In-Depth Look into Popular Proxy APIs (Web Unblockers), showed that Zyte API is by far the best option when looking at the three major fundamentals: success rate, speed, and cost.