Explore resources by topic or category
Browse by Category
Blog
Web Scraping Challenges & Their Cost-Efficient Solutions
Eugenia Evseeva
4 min
November 8, 2023
Web scraping challenges, ranging from IP bans and data accuracy to legal compliance issues, can trip up businesses trying to use web data to fuel machine learning and to make better decisions.
Blog
What Is Product Intelligence and Why Your Business Needs It
Eugenia Evseeva
4 min
October 25, 2023
In 2020, Quibi entered the streaming market with high hopes. They wanted to offer a unique service and capture a market share of the booming industry. But the company didn't conduct extensive product intelligence research.
Blog
How Zyte API takes care of the fundamental needs of your web scraping project!
Neha Setia Nagpal
3 mins
September 13, 2023
Blog
Use cURL for web scraping: A Beginner's Guide
Felipe Boff Nunes
16 Mins
September 11, 2023
cURL stands for "Client URL", it is an open-source command-line tool that allows users to transfer data to or from a web server using various network protocols such as HTTP, HTTPS, FTP, and more. By providing a command line interface, it enables users to collect data from websites with ease. It is widely used for tasks such as API interaction and remote file downloading or uploading.
Blog
The Art of Using Data to Make Decisions in Business
Felipe Boff Nunes
15 Mins
August 31, 2023
Flipping a coin, going with your gut, closing your eyes and begging the universe for guidance on how to proceed — these all have their place, sure. But using data to make decisions is a far more reliable approach in business.
Blog
Scrapy Cloud secrets: Hub Crawl Frontier and how to use it
Julio Cesar Batista
6 Mins
August 24, 2023
Imagine a long crawling process, like extracting data from a website for a whole month. We can start it and leave it running until we get the results.
Blog
How Web Scraping and Graph Databases Can Power Recommendation Engines
Neha Setia Nagpal
11 Mins
August 15, 2023
I recently had the pleasure of participating in the third episode of Graphversation, a monthly live stream series that brings together graph experts and Neo4j enthusiasts for engaging and enlightening discussions about the captivating world of graphs.
Blog
How to Extract Data From HTML Table
Pawel Miech
5 Mins
August 13, 2023
HTML tables are a very common format for displaying information. When building scrapers you often need to extract data from HTML tables on web pages and turn it into some different structured format, for example, JSON, CSV, or Excel. In this article, we discuss how to extract data from HTML tables using Python and Scrapy.
Blog
Storing and Curating Your Web Crawling Data
Fernando Tadao Ito
9 Mins
August 4, 2023
Web crawlers are becoming increasingly popular in the era of big data, especially now with the advent of Large Language Models (LLMs) such as ChatGPT and LLaMA. The sheer amount of data that is publicly available from the web has a wide variety of applications including market research, sentiment analysis, and predictive modeling.
Blog
Introducing Zyte API enterprise
Iain Lennon
3 Mins
May 22, 2023
Today we’re excited to announce to the Zyte and Web Scraping communities our new offering: Zyte API Enterprise.
Blog
Python lxml tutorial | Guide to Web Scraping with python lxml library
Felipe Boff Nunes
6 Mins
May 18, 2023
Whether you're trying to analyze market trends or gather data for research, web scraping can be a useful skill to have. This technique allows you to extract specific pieces of data from websites automatically and process them for further analysis or use.
Blog
Social Media & News Data Extraction | Zyte
Marie Moynihan
4 Mins
March 31, 2023
Data extraction from news sites and social media platforms is becoming an increasingly common practice. Popular use cases range from ensuring more informed investment decisions to protecting brand reputation.