In this third post in our solution architecture series, we will share with you our step-by-step process for conducting a legal review of every web scraping project we work on.
When it comes to using web data as alternative data for investment decision making, one topic rules them all: compliance.
Proxy management is the thorn in the side of most web scrapers. Without a robust and fully featured proxy infrastructure, you will face constant reliability issues and spend hours putting out proxy fires, a situation no web scraping professional wants to deal with.
I was recently invited to speak at the IAPP Europe Data Protection Congress in Brussels about web scraping and GDPR.
When it comes to web scraping, one key element is often overlooked until it becomes a big problem.
Chau Tung Lam Nguyen Bhatt
Google Summer of Code (GSoC) was such a great experience for students like me. I learned so much about open source communities as well as contributing to their complex projects.
Unless you’ve been living under a rock for the past few months, you know that the EU’s General Data Protection Regulation (GDPR) is upon us.
It has become very easy to do machine learning: you install an ML library like scikit-learn or xgboost, choose an estimator, feed it some training data, and get a model that can be used for predictions.
During the 2016 Collision Conference held in New Orleans, our Content Strategist Cecilia Haynes interviewed conference speaker Dr. Tyrone Grandison.
What does “the Future of Work” mean to you? To us, it describes how we approach life at Scrapinghub.
This is a tale of trial, tribulation, and triumph. It is the story of how I overcame obstacles including an inconveniently placed grove of eucalyptus trees, armed with little more than a broom and a pair of borrowed binoculars, to establish a stable internet connection.