Explore how AI agents are reshaping web traffic into hostile, negotiated, and invited access lanes. Learn what this means for bots, scraping, and the future of data access.
How online retailers use web data to compete on price, promotion, and availability
The recipe for a request: Scaling data extraction through investigation
Web scraping can look deceptively easy these days. There are numerous open-source libraries/frameworks, visual scraping tools, and data extraction tools that make it very easy to scrape data from a website.
A Sneak Peek Inside What Hedge Funds Think of Alternative Financial Data
Unbeknownst to many, there is a data revolution happening in finance. In their never-ending search for alpha, hedge funds and investment banks are increasingly turning to new alternative sources of data to give them an informational edge over the market.
Want To Predict Fitbit’s Quarterly Revenue? Eagle Alpha Did It Using Web Scraped Product Data
Throughout the history of the financial markets, information has been power.
GDPR Compliance Tools for Web Scraping Crawlers
Over the last couple of weeks, GDPR has brought data protection to center stage. What was once a fringe concern for most businesses became, overnight, a burning problem that needed to be solved immediately.
Announcement
Looking Back at 2017: Achievements and Innovations
It’s been another standout year for Scrapinghub and the scraping community at large. Together we crawled 79.1 billion pages (nearly double 2016), with over 103 billion scraped records; what a year!
Product Update
A Faster, Updated Zyte
We’re very excited to announce a new look for Zyte!
Scraping The Steam Game Store With Scrapy
This is a guest post from the folks over at Intoli, one of the awesome companies providing Scrapy commercial support and longtime Scrapy fans.
Leadership
Do Androids Dream Of Electric Sheep?
It has become very easy to do machine learning: you install an ML library like scikit-learn or xgboost, choose an estimator, feed it some training data, and get a model that can be used for predictions.
How To
Deploy Your Scrapy Spiders From GitHub | Scrapy Cloud
Up until now, your deployment process using Scrapy Cloud has probably been something like this: code and test your spiders locally, commit and push your changes to a GitHub repository, and finally deploy them to Scrapy Cloud using shub deploy.
Looking Back At 2016
We started 2016 with an eye on blowing 2015 out of the water. Mission accomplished.
How To Increase Sales With Online Reputation Management
One negative review can cost your business up to 22% of its prospects. This was one of the sobering findings in a study highlighted on Moz last year.
Web Scraping Price Monitoring
Computers are great at repetitive tasks. They don't get distracted, bored, or tired.