Browsed by
Category: Scrapinghub

A Sneak Peek Inside What Hedge Funds Think of Alternative Financial Data

A Sneak Peek Inside What Hedge Funds Think of Alternative Financial Data

Unbenounced to many, there is a data revolution happening in finance. In their never ending search for alpha hedge funds and investment banks are increasingly turning to new alternative sources of data to give them an informational edge over the market. On the 31st May, Scrapinghub got the chance to see this revolution first hand. Mills Horton and Thad Chappell of Scrapinghub were invited to Eagle Alpha’s Alternative Data Showcase in New York City, and had some of the leading…

Read More Read More

Want to Predict Fitbit’s Quarterly Revenue? Eagle Alpha Did It Using Web Scraped Product Data

Want to Predict Fitbit’s Quarterly Revenue? Eagle Alpha Did It Using Web Scraped Product Data

Throughout the history of the financial markets information has been power. The trader with access to the most accurate information can quickly gain an edge over the market. Two hundred years ago, in the age before telegrams and news services, knowing the results of battles, elections and campaigns before anybody else was a huge advantage. Fifty years ago, before Reuters began digitizing company statements, access to company financials gave fundamentals-based investors like Benjamin Graham and Warren Buffett an edge over…

Read More Read More

How Data Compliance Companies Are Turning To Web Crawlers To Take Advantage of the GDPR Business Opportunity

How Data Compliance Companies Are Turning To Web Crawlers To Take Advantage of the GDPR Business Opportunity

Over the last couple weeks, GDPR has brought data protection center stage. What was once a fringe concern for most businesses overnight became a burning problem that needed to be solved immediately. With the sweeping changes that GDPR has introduced, it has proven itself to be a huge headache for companies big and small. However, GDPR has been a goldmine for some savvy companies who positioned themselves to take full advantage of the surge in demand for data compliance solutions….

Read More Read More

Looking Back at 2017

Looking Back at 2017

It’s been another standout year for Scrapinghub and the scraping community at large. Together we crawled 79.1 billion pages (nearly double 2016), with over 103 billion scraped records; what a year! We’ll do our best here to give you the highlights of 2017 and whet your appetite for what you can expect in 2018 – let’s get into it: What’s New Let’s start with some of what was new in 2017! In July we launched a new offering specifically for…

Read More Read More

A Faster, Updated Scrapinghub

A Faster, Updated Scrapinghub

We’re very excited to announce a new look for Scrapinghub! We’ve been improving your experience by streamlining common workflows, and integrating with common tools (like our integration with Github). Today’s release is another step in that direction. Here is what’s new:   New styles We hope you like the new look! Most things are in the same place as before, so we hope it’s a seamless transition.   Onboarding For those new to the platform there’s an improved onboarding experience…

Read More Read More

Deploy your Scrapy Spiders from GitHub

Deploy your Scrapy Spiders from GitHub

Up until now, your deployment process using Scrapy Cloud has probably been something like this: code and test your spiders locally, commit and push your changes to a GitHub repository, and finally deploy them to Scrapy Cloud using shub deploy. However, having the development and the deployment processes in isolated steps might bring you some issues, such as unversioned and outdated code running in production. The good news is that, from now on, you can have your code automatically deployed…

Read More Read More

Looking Back at 2016

Looking Back at 2016

We started 2016 with an eye on blowing 2015 out of the water. Mission accomplished. Together with our users, we crawled more in 2016 than the rest of Scrapinghub’s history combined: a whopping 43.7 billion web pages, resulting in 70.3 billion scraped records! Great work everyone! In the what follows, we’ll give you a whirlwind tour of what we’ve been up to in 2016, along with a quick peek at what you can expect in 2017. Platform Scrapy Cloud It’s…

Read More Read More

How to Increase Sales with Online Reputation Management

How to Increase Sales with Online Reputation Management

One negative review can cost your business up to 22% of its prospects. This was one of the sobering findings in a study highlighted on Moz last year. With over half of shoppers rating reviews as important in their buying decision, no company large or small can afford to ignore stats like these – let alone the reviews themselves. In what follows I’ll let you in on how web scraping can help you stay on top. What is online reputation…

Read More Read More

How to Build your own Price Monitoring Tool

How to Build your own Price Monitoring Tool

Computers are great at repetitive tasks. They don’t get distracted, bored, or tired. Automation is how you should be approaching tedious tasks that are absolutely essential to becoming a successful business or when carrying out mundane responsibilities. Price monitoring, for example, is a practice that every company should be doing, and is a task that readily lends itself to automation. In this tutorial, I’ll walk you through how to create your very own price monitoring tool from scratch. While I’m…

Read More Read More

How You Can Use Web Data to Accelerate Your Startup

How You Can Use Web Data to Accelerate Your Startup

In just the US alone, there were 27 million individuals running or starting a new business in 2015. With this fiercely competitive startup scene, business owners need to take advantage of every resource available, especially given a high probability of failure. Enter web data. Web data is abundant and those who harness it can do everything from keeping an eye on competitors to ensuring customer satisfaction. Web Data and Web Scraping You can get web data through a process called…

Read More Read More