How to leverage alternative data in asset management

Whether you are managing a hedge fund trying to find innovative sources of alpha or are an analyst looking to future proof your company’s financial investments, as big data continues to disrupt the investment research landscape, getting on top of these alternative datasets as early as possible is the key to capturing the immense alpha left in this data.

Building Blocks of an Unstoppable Web Scraping Infrastructure

More and more businesses leverage the power of web scraping. Extracting data from the web is becoming popular. But it doesn't mean that the technical challenges are gone. Building a sustainable web scraping infrastructure takes expertise and experience. Here, at Scrapinghub we scrape 9 billion pages per month. In this article, we are going to summarize what the essential elements of web...

Backconnect Proxy: Explanation & Comparison To Other Proxies

Scaling up your web scraping project is not an easy task. Adding proxies is one of the first actions you will need to take. You will need to manage a healthy proxy pool to avoid bans. There are a lot of proxy services/providers, each having a whole host of different types of proxies. In this blog post, you are going to learn how backconnect proxies work and when you should use them.

...

4 Sectors That Benefited Most from Business Intelligence Software

Data moves around the marketplace. It can be sourced internally or externally and collected from vendors, manufacturers, retailers, wholesalers, consumers, and other players in the marketplace. This data is then processed and used by businesses in making insights and decisions regarding new business ventures, product ideas, conflict resolution, and process improvement.

Price Scraping: The Best Free Tool To Scrape Prices

Price scraping is something that you need to do if you want to extract pricing data from websites. It might look easy and just a minor technical detail that needs to be handled but in reality, if you don’t know the best way to get those price values from the HTMLs, it can be a headache over time.

How to use Crawlera with Scrapy

Crawlera is a proxy service, specifically designed for web scraping. In this article, you are going to learn how to use Crawlera inside your Scrapy spider.

TRY CRAWLERA FOR FREE

Scrapy, Matplotlib and MySQL: Real Estate Data Analysis

In this article, we will extract real estate listings from one of the biggest real estate sites and then analyze the data. Similar to our previous web data analysis blog post, I will show you a simple way to extract web data with python and then perform descriptive analysis on the dataset.

Web Scraping Questions & Answers Part II

In our article last week, we answered some of the best questions we got during Extract Summit. In today’s post we share with you the second part of this series. We are covering questions on Web Scraping Infrastructure and How Machine Learning can be used in Web Scraping.

Scrapy & AutoExtract API integration

We’ve just released a new open-source Scrapy middleware which makes it easy to integrate AutoExtract into your existing Scrapy spider. If you haven’t heard about AutoExtract yet, it’s an AI-based web scraping tool which automatically extracts data from web pages without the need to write any code. Learn more about AutoExtract here.

Web Scraping Questions & Answers Part I

As you know we held the first ever Web Data Extraction Summit last month. During the talks, we had a lot of questions from the audience. We have divided the questions into two parts - in the first part, we will cover questions on Web Scraping at Scale - Proxy and Anti-Ban Best Practice, and Legal Compliance, GDPR in the World of Web Scraping. Enjoy! You can also check out the full talks on...