Category: Portia

This Month in Open Source at Scrapinghub March 2016

Welcome to This Month in Open Source at Scrapinghub! In this monthly column, we share all the latest updates on our open source projects including Scrapy, Splash, Portia, and Frontera. If you’re interested in learning more or even becoming a contributor, reach out to us by email at opensource [@] scrapinghub.com or on Twitter @scrapinghub.

Scrapy

The big news for Scrapy lately is that Python 3 is now supported for the majority of use cases, the exceptions being FTP and…

Migrate your Kimono Projects to Portia

Heads up, Kimono Labs users! Today, we are releasing a tool to help you migrate your Kimono projects to Portia. All you have to do is provide your Kimono credentials and let it convert your Kimono projects into Portia projects. You will then be able to run those projects on Scrapy Cloud or on your own Portia instance, since Portia is open source. Stay tuned for the Portia 2.0 beta release coming soon! Portia 2.0 comes with a brand new user interface…

Portia: The Open Source Alternative to Kimono Labs

Attention Kimono users: we’ve created an exporter so you can easily convert your projects from Kimono to Portia! Imagine your business depended heavily on a third-party tool and one day that company decided to shut down its service with only two weeks’ notice. That, unfortunately, is what happened to users of Kimono Labs yesterday. And it’s one of the many reasons why we love open source so much. Portia is an open source visual scraping tool developed by Scrapinghub…

The Road to Loading JavaScript in Portia

Support for JavaScript has been a much-requested feature ever since Portia’s first release two years ago. The wait is nearly over, and we are happy to inform you that we will be launching these changes in the very near future. If you’re feeling adventurous, you can try it out on the develop branch on GitHub. This post aims to highlight the path we took to achieving JavaScript support in Portia.

The Plan

As with everything in software, we started…

Scrape Data Visually with Portia and Scrapy Cloud

It’s been several months since we first integrated Portia into our Scrapy Cloud platform, and last week we officially began to phase out Autoscraping in favor of Portia. In case you aren’t familiar with Portia, it’s an open source tool we developed for visually scraping websites. Portia lets you make templates of the pages you want to scrape and uses those templates to create a spider that scrapes similar pages. Autoscraping is Portia’s predecessor, and for the time…
