What is Portia?
Portia is an open source tool that lets you get data from websites. It facilitates and automates the process of data extraction. This visual web scraper works straight from your browser, so you don't need to download or install anything.
Portia is a tool in the Web Scraping API category of a tech stack.
Portia is an open source tool with 8.1K GitHub stars and 1.3K GitHub forks. Here’s a link to Portia's open source repository on GitHub
Who uses Portia?
17 developers on StackShare have stated that they use Portia.
- Extracts data from websites based on visual selections by the user
- Creates generic web scrapers which are capable of extracting data from any web page with a similar structure
- Exports scraped data in CSV, JSON, JSON-lines and XML
- There is a hosted version available as a free service on Scrapy Cloud which lets Portia leverage from all the features of a cloud-based production platform including scaling and scheduling jobs, data storage, QA features, and add ons
Portia Alternatives & Comparisons
What are some alternatives to Portia?
See all alternatives
It is the most popular web scraping framework in Python. An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.
It works with your favorite parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. It commonly saves programmers hours or days of work.
import.io is a free web-based platform that puts the power of the machine readable web in your hands. Using our tools you can create an API or crawl an entire website in a fraction of the time of traditional methods, no coding required.
You don't need to write any code or install any software to extract data with Kimono. The easiest way to use Kimono is to add our bookmarklet to your browser's bookmark bar. Then go to the website you want to get data from and click the bookmarklet. Select the data you want and Kimono does the rest. We take care of hosting the APIs that you build with Kimono and running them on the schedule you specify. Use the API output in JSON or as CSV files that you can easily paste into a spreadsheet.