Web Scraping and Data Extraction. ParseHub is a free and powerful web scraping tool. With our advanced web scraper, extracting data is as easy as clicking on the data you need. ParseHub lets you turn any website into a spreadsheet or API w | Solr is the popular, blazing-fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, near real-time indexing, dynamic clustering, database integration, rich document (e.g., Word, PDF) handling, and geospatial search. Solr is highly reliable, scalable, and fault-tolerant, providing distributed indexing, replication and load-balanced querying, automated failover and recovery, centralized configuration, and more. Solr powers the search and navigation features of many of the world's largest internet sites. |
Works with single-page apps, multi-page apps; Uses machine learning for its state-of-the-art relationship engine; Instantly shows sample data as you're working | Advanced full-text search capabilities; Optimized for high volume web traffic; Standards-based open interfaces - XML, JSON and HTTP; Comprehensive HTML administration interfaces; Server statistics exposed over JMX for monitoring; Linearly scalable, auto index replication, auto-failover and recovery; Near real-time indexing; Flexible and adaptable with XML configuration; Extensible plugin architecture |
Statistics | |
Stacks 32 | Stacks 805 |
Followers 92 | Followers 644 |
Votes 19 | Votes 126 |
Integrations | |
| No integrations available | |
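
Solr's feature list above mentions standards-based interfaces over XML, JSON and HTTP. As a rough illustration of what that means in practice, the sketch below builds a query URL for Solr's `/select` handler and reads the documents out of a JSON response. The host, the `products` collection name, the field names, and the sample response body are all hypothetical, and no network request is made; a real deployment may differ.

```python
import json
from urllib.parse import urlencode


def build_select_url(base_url, collection, query, rows=10):
    """Build a URL for Solr's /select handler that asks for JSON output."""
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return f"{base_url}/{collection}/select?{params}"


def extract_docs(response_text):
    """Return (numFound, docs) from a standard Solr JSON response body."""
    body = json.loads(response_text)
    result = body["response"]
    return result["numFound"], result["docs"]


# Hypothetical host and collection, used only for illustration.
url = build_select_url(
    "http://localhost:8983/solr", "products", "title:laptop", rows=2
)

# Hand-written sample shaped like a Solr JSON response (not real output).
sample_response = json.dumps(
    {
        "responseHeader": {"status": 0, "QTime": 1},
        "response": {
            "numFound": 2,
            "start": 0,
            "docs": [
                {"id": "1", "title": "Laptop A"},
                {"id": "2", "title": "Laptop B"},
            ],
        },
    }
)

num_found, docs = extract_docs(sample_response)
titles = [doc["title"] for doc in docs]
```

In a real client, the URL would be fetched with any HTTP library and the response text passed to `extract_docs`.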

Our mission is to make you a search expert. Push data to our API to make it searchable in real time. Build your dream front end with one of our web or mobile UI libraries. Tune relevance and get analytics right from your dashboard.

import.io is a free web-based platform that puts the power of the machine-readable web in your hands. Using our tools, you can create an API or crawl an entire website in a fraction of the time of traditional methods, no coding required.

Extract data from websites and turn it into an API. We handle all the rotating proxies and Chrome rendering for you. Many specialists have to handle JavaScript rendering, headless browser updates and maintenance, and proxy diversity and rotation. It is a simple API that does all of the above for you.

It is free client-side Windows web scraping software that turns unstructured or semi-structured data from websites into structured data sets, no coding necessary. Extracted data can be exported as an API, CSV, or Excel file, or into a database.

dejaVu fits the unmet need of being a hackable data browser for Elasticsearch. Existing browsers were either built with a legacy UI and offered a poor user experience or used server-side rendering (I am looking at you, Kibana).

You don't need to write any code or install any software to extract data with Kimono. The easiest way to use Kimono is to add our bookmarklet to your browser's bookmark bar. Then go to the website you want to get data from and click the bookmarklet. Select the data you want and Kimono does the rest. We take care of hosting the APIs that you build with Kimono and running them on the schedule you specify. Use the API output in JSON or as CSV files that you can easily paste into a spreadsheet.

It works with your favorite parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. It commonly saves programmers hours or days of work.
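
The blurb above does not name its library, so the sketch below does not attempt to show that library's API. As a stand-in, it uses Python's standard-library `xml.etree.ElementTree` to illustrate the three operations the blurb lists: navigating, searching, and modifying a parse tree. The markup, class names, and the `data-seen` attribute are made up for the example.

```python
import xml.etree.ElementTree as ET

# Small, well-formed sample document (hypothetical content).
markup = (
    "<html><body><h1>Tools</h1><ul>"
    "<li class='search'>Solr</li>"
    "<li class='scrape'>ParseHub</li>"
    "</ul></body></html>"
)

root = ET.fromstring(markup)

# Navigate: follow a path from the root to a specific element.
heading = root.find("body/h1").text

# Search: walk every <li> in the tree and filter on an attribute.
search_items = [li.text for li in root.iter("li") if li.get("class") == "search"]

# Modify: add an attribute to each <li>, then serialize the tree again.
for li in root.iter("li"):
    li.set("data-seen", "true")

serialized = ET.tostring(root, encoding="unicode")
```

`ElementTree` only accepts well-formed markup; tolerating messy real-world HTML is one reason dedicated scraping libraries exist.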

Apify is a platform that enables developers to create, customize and run cloud-based programs called actors that can, among other things, be used to extract data from any website using a few lines of JavaScript.

Elassandra is a fork of Elasticsearch modified to run on top of Apache Cassandra in a scalable and resilient peer-to-peer architecture. Elasticsearch code is embedded in Cassandra nodes, providing advanced search features on Cassandra tables, and Cassandra serves as an Elasticsearch data and configuration store.

It is a full-text search engine library inspired by Apache Lucene and written in Rust. It is not an off-the-shelf search engine server, but rather a crate that can be used to build such a search engine.