Need advice about which tool to choose?Ask the StackShare community!
Lucene vs ArangoSearch: What are the differences?
What is Lucene? A high-performance, full-featured text search engine library written entirely in Java. Lucene Core, our flagship sub-project, provides Java-based indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities.
What is ArangoSearch? Full-text search engine including similarity ranking capabilities. It is a C++ based full-text search engine including similarity ranking capabilities natively integrated into ArangoDB It allows users to combine two information retrieval techniques: boolean and generalized ranking retrieval. Search results “approved” by the boolean model can be ranked by relevance to the respective query using the Vector Space Model in conjunction with BM25 or TFIDF weighting schemes..
Lucene and ArangoSearch can be primarily classified as "Search Engines" tools.
Some of the features offered by Lucene are:
- over 150GB/hour on modern hardware
- small RAM requirements -- only 1MB heap
- incremental indexing as fast as batch indexing
On the other hand, ArangoSearch provides the following key features:
- Complex Searches with Boolean Operators
- Relevance-Based Matching
- Phrase and Prefix Matching
Pros of ArangoSearch
Pros of Lucene
- Fast1
- Small1