Lucene Core, our flagship sub-project, provides Java-based indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. | It lets you either batch index and search data stored in an SQL database, NoSQL storage, or just files quickly and easily — or index and search data on the fly, working with it pretty much as with a database server. | A fast, lightweight and schema-less search backend. It ingests search texts and identifier tuples that can then be queried against in microseconds. |
over 150GB/hour on modern hardware;small RAM requirements -- only 1MB heap;incremental indexing as fast as batch indexing;index size roughly 20-30% the size of text indexed;ranked searching -- best results returned first;many powerful query types: phrase queries, wildcard queries, proximity queries, range queries;fielded searching (e.g. title, author, contents);sorting by any field;multiple-index searching with merged results;allows simultaneous update and searching;flexible faceting, highlighting, joins and result grouping;fast, memory-efficient and typo-tolerant suggesters;pluggable ranking models, including the Vector Space Model and Okapi BM25;configurable storage engine (codecs) | Output formats: HTML (including Windows HTML Help), LaTeX (for printable PDF versions), ePub, Texinfo, manual pages, plain text;Extensive cross-references: semantic markup and automatic links for functions, classes, citations, glossary terms and similar pieces of information;Hierarchical structure: easy definition of a document tree, with automatic links to siblings, parents and children;Automatic indices: general index as well as a language-specific module indices;Code handling: automatic highlighting using the Pygments highlighter;Extensions: automatic testing of code snippets, inclusion of docstrings from Python modules (API docs), and more | Search terms are stored in collections, organized in buckets; Search results return object identifiers;; Search query typos are corrected; Insert and remove items in the index; Auto-complete any word in real-time;; Full Unicode compatibility; Networked channel interface; Easy-to-use libraries |
Statistics | ||
GitHub Stars - | GitHub Stars - | GitHub Stars 21.0K |
GitHub Forks - | GitHub Forks - | GitHub Forks 604 |
Stacks 173 | Stacks 906 | Stacks 4 |
Followers 230 | Followers 300 | Followers 24 |
Votes 2 | Votes 32 | Votes 0 |
Pros & Cons | ||
Pros
| Pros
| No community feedback yet |
Integrations | ||

It builds completely static HTML sites that you can host on GitHub pages, Amazon S3, or anywhere else you choose. There's a stack of good looking themes available. The built-in dev-server allows you to preview your documentation as you're writing it. It will even auto-reload and refresh your browser whenever you save your changes.

Search the world's information, including webpages, images, videos and more. Google has many special features to help you find exactly what you're looking for.

An open-source, high-performance, distributed SQL database built for resilience and scale. Re-uses the upper half of PostgreSQL to offer advanced RDBMS features, architected to be fully distributed like Google Spanner.

Searchkick learns what your users are looking for. As more people search, it gets smarter and the results get better. It’s friendly for developers - and magical for your users.

It uses the tools you use to make application building a snap. It is built on the battle-tested Apache Zookeeper, it makes it easy to scale up and down.

It is an open-source Vector Search Engine and Vector Database written in Rust. It deploys as an API service providing search for the nearest high-dimensional vectors. With Qdrant, embeddings or neural network encoders can be turned into full-fledged applications for matching, searching, recommending, and much more.

It is an open-source vector search engine. It allows you to store data objects and vector embeddings from your favorite ML-models, and scale seamlessly into billions of data objects.

We help your website visitors find what they are looking for. AddSearch is a lightning fast, accurate and customizable site search engine with a Search API. AddSearch works on all devices and is easy to install, customize and tweak.
It is a C++ based full-text search engine including similarity ranking capabilities natively integrated into ArangoDB. It allows users to combine two information retrieval techniques: boolean and generalized ranking retrieval. Search results “approved” by the boolean model can be ranked by relevance to the respective query using the Vector Space Model in conjunction with BM25 or TFIDF weighting schemes.

It organizes your search results into topics. With an instant overview of what's available, you will quickly find what you're looking for.