Lucene

A high-performance, full-featured text search engine library written entirely in Java

What is Lucene?

Lucene Core, our flagship sub-project, provides Java-based indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities.
Lucene is a tool in the Search Engines category of a tech stack.
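
To make "indexing and search" concrete, here is a minimal sketch of an inverted index, the general data structure behind full-text search libraries such as Lucene. It uses only the Java standard library and does not call the Lucene API. The class name InvertedIndexSketch, the naive tokenizer, and the sample documents are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical illustration class; not part of the Lucene API.
final class InvertedIndexSketch {
    // term -> sorted set of ids of the documents that contain the term
    private final Map<String, TreeSet<Integer>> postings = new TreeMap<>();
    private int nextDocId = 0;

    // Naive analysis step: lowercase, then split on anything that is not a-z or 0-9.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Indexes a document and returns the id assigned to it.
    int add(String text) {
        int docId = nextDocId++;
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
        return docId;
    }

    // Returns the ids of documents that contain every query term (AND semantics).
    Set<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : tokenize(query)) {
            TreeSet<Integer> docs = postings.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        return result == null ? new TreeSet<>() : result;
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add("Lucene is a search library");   // doc 0
        index.add("Solr is built on Lucene");      // doc 1
        index.add("MkDocs builds static sites");   // doc 2
        System.out.println(index.search("lucene"));        // prints [0, 1]
        System.out.println(index.search("lucene search")); // prints [0]
    }
}
```

A real search library adds much more on top of this idea, such as configurable analysis, scoring, and on-disk storage; the sketch only shows why term lookups stay fast as the document count grows.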

Who uses Lucene?

Companies
40 companies reportedly use Lucene in their tech stacks, including Twitter, Slack, and Evernote.

Developers
72 developers on StackShare have stated that they use Lucene.

Lucene Reviews

Here are some stack decisions, common use cases and reviews by companies and developers who chose Lucene in their tech stack.

StackShare Editors
Solr
Lucene

"Slack provides two strategies for searching: Recent and Relevant. Recent search finds the messages that match all terms and presents them in reverse chronological order. If a user is trying to recall something that just happened, Recent is a useful presentation of the results.

Relevant search relaxes the age constraint and takes into account the Lucene score of the document — how well it matches the query terms (Solr powers search at Slack). Used about 17% of the time, Relevant search performed slightly worse than Recent according to the search quality metrics we measured: the number of clicks per search and the click-through rate of the search results in the top several positions. We recognized that Relevant search could benefit from using the user’s interaction history with channels and other users — their ‘work graph’."
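
The difference between the two strategies in the quote can be sketched as two orderings of the same set of matching messages. This is not Slack's implementation: the class RecentVsRelevantSketch, the Hit type, and the timestamp and score values are hypothetical, and "Relevant" is simplified here to a pure sort by score, whereas the quote only says the Lucene score is taken into account.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration; not Slack's or Lucene's actual code.
final class RecentVsRelevantSketch {
    // A message that already matched the query. `score` stands in for the Lucene score.
    static final class Hit {
        final String id;
        final long timestamp;
        final double score;

        Hit(String id, long timestamp, double score) {
            this.id = id;
            this.timestamp = timestamp;
            this.score = score;
        }
    }

    // "Recent": newest matching message first (reverse chronological order).
    static List<String> recent(List<Hit> hits) {
        List<Hit> sorted = new ArrayList<>(hits);
        sorted.sort(Comparator.comparingLong((Hit h) -> h.timestamp).reversed());
        return ids(sorted);
    }

    // "Relevant" (simplified): best-scoring message first, regardless of age.
    static List<String> relevant(List<Hit> hits) {
        List<Hit> sorted = new ArrayList<>(hits);
        sorted.sort(Comparator.comparingDouble((Hit h) -> h.score).reversed());
        return ids(sorted);
    }

    private static List<String> ids(List<Hit> hits) {
        List<String> out = new ArrayList<>();
        for (Hit h : hits) {
            out.add(h.id);
        }
        return out;
    }

    public static void main(String[] args) {
        List<Hit> hits = List.of(
                new Hit("m1", 1000L, 2.5),   // oldest, best match
                new Hit("m2", 2000L, 0.7),
                new Hit("m3", 3000L, 1.3));  // newest
        System.out.println(recent(hits));    // prints [m3, m2, m1]
        System.out.println(relevant(hits));  // prints [m1, m3, m2]
    }
}
```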

Lucene's Features

  • indexing throughput of over 150GB/hour on modern hardware
  • small RAM requirements -- only 1MB heap
  • incremental indexing as fast as batch indexing
  • index size roughly 20-30% the size of text indexed
  • ranked searching -- best results returned first
  • many powerful query types: phrase queries, wildcard queries, proximity queries, range queries
  • fielded searching (e.g. title, author, contents)
  • sorting by any field
  • multiple-index searching with merged results
  • allows simultaneous update and searching
  • flexible faceting, highlighting, joins and result grouping
  • fast, memory-efficient and typo-tolerant suggesters
  • pluggable ranking models, including the Vector Space Model and Okapi BM25
  • configurable storage engine (codecs)
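
The list above names Okapi BM25 as one of the pluggable ranking models. As a rough illustration of what such a model computes, the sketch below scores a single query term against a single document using the textbook BM25 formula. It is plain Java with no Lucene dependency; the class name Bm25Sketch and the sample numbers are assumptions, the values k1 = 1.2 and b = 0.75 are commonly used defaults, and Lucene's own implementation may differ in detail.

```java
// Hypothetical illustration of the textbook BM25 formula; not Lucene's implementation.
final class Bm25Sketch {
    static final double K1 = 1.2;   // term-frequency saturation (commonly used default)
    static final double B = 0.75;   // strength of document-length normalization

    // Inverse document frequency: terms that occur in fewer documents weigh more.
    static double idf(long docCount, long docFreq) {
        return Math.log(1.0 + (docCount - docFreq + 0.5) / (docFreq + 0.5));
    }

    // BM25 contribution of one query term to one document's score.
    static double termScore(double termFreq, double docLength, double avgDocLength,
                            long docCount, long docFreq) {
        double norm = K1 * (1.0 - B + B * docLength / avgDocLength);
        return idf(docCount, docFreq) * (termFreq * (K1 + 1.0)) / (termFreq + norm);
    }

    public static void main(String[] args) {
        long docCount = 1000;   // documents in the (made-up) collection
        long docFreq = 10;      // documents that contain the term
        double avgLen = 100.0;  // average document length in terms

        // Same term frequency, different lengths: the shorter document scores higher.
        System.out.println("short doc: " + termScore(3, 50, avgLen, docCount, docFreq));
        System.out.println("long doc:  " + termScore(3, 400, avgLen, docCount, docFreq));
    }
}
```

A full query score would sum termScore over every query term; the sketch leaves that out to keep the formula itself visible.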

Lucene Alternatives & Comparisons

What are some alternatives to Lucene?
Solr
Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, near real-time indexing, dynamic clustering, database integration, rich document (e.g., Word, PDF) handling, and geospatial search. Solr is highly reliable, scalable and fault tolerant, providing distributed indexing, replication and load-balanced querying, automated failover and recovery, centralized configuration and more. Solr powers the search and navigation features of many of the world's largest internet sites.
Elasticsearch
Elasticsearch is a distributed, RESTful search and analytics engine capable of storing data and searching it in near real time. Elasticsearch, Kibana, Beats and Logstash make up the Elastic Stack (sometimes called the ELK Stack).
Sphinx
It lets you either batch index and search data stored in a SQL database, NoSQL storage, or plain files quickly and easily, or index and search data on the fly, working with it much as you would with a database server.
Apache Solr
It uses the tools you already use to make application building a snap. Built on the battle-tested Apache ZooKeeper, it makes it easy to scale up and down.
MkDocs
It builds completely static HTML sites that you can host on GitHub Pages, Amazon S3, or anywhere else you choose. There's a stack of good-looking themes available. The built-in dev server lets you preview your documentation as you write it, and it will even auto-reload and refresh your browser whenever you save your changes.

Lucene's Followers
90 developers follow Lucene to keep up with related blogs and decisions.
Nurullah Özdemir
Kevin Grimes
Mihai Raulea
Pranav Waila
Navid Mehralizadeh
Max Levy
lyc1116
FIRAT OLTULU
John Alton
Sudhanshu Gupta