Why developers like Elasticsearch

What is Elasticsearch?

Elasticsearch is a distributed, RESTful search and analytics engine capable of storing data and searching it in near real time. Elasticsearch, Kibana, Beats and Logstash are the Elastic Stack (sometimes called the ELK Stack).

Elasticsearch is a tool in the Search as a Service category of a tech stack.

Elasticsearch is an open source tool with GitHub stars and GitHub forks. Here’s a link to Elasticsearch's open source repository on GitHub

Who uses Elasticsearch?

Companies

4100 companies reportedly use Elasticsearch in their tech stacks, including Uber, Shopify, and Udemy.

Uber

Shopify

Udemy

LaunchDarkly

Instacart

Slack

Robinhood

CRED

Tokopedia

Developers

29352 developers on StackShare have stated that they use Elasticsearch.

Pubu

Clan Of The Cloud

SaaS

flx

Scaleway

Marin Software

Personal

Hitalo Silva

Vishwajeet Vatharkar

Elasticsearch Integrations

Kibana, Logstash, Datadog, Contentful, and Couchbase are some of the popular tools that integrate with Elasticsearch. Here's a list of all 97 tools that integrate with Elasticsearch.

Kibana

Logstash

Datadog

Contentful

Couchbase

LangChain

Redash

Jaeger

Buddy

Pros of Elasticsearch

328

Powerful api

315

Great search engine

231

Open source

214

Restful

200

Near real-time search

Free

Search everything

Easy to get started

Analytics

Distributed

Fast search

More than a search engine

Great docs

Awesome, great tool

Highly Available

Easy to scale

Potato

Document Store

Great customer support

Intuitive API

Nosql DB

Great piece of software

Reliable

Fast

Easy setup

Open

Easy to get hot data

Github

Elaticsearch

Actively developing

Responsive maintainers on GitHub

Ecosystem

Not stable

Scalability

Community

Decisions about Elasticsearch

Here are some stack decisions, common use cases and reviews by companies and developers who chose Elasticsearch in their tech stack.

Simon Bettison

Managing Director at Bettison.org Limited · Mar 25, 2019 | 8 upvotes · 834.8K views

Shared insights

Bettison.org Limited

In 2012 we made the very difficult decision to entirely re-engineer our existing monolithic LAMP application from the ground up in order to address some growing concerns about it's long term viability as a platform.

Full application re-write is almost always never the answer, because of the risks involved. However the situation warranted drastic action as it was clear that the existing product was going to face severe scaling issues. We felt it better address these sooner rather than later and also take the opportunity to improve the international architecture and also to refactor the database in. order that it better matched the changes in core functionality.

PostgreSQL was chosen for its reputation as being solid ACID compliant database backend, it was available as an offering AWS RDS service which reduced the management overhead of us having to configure it ourselves. In order to reduce read load on the primary database we implemented an Elasticsearch layer for fast and scalable search operations. Synchronisation of these indexes was to be achieved through the use of Sidekiq's Redis based background workers on Amazon ElastiCache. Again the AWS solution here looked to be an easy way to keep our involvement in managing this part of the platform at a minimum. Allowing us to focus on our core business.

Rails ls was chosen for its ability to quickly get core functionality up and running, its MVC architecture and also its focus on Test Driven Development using RSpec and Selenium with Travis CI providing continual integration. We also liked Ruby for its terse, clean and elegant syntax. Though YMMV on that one!

Unicorn was chosen for its continual deployment and reputation as a reliable application server, nginx for its reputation as a fast and stable reverse-proxy. We also took advantage of the Amazon CloudFront CDN here to further improve performance by caching static assets globally.

We tried to strike a balance between having control over management and configuration of our core application with the convenience of being able to leverage AWS hosted services for ancillary functions (Amazon SES , Amazon SQS Amazon Route 53 all hosted securely inside Amazon VPC of course!).

Whilst there is some compromise here with potential vendor lock in, the tasks being performed by these ancillary services are no particularly specialised which should mitigate this risk. Furthermore we have already containerised the stack in our development using Docker environment, and looking to how best to bring this into production - potentially using Amazon EC2 Container Service

Testadmin T

May 5, 2020 | 3 upvotes · 112.2K views

Needs advice

Elasticsearch

Fauna

and

MongoDB

I would like to assess search functionality along with some analytical use cases like aggregating, faceting etc.,. I would like to know which is the best database to go with among Elasticsearch, MongoDB and FaunaDB.

Matan Fleishman

May 27, 2020 | 4 upvotes · 140.2K views

Needs advice

Elasticsearch

and

PostgreSQL

Hi, I need advice on which Database tool to use in the following scenario:

I work with Cesium, and I need to save and load CZML snapshot and update objects for a recording program that saves files containing several entities (along with the time of the snapshot or update). I need to be able to easily load the files according to the corresponding timeline point (for example, if the update was recorded at 13:15, I should be able to easily load the update file when I click on the 13:15 point on the timeline). I should also be able to make geo-queries relatively easily.

I am currently thinking about Elasticsearch or PostgreSQL, but I am open to suggestions. I tried looking into Time Series Databases like TimescaleDB but found that it is unnecessarily powerful than my needs since the update time is a simple variable.

Thanks for your advice in advance!

Nilesh Akhade

Technical Architect at Self Employed · Jul 8, 2020 | 5 upvotes · 555.2K views

Needs advice

Apache Flink

Kafka Streams

and

Apache Spark

We have a Kafka topic having events of type A and type B. We need to perform an inner join on both type of events using some common field (primary-key). The joined events to be inserted in Elasticsearch.

In usual cases, type A and type B events (with same key) observed to be close upto 15 minutes. But in some cases they may be far from each other, lets say 6 hours. Sometimes event of either of the types never come.

In all cases, we should be able to find joined events instantly after they are joined and not-joined events within 15 minutes.

Tom Pattison

Mar 16, 2020 | 1 upvote · 94K views

Chose

Elasticsearch

over

Apache Solr

Checkout-com

Elasticsearch Used for powering full-text search applications.

The latest in payments technology and company news | Checkout.com

Sunil Chaudhari

Team Lead at XYZ · Jun 15, 2020 | 2 upvotes · 572.5K views

Needs advice

Metricbeat

and

Prometheus

Hi, We have a situation, where we are using Prometheus to get system metrics from PCF (Pivotal Cloud Foundry) platform. We send that as time-series data to Cortex via a Prometheus server and built a dashboard using Grafana. There is another pipeline where we need to read metrics from a Linux server using Metricbeat, CPU, memory, and Disk. That will be sent to Elasticsearch and Grafana will pull and show the data in a dashboard.

Is it OK to use Metricbeat for Linux server or can we use Prometheus?

What is the difference in system metrics sent by Metricbeat and Prometheus node exporters?

Regards, Sunil.

See all decisions

Blog Posts

Faster Flink Adoption with Self-Service Diagnosis Tool at Pint...

Oct 6 2021 at 8:21AM

678

Open Sourcing Querybook, Pinterest’s Collaborative Big Data Hu...

Jun 23 2021 at 5:13PM

+17

10966

How Mixmax Uses Node and Go to Process 250M Events a day

Jul 1 2019 at 11:22PM

Mixmax

+12

3128

Observability with the ELK Stack

May 21 2019 at 12:20AM

Elastic

5305

Redux: Scaling LaunchDarkly From 4 to 200 Billion Feature Flag...

Mar 12 2019 at 3:11PM

LaunchDarkly

+23

6846

Stream & Go: News Feeds for Over 300 Million End Users

Jan 18 2018 at 7:43AM

Stream

+42

40941

Dubsmash: Scaling To 200 Million Users With 3 Engineers

Dec 14 2017 at 10:02AM

Dubsmash

+47

72823

The Stack That Helped Opendoor Buy and Sell Over $1B in Homes

Mar 9 2017 at 8:02AM

Opendoor

+39

31782

Jobs that mention Elasticsearch as a desired skillset

Manager I, Site Reliability Engineering

San Francisco, CA, US; , CA, US

View Job Details

+18

Staff Software Engineer - Site Reliability

Toronto, ON, CA

View Job Details

+20

Fullstack Engineer

LaunchDarkly

Oakland, California, United States

View Job Details

+15

Manager I, Site Reliability Engineering

San Francisco, CA, US; , CA, US

View Job Details

+18

Staff Software Engineer - Site Reliability

Toronto, ON, CA

View Job Details

+20

See all jobs

Elasticsearch's Features

Distributed and Highly Available Search Engine
Multi Tenant with Multi Types
Various set of APIs including RESTful
Clients available in many languages including Java, Python, .NET, C#, Groovy, and more
Document oriented
Reliable, Asynchronous Write Behind for long term persistency
(Near) Real Time Search
Built on top of Apache Lucene
Per operation consistency
Inverted indices with finite state transducers for full-text querying
BKD trees for storing numeric and geo data
Column store for analytics
Compatible with Hadoop using the ES-Hadoop connector
Open Source under Apache 2 and Elastic License

Elasticsearch Alternatives & Comparisons

What are some alternatives to Elasticsearch?

Datadog

Datadog is the leading service for cloud-scale monitoring. It is used by IT, operations, and development teams who build and operate applications that run on dynamic or hybrid cloud infrastructure. Start monitoring in minutes with Datadog!

Solr

Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, near real-time indexing, dynamic clustering, database integration, rich document (e.g., Word, PDF) handling, and geospatial search. Solr is highly reliable, scalable and fault tolerant, providing distributed indexing, replication and load-balanced querying, automated failover and recovery, centralized configuration and more. Solr powers the search and navigation features of many of the world's largest internet sites.

Lucene

Lucene Core, our flagship sub-project, provides Java-based indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities.

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

Algolia

Our mission is to make you a search expert. Push data to our API to make it searchable in real time. Build your dream front end with one of our web or mobile UI libraries. Tune relevance and get analytics right from your dashboard.

See all alternatives

Related Comparisons