391
277
+ 1
37

What is Zookeeper?

A centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications.
Zookeeper is a tool in the Open Source Service Discovery category of a tech stack.

Who uses Zookeeper?

Companies
155 companies reportedly use Zookeeper in their tech stacks, including Uber, Pinterest, and Shopify.

Developers
221 developers on StackShare have stated that they use Zookeeper.

Zookeeper Integrations

Datadog, Druid, StackStorm, Kubeless, and SignalFx are some of the popular tools that integrate with Zookeeper. Here's a list of all 10 tools that integrate with Zookeeper.

Why developers like Zookeeper?

Here’s a list of reasons why companies and developers use Zookeeper
Zookeeper Reviews

Here are some stack decisions, common use cases and reviews by companies and developers who chose Zookeeper in their tech stack.

StackShare Editors
StackShare Editors
HAProxy
HAProxy
Zookeeper
Zookeeper

Early 2013

In early 2013, Airbnb tackled the problem of service discovery and load balancing in the context of a service oriented architecture (SOA) by building and releasing an open source tool called SmartStack. SmartStack is built on two other open source tools created by Airbnb called Nerve and Synapse.

Nerve is a service registration daemon that performs health checks that “creates ephemeral nodes in Zookeeper which contain information about the address/port combos for a backend available to serve requests for a particular service.”

Synapse is a transparent service discovery framework for connecting an SOA that reads the information in Zookeeper for available backends, and then uses that information to configure a local HAProxy process, which then routes requests between clients and services.

See more
John Egan
John Egan
at Pinterest · | 1 upvotes · 14.1K views
atPinterestPinterest
Zookeeper
Zookeeper

Zookeeper manages our state, and tells each node what version of code it should be running. Zookeeper

See more
tim-thimmaiah
tim-thimmaiah
Zookeeper
Zookeeper

Initially, Stitch only supported real-time updates and addressed this problem with a MapReduce job named The Restorator that performed the following actions:

  • Calculated the expected totals
  • Queried Cassandra to get the values it had for each counter
  • Calculated the increments needed to apply to fix the counters
  • Applied the increments

Meanwhile, to stop the sand shifting under its feet, The Restorator needed to coordinate a locking system between itself and the real-time processors, so that the processors did not try to simultaneously apply increments to the same counter, resulting in a race-condition. It used ZooKeeper for this. Zookeeper

See more
John Egan
John Egan
at Pinterest · | 1 upvotes · 4.4K views
atPinterestPinterest
Zookeeper
Zookeeper

Like many large scale web sites, Pinterest’s infrastructure consists of servers that communicate with backend services composed of a number of individual servers for managing load and fault tolerance. Ideally, we’d like the configuration to reflect only the active hosts, so clients don’t need to deal with bad hosts as often. ZooKeeper provides a well known pattern to solve this problem. Zookeeper

See more
Zookeeper
Zookeeper

Used Zookeeper as the resource management system for Mesos/Marathon services. Zookeeper

See more

Zookeeper Alternatives & Comparisons

What are some alternatives to Zookeeper?
Consul
Consul is a tool for service discovery and configuration. Consul is distributed, highly available, and extremely scalable.
etcd
etcd is a distributed key value store that provides a reliable way to store data across a cluster of machines. It’s open-source and available on GitHub. etcd gracefully handles master elections during network partitions and will tolerate machine failure, including the master.
Yarn
Yarn caches every package it downloads so it never needs to again. It also parallelizes operations to maximize resource utilization so install times are faster than ever.
Eureka
Eureka is a REST (Representational State Transfer) based service that is primarily used in the AWS cloud for locating services for the purpose of load balancing and failover of middle-tier servers.
Ambari
This project is aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. It provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs.
See all alternatives

Zookeeper's Followers
277 developers follow Zookeeper to keep up with related blogs and decisions.
Ryan Hicks
Glasn Host
a a
Hercial Vitalis
Lucky Pratama
Krishnan Subramanian
Suresh Grandhisiri
Anndream Co,ltd.
Matt Shields
Nicolas Iuorio