Apache ZooKeeper
DeveloperApache Software Foundation
Stable release
3.9.5-0[1] Edit this on Wikidata / 2026-02-11; 4 months ago
Written inJava
Operating systemCross-platform
TypeDistributed computing
LicenseApache License 2.0
Websitezookeeper.apache.org
RepositoryZooKeeper Repository

Apache ZooKeeper is an open-source server for highly reliable distributed coordination of cloud applications.[2] It is a project of the Apache Software Foundation.

ZooKeeper is essentially a service for distributed systems offering a hierarchical key-value store, which is used to provide a distributed configuration service, synchronization service, and naming registry for large distributed systems (see Use cases).[3] ZooKeeper was a sub-project of Hadoop but is now a top-level Apache project in its own right.

Overview

edit

ZooKeeper's architecture supports high availability through redundant services. Clients can ask another ZooKeeper leader if the first fails to answer. ZooKeeper nodes store their data in a hierarchical name space, like a file system or a tree data structure. Clients can read from and write to the nodes and in this way have a shared configuration service. ZooKeeper can be viewed as an atomic broadcast system, through which updates are totally ordered. The ZooKeeper Atomic Broadcast (ZAB) protocol is the core of the system.[4]

ZooKeeper is used by companies including Yelp, Rackspace, Yahoo,[5] Odnoklassniki, Reddit,[6] NetApp SolidFire,[7] Meta,[8] Twitter[9] and eBay as well as open source enterprise search systems like Solr and distributed database systems like Apache Pinot.[10][11]

ZooKeeper is modeled after Google's Chubby lock service[12][13] and was originally developed at Yahoo! for streamlining the processes running on big-data clusters by storing the status in local log files on the ZooKeeper servers. These servers communicate with client machines to deliver the required information. ZooKeeper was developed to address issues that emerged during the deployment of distributed big-data applications.

Some of the prime features of Apache ZooKeeper are:

  • Reliable System: the system keeps working even if some nodes stop working.
  • Simple Architecture: there is a shared hierarchical namespace which helps coordinating the processes.
  • Fast Processing: especially fast in "read-dominant" workloads (i.e. workloads in which reads are much more common than writes).
  • Scalable: performance can be improved by adding nodes.

Architecture

edit

Some common terminologies regarding the ZooKeeper architecture:

  • Node: the systems installed on the cluster
  • ZNode: the nodes where the status is updated by other nodes in cluster
  • Client applications: the tools that interact with the distributed applications
  • Server applications: allow the client applications to interact using a common interface

The services in the cluster are replicated and stored on a set of servers (called an "ensemble"), each of which maintains an in-memory database containing the entire data tree of state as well as a transaction log and snapshots stored persistently. Multiple client applications can connect to a server, and each client maintains a TCP connection through which it sends requests and heartbeats and receives responses and watch events for monitoring.[14]

Use cases

edit

Typical use cases for ZooKeeper are:

Client libraries

edit

In addition to the client libraries included with the ZooKeeper distribution, several third-party libraries, including Apache Curator and Kazoo, extend ZooKeeper's capabilities. These libraries offer enhanced ease of use, additional features and support for a broader range of programming languages.

Apache projects using ZooKeeper

edit

See also

edit

References

edit
  1. ^ "Release 3.9.5-0". 11 February 2026. Retrieved 1 April 2026.
  2. ^ "Apache Zookeeper4". Retrieved 31 January 2021.
  3. ^ "Index - Apache ZooKeeper - Apache Software Foundation". cwiki.apache.org. Retrieved 2016-08-26.
  4. ^ "Zookeeper Overview".
  5. ^ "ZooKeeper/Powered By". Archived from the original on 2013-12-09. Retrieved 2012-01-25.
  6. ^ "Why Reddit was down on Aug 11". 16 August 2016.
  7. ^ "5 Big DaaS Challenges and How to Overcome Them | NetApp Newsroom". NetApp Newsroom. 2016-06-20. Retrieved 2017-05-24.[permanent dead link]
  8. ^ "Location-Aware Distribution: Configuring servers at scale". Facebook Code. 2018-07-19. Retrieved 2018-07-20.
  9. ^ "ZooKeeper at Twitter". Twitter Engineering Blog. 2018-10-11. Retrieved 2018-12-08.
  10. ^ "SolrCloud".
  11. ^ "Apache Pinot: Architecture".
  12. ^ Burrows, Mike (2006). "The Chubby lock service for loosely-coupled distributed systems". 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI).
  13. ^ Chandra, Tushar Deepak; Griesemer, Robert; Redstone, Joshua (2007). "Paxos Made Live - An Engineering Perspective (2006 Invited Talk)". Google Research. Retrieved 2020-03-03.
  14. ^ "Apache Zookeeper 3.9 Documentation".
  15. ^ "Apache Kafka".
edit

📚 Artikel Terkait di Wikipedia

Apache Hadoop

such as Apache Pig, Apache Hive, Apache HBase, Apache Phoenix, Apache Spark, Apache ZooKeeper, Apache Impala, Apache Flume, Apache Sqoop, Apache Oozie,

Apache Druid

failure. The cluster includes external dependencies for coordination (Apache ZooKeeper), metadata storage (e.g. MySQL, PostgreSQL, or Derby), and a deep storage

Apache Accumulo

Google's Bigtable. It is a system built on top of Apache Hadoop, Apache ZooKeeper, and Apache Thrift. Written in Java, Accumulo has cell-level access labels

Outline of the Java programming language

Java Edition NetBeans Apache Software Foundation – Apache Commons, Apache Maven, Apache Tomcat, Apache Kafka Eclipse Foundation – Adoptium, Eclipse IDE

Apache ActiveMQ

persistence store via a shared filesystem, or true replication using Apache ZooKeeper. ActiveMQ supports a horizontal scaling mechanism called a Network

Zookeeper (disambiguation)

originally released as a browser game Zookeeper (comics), a fictional character from the Teen Titans comics Apache ZooKeeper, a service for coordinating processes

Apache Pinot

Helix is embedded as an agent within the different components and uses Apache ZooKeeper for coordination and maintaining the overall cluster state and health

Etcd

Chubby, a distributed lock manager developed Google in 2006, as well as Apache ZooKeeper. It supports TLS/SSL encryption, exposes a client-facing gRPC API,