Apache Kudu
Other namesKudu
DeveloperApache Kudu Committers and PMC Members
Stable release
1.18.0 / 14 July 2025; 11 months ago (2025-07-14)[1]
Written inC++
Operating systemLinux, macOS
TypeDatabase management system, Distributed data store
LicenseApache License 2.0[2]
Websitekudu.apache.org Edit this on Wikidata
RepositoryKudu Repository

Apache Kudu is a free and open source column-oriented data store of the Apache Hadoop ecosystem. It is compatible with most of the data processing frameworks in the Hadoop environment. It provides completeness to Hadoop's storage layer to enable fast analytics on fast data.[3]

The open source project to build Apache Kudu began as internal project at Cloudera.[4] The first version Apache Kudu 1.0 was released 19 September 2016.[5]

Comparison with other storage engines

edit

Kudu was designed and optimized for OLAP workloads. Like HBase, it is a real-time store that supports key-indexed record lookup and mutation.[6] Kudu differs from HBase since Kudu's datamodel is a more traditional relational model, while HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/Bigtable".[6]

See also

edit

References

edit
  1. ^ "Apache Kudu - Releases". Retrieved 19 December 2025. Kudu 1.18.0 was released on July 14, 2025.
  2. ^ "Project Status". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21. Is Kudu open source? Yes, Kudu is open source and licensed under the Apache Software License, version 2.0. Apache Kudu is a top level project (TLP) under the umbrella of the Apache Software Foundation.
  3. ^ "Home". kudu.apache.org.
  4. ^ "Why was Kudu developed internally at Cloudera before its release?". 2017-05-21. Retrieved 2017-05-21.
  5. ^ "Apache Kudu releases". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21. Kudu 1.0.0 was released on September 19, 2016. It is the first release not considered "beta". [...] Kudu 0.5.0 (beta) was released on Sep 28, 2015. It was the first public version of Kudu.
  6. ^ a b "Why build a new storage engine? Why not just improve Apache HBase to increase its scan speed?". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21.
edit

📚 Artikel Terkait di Wikipedia

Time series database

ISSN 2150-8097. S2CID 221352039. "Benchmarking Time Series workloads on Apache Kudu using TSBS". 18 March 2020. Fu, Yupeng; Soman, Chinmay (9 June 2021)

Apache Impala

and Kudu Big Data Projects to Apache". Application Development Trends. Retrieved October 10, 2016. "The Apache Software Foundation Announces Apache Impala

Apache Parquet

open-source software portal Apache Arrow Apache Pig Apache Hive Apache Impala Apache Drill Apache Kudu Apache Spark Apache Thrift Trino (SQL query engine)

Apache Drill

including Apache Hadoop, MapR, CDH and Amazon EMR NoSQL: MongoDB, Apache HBase, Apache Cassandra Online Analytical Processing: Apache Kudu, Apache Druid,

Kudu (disambiguation)

District, Sakha Republic Apache Kudu, a column-oriented data store of the Apache Hadoop ecosystem Atlas Kudu, an airplane Kudus (disambiguation) This disambiguation

List of data science software

Programming System XploRe Tools for Data processing and analysis: AIDA Alteryx Apache Kudu Aphelion ClickHouse Cubes (OLAP server) DADiSP DAP Data Analysis Expressions

List of column-oriented DBMSes

extensions available from Imply Data. Apache Kudu C++ Released in 2016 to complete the Apache Hadoop ecosystem Apache Pinot Java Open sourced in 2015 for

Apache Spark

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit