Apache Kafka® is a distributed streaming platform. As a streaming platform, Kafka offers the option of distributing, publishing and storing data streams and, similar to a messaging queue or a messaging system, making data streams available to subscribers.
At the same time, Kafka provides the option of transforming these data streams via corresponding interfaces. Kafka is operated as a cluster on one or more servers, which can extend across several data centers. The Kafka cluster stores records in categories, which are referred to as topics.
Storage is distributed across the cluster; data replication is possible. Kafka is written in Java and harmonizes with various big data technologies, for example.