Docker--Bitnami/kafka

Bitnami package for Apache Kafka

What is Apache Kafka?

Apache Kafka is a distributed streaming platform designed to build real-time pipelines and can be used as a message broker or as a replacement for a log aggregation solution for big data applications.

Overview of Apache Kafka⁠ Trademarks: This software listing is packaged by Bitnami. The respective trademarks mentioned in the offering are owned by the respective companies, and use of them does not imply any affiliation or endorsement.

TL;DR

复制代码

docker run --name mysql -e ALLOW_EMPTY_PASSWORD=yes bitnami/mysql:latest

Why use Bitnami Images?

Bitnami closely tracks upstream source changes and promptly publishes new versions of this image using our automated systems.
With Bitnami images the latest bug fixes and features are available as soon as possible.
Bitnami containers, virtual machines and cloud images use the same components and configuration approach - making it easy to switch between formats based on your project needs.
All our images are based on minideb⁠ -a minimalist Debian based container image that gives you a small base container image and the familiarity of a leading Linux distribution- or scratch -an explicitly empty image-.
All Bitnami images available in Docker Hub are signed with Notation⁠. Check this post⁠ to know how to verify the integrity of the images.
Bitnami container images are released on a regular basis with the latest distribution packages available.

Looking to use Apache Kafka in production? Try VMware Tanzu Application Catalog⁠, the commercial edition of the Bitnami catalog.

How to deploy Apache Kafka in Kubernetes?

Deploying Bitnami applications as Helm Charts is the easiest way to get started with our applications on Kubernetes. Read more about the installation in the Bitnami Apache Kafka Chart GitHub repository⁠.

Bitnami containers can be used with Kubeapps⁠ for deployment and management of Helm Charts in clusters.

Why use a non-root container?

Non-root container images add an extra layer of security and are generally recommended for production environments. However, because they run as a non-root user, privileged tasks are typically off-limits. Learn more about non-root containers in our docs⁠.

Only latest stable branch maintained in the free Bitnami catalog

Starting December 10th 2024, only the latest stable branch of any container will receive updates in the free Bitnami catalog. To access up-to-date releases for all upstream-supported branches, consider upgrading to Bitnami Premium. Previous versions already released will not be deleted. They are still available to pull from DockerHub.

Please check the Bitnami Premium page in our partner Arrow Electronics⁠ for more information.

Supported tags and respective `Dockerfile` links

Learn more about the Bitnami tagging policy and the difference between rolling tags and immutable tags in our documentation page⁠.

You can see the equivalence between the different tags by taking a look at the tags-info.yaml file present in the branch folder, i.e bitnami/ASSET/BRANCH/DISTRO/tags-info.yaml.

Subscribe to project updates by watching the bitnami/containers GitHub repo⁠.

Get this image

The recommended way to get the Bitnami Apache Kafka Docker Image is to pull the prebuilt image from the Docker Hub Registry.

复制代码

docker pull bitnami/mysql:latest

To use a specific version, you can pull a versioned tag. You can view the list of available versions in the Docker Hub Registry.

复制代码

docker pull bitnami/mysql:[TAG]

If you wish, you can also build the image yourself by cloning the repository, changing to the directory containing the Dockerfile and executing the docker build command. Remember to replace the APP, VERSION and OPERATING-SYSTEM path placeholders in the example command below with the correct values.

复制代码

git clone https://github.com/bitnami/containers.git
cd bitnami/APP/VERSION/OPERATING-SYSTEM
docker build -t bitnami/APP:latest .

Persisting your data

If you remove the container all your data and configurations will be lost, and the next time you run the image the database will be reinitialized. To avoid this loss of data, you should mount a volume that will persist even after the container is removed.

Note: If you have already started using your database, follow the steps on backing up and restoring to pull the data from your running container down to your host.

The image exposes a volume at /bitnami/kafka for the Apache Kafka data. For persistence you can mount a directory at this location from your host. If the mounted directory is empty, it will be initialized on the first run.

Using Docker Compose:

This requires a minor change to the docker-compose.yml⁠ file present in this repository:

复制代码

kafka:
  ...
  volumes:
    - /path/to/kafka-persistence:/bitnami/kafka
  ...

NOTE: As this is a non-root container, the mounted files and directories must have the proper permissions for the UID 1001.

Connecting to other containers

Using Docker container networking⁠, an Apache Kafka server running inside a container can easily be accessed by your application containers.

Containers attached to the same network can communicate with each other using the container name as the hostname.

Using the Command Line

In this example, we will create an Apache Kafka client instance that will connect to the server instance that is running on the same docker network as the client.

Step 1: Create a network

复制代码

docker network create app-tier --driver bridge

Step 2: Launch the Apache Kafka server instance

Use the --network app-tier argument to the docker run command to attach the Apache Kafka container to the app-tier network.

复制代码

docker run -d --name kafka-server --hostname kafka-server \
    --network app-tier \
    -e KAFKA_CFG_NODE_ID=0 \
    -e KAFKA_CFG_PROCESS_ROLES=controller,broker \
    -e KAFKA_CFG_LISTENERS=PLAINTEXT://:9092,CONTROLLER://:9093 \
    -e KAFKA_CFG_LISTENER_SECURITY_PROTOCOL_MAP=CONTROLLER:PLAINTEXT,PLAINTEXT:PLAINTEXT \
    -e KAFKA_CFG_CONTROLLER_QUORUM_VOTERS=0@kafka-server:9093 \
    -e KAFKA_CFG_CONTROLLER_LISTENER_NAMES=CONTROLLER \
    bitnami/kafka:latest

Step 3: Launch your Apache Kafka client instance

Finally we create a new container instance to launch the Apache Kafka client and connect to the server created in the previous step:

复制代码

docker run -it --rm \
    --network app-tier \
    bitnami/kafka:latest kafka-topics.sh --list  --bootstrap-server kafka-server:9092

Using a Docker Compose file

When not specified, Docker Compose automatically sets up a new network and attaches all deployed services to that network. However, we will explicitly define a new bridge network named app-tier. In this example we assume that you want to connect to the Apache Kafka server from your own custom application image which is identified in the following snippet by the service name myapp.

复制代码

version: '2'

networks:
  app-tier:
    driver: bridge

services:
  kafka:
    image: 'bitnami/kafka:latest'
    networks:
      - app-tier
    environment:
      - KAFKA_CFG_NODE_ID=0
      - KAFKA_CFG_PROCESS_ROLES=controller,broker
      - KAFKA_CFG_LISTENERS=PLAINTEXT://:9092,CONTROLLER://:9093
      - KAFKA_CFG_LISTENER_SECURITY_PROTOCOL_MAP=CONTROLLER:PLAINTEXT,PLAINTEXT:PLAINTEXT
      - KAFKA_CFG_CONTROLLER_QUORUM_VOTERS=0@kafka:9093
      - KAFKA_CFG_CONTROLLER_LISTENER_NAMES=CONTROLLER
  myapp:
    image: 'YOUR_APPLICATION_IMAGE'
    networks:
      - app-tier

IMPORTANT:

Please update the YOUR_APPLICATION_IMAGE placeholder in the above snippet with your application image

In your application container, use the hostname kafka to connect to the Apache Kafka server

Launch the containers using:

复制代码

docker-compose up -d

Configuration

Environment variables

Customizable environment variables

Name	Description	Default Value
`KAFKA_MOUNTED_CONF_DIR`	Kafka directory for mounted configuration files.	`${KAFKA_VOLUME_DIR}/config`
`KAFKA_INTER_BROKER_USER`	Kafka inter broker communication user.	`user`
`KAFKA_INTER_BROKER_PASSWORD`	Kafka inter broker communication password.	`bitnami`
`KAFKA_CONTROLLER_USER`	Kafka control plane communication user.	`controller_user`
`KAFKA_CONTROLLER_PASSWORD`	Kafka control plane communication password.	`bitnami`
`KAFKA_CERTIFICATE_PASSWORD`	Password for certificates.	`nil`
`KAFKA_TLS_TRUSTSTORE_FILE`	Kafka truststore file location.	`nil`
`KAFKA_TLS_TYPE`	Choose the TLS certificate format to use.	`JKS`
`KAFKA_TLS_CLIENT_AUTH`	Configures kafka broker to request client authentication.	`required`
`KAFKA_OPTS`	Kafka deployment options.	`nil`
`KAFKA_CFG_SASL_ENABLED_MECHANISMS`	Kafka `sasl.enabled.mechanisms` configuration override.	`PLAIN,SCRAM-SHA-256,SCRAM-SHA-512`
`KAFKA_KRAFT_CLUSTER_ID`	Kafka cluster ID when using Kafka Raft mode (KRaft).	`nil`
`KAFKA_SKIP_KRAFT_STORAGE_INIT`	If set to true, skip Kraft storage initialization when process.roles are configured.	`false`
`KAFKA_CLIENT_LISTENER_NAME`	Name of the listener intended to be used by clients, if set, configures the producer/consumer accordingly.	`nil`
`KAFKA_ZOOKEEPER_PROTOCOL`	Authentication protocol for Zookeeper connections. Allowed protocols: `PLAINTEXT`, `SASL, SSL`, and `SASL_SSL`.	`PLAINTEXT`
`KAFKA_ZOOKEEPER_PASSWORD`	Kafka Zookeeper user password for SASL authentication.	`nil`
`KAFKA_ZOOKEEPER_USER`	Kafka Zookeeper user for SASL authentication.	`nil`
`KAFKA_ZOOKEEPER_TLS_KEYSTORE_PASSWORD`	Kafka Zookeeper keystore file password and key password.	`nil`
`KAFKA_ZOOKEEPER_TLS_TRUSTSTORE_PASSWORD`	Kafka Zookeeper truststore file password.	`nil`
`KAFKA_ZOOKEEPER_TLS_TRUSTSTORE_FILE`	Kafka Zookeeper truststore file location.	`nil`
`KAFKA_ZOOKEEPER_TLS_VERIFY_HOSTNAME`	Verify Zookeeper hostname on TLS certificates.	`true`
`KAFKA_ZOOKEEPER_TLS_TYPE`	Choose the TLS certificate format to use. Allowed values: `JKS`, `PEM`.	`JKS`
`KAFKA_CLIENT_USERS`	List of additional users to `KAFKA_CLIENT_USER` that will be created into Zookeeper when using `SASL_SCRAM` for client communications. Separated by commas, semicolons or whitespaces.	`user`
`KAFKA_CLIENT_PASSWORDS`	Passwords for the users specified at `KAFKA_CLIENT_USERS`. Separated by commas, semicolons or whitespaces.	`bitnami`
`KAFKA_HEAP_OPTS`	Kafka heap options for Java.	`-Xmx1024m -Xms1024m`
`JAVA_TOOL_OPTIONS`	Java tool options.	`nil`

Read-only environment variables

Name	Description	Value
`KAFKA_BASE_DIR`	Kafka installation directory.	`${BITNAMI_ROOT_DIR}/kafka`
`KAFKA_VOLUME_DIR`	Kafka persistence directory.	`/bitnami/kafka`
`KAFKA_DATA_DIR`	Kafka directory where data is stored.	`${KAFKA_VOLUME_DIR}/data`
`KAFKA_CONF_DIR`	Kafka configuration directory.	`${KAFKA_BASE_DIR}/config`
`KAFKA_CONF_FILE`	Kafka configuration file.	`${KAFKA_CONF_DIR}/server.properties`
`KAFKA_CERTS_DIR`	Kafka directory for certificate files.	`${KAFKA_CONF_DIR}/certs`
`KAFKA_INITSCRIPTS_DIR`	Kafka directory for init scripts.	`/docker-entrypoint-initdb.d`
`KAFKA_LOG_DIR`	Directory where Kafka logs are stored.	`${KAFKA_BASE_DIR}/logs`
`KAFKA_HOME`	Kafka home directory.	`$KAFKA_BASE_DIR`
`KAFKA_DAEMON_USER`	Kafka system user.	`kafka`
`KAFKA_DAEMON_GROUP`	Kafka system group.	`kafka`

Additionally, any environment variable beginning with KAFKA_CFG_ will be mapped to its corresponding Apache Kafka key. For example, use KAFKA_CFG_BACKGROUND_THREADS in order to set background.threads or KAFKA_CFG_AUTO_CREATE_TOPICS_ENABLE in order to configure auto.create.topics.enable.

复制代码

docker run --name kafka -e KAFKA_CFG_PROCESS_ROLES ... -e KAFKA_CFG_AUTO_CREATE_TOPICS_ENABLE=true bitnami/kafka:latest

or by modifying the docker-compose.yml⁠ file present in this repository:

复制代码

kafka:
  ...
  environment:
    - KAFKA_CFG_AUTO_CREATE_TOPICS_ENABLE=true
  ...

Apache Kafka development setup example

To use Apache Kafka in a development setup, create the following docker-compose.yml file:

复制代码

version: "3"
services:
  kafka:
    image: 'bitnami/kafka:latest'
    ports:
      - '9092:9092'
    environment:
      - KAFKA_CFG_NODE_ID=0
      - KAFKA_CFG_PROCESS_ROLES=controller,broker
      - KAFKA_CFG_LISTENERS=PLAINTEXT://:9092,CONTROLLER://:9093
      - KAFKA_CFG_LISTENER_SECURITY_PROTOCOL_MAP=CONTROLLER:PLAINTEXT,PLAINTEXT:PLAINTEXT
      - KAFKA_CFG_CONTROLLER_QUORUM_VOTERS=0@kafka:9093
      - KAFKA_CFG_CONTROLLER_LISTENER_NAMES=CONTROLLER

To deploy it, run the following command in the directory where the docker-compose.yml file is located:

复制代码

docker-compose up -d

Kafka with Zookeeper

Apache Kafka Raft (KRaft) makes use of a new quorum controller service in Kafka which replaces the previous controller and makes use of an event-based variant of the Raft consensus protocol. This greatly simplifies Kafka's architecture by consolidating responsibility for metadata into Kafka itself, rather than splitting it between two different systems: ZooKeeper and Kafka.

More Info can be found here: https://developer.confluent.io/learn/kraft/⁠

NOTE: According to KIP-833⁠, KRaft is now in a production-ready state.

However, if you want to keep using ZooKeeper, you can use the following configuration:

复制代码

version: "2"

services:
  zookeeper:
    image: docker.io/bitnami/zookeeper:3.9
    ports:
      - "2181:2181"
    volumes:
      - "zookeeper_data:/bitnami"
    environment:
      - ALLOW_ANONYMOUS_LOGIN=yes
  kafka:
    image: docker.io/bitnami/kafka:3.4
    ports:
      - "9092:9092"
    volumes:
      - "kafka_data:/bitnami"
    environment:
      - KAFKA_CFG_ZOOKEEPER_CONNECT=zookeeper:2181
    depends_on:
      - zookeeper

volumes:
  zookeeper_data:
    driver: local
  kafka_data:
    driver: local

Accessing Apache Kafka with internal and external clients

In order to use internal and external clients to access Apache Kafka brokers you need to configure one listener for each kind of client.

To do so, add the following environment variables to your docker-compose:

复制代码

    environment:
      - KAFKA_CFG_NODE_ID=0
      - KAFKA_CFG_PROCESS_ROLES=controller,broker
      - KAFKA_CFG_CONTROLLER_QUORUM_VOTERS=0@<your_host>:9093
+     - KAFKA_CFG_LISTENERS=PLAINTEXT://:9092,CONTROLLER://:9093,EXTERNAL://:9094
+     - KAFKA_CFG_ADVERTISED_LISTENERS=PLAINTEXT://kafka:9092,EXTERNAL://localhost:9094
+     - KAFKA_CFG_LISTENER_SECURITY_PROTOCOL_MAP=CONTROLLER:PLAINTEXT,EXTERNAL:PLAINTEXT,PLAINTEXT:PLAINTEXT
      - KAFKA_CFG_CONTROLLER_LISTENER_NAMES=CONTROLLER

And expose the external port:

(the internal, client one can still be used within the docker network)

复制代码

    ports:
-     - '9092:9092'
+     - '9094:9094'

Note : To connect from an external machine, change localhost above to your host's external IP/hostname and include EXTERNAL://0.0.0.0:9094 in KAFKA_CFG_LISTENERS to allow for remote connections.

Producer and consumer using external client

These clients, from the same host, will use localhost to connect to Apache Kafka.

复制代码

services:
  mysql:
  ...
    environment:
      - ALLOW_EMPTY_PASSWORD=yes
      - MYSQL_DATABASE=my_database
  ...

If running these commands from another machine, change the address accordingly.

Producer and consumer using internal client

These clients, from other containers on the same Docker network, will use the kafka container service hostname to connect to Apache Kafka.

复制代码

kafka-console-producer.sh --producer.config /opt/bitnami/kafka/config/producer.properties --bootstrap-server kafka:9092 --topic test
kafka-console-consumer.sh --consumer.config /opt/bitnami/kafka/config/consumer.properties --bootstrap-server kafka:9092 --topic test --from-beginning

Similarly, application code will need to use bootstrap.servers=kafka:9092

More info about Apache Kafka listeners can be found in this great article⁠

Security

In order to configure authentication, you must configure the Apache Kafka listeners properly. Let's see an example to configure Apache Kafka with SASL_SSL authentication for communications with clients, and SASL authentication for controller-related communications.

The environment variables below should be defined to configure the listeners, and the SASL credentials for client communications:

复制代码

KAFKA_CFG_LISTENERS=SASL_SSL://:9092,CONTROLLER://:9093
KAFKA_CFG_ADVERTISED_LISTENERS=SASL_SSL://localhost:9092
KAFKA_CLIENT_USERS=user
KAFKA_CLIENT_PASSWORDS=password
KAFKA_CLIENT_LISTENER_NAME=SASL_SSL
KAFKA_CFG_LISTENER_SECURITY_PROTOCOL_MAP=CONTROLLER:SASL_PLAINTEXT,SASL_SSL:SASL_SSL
KAFKA_CFG_SASL_MECHANISM_CONTROLLER_PROTOCOL=PLAIN
KAFKA_CONTROLLER_USER=controller_user
KAFKA_CONTROLLER_PASSWORD=controller_password

You must also use your own certificates for SSL. You can drop your Java Key Stores or PEM files into /opt/bitnami/kafka/config/certs. If the JKS or PEM certs are password protected (recommended), you will need to provide it to get access to the keystores:

KAFKA_CERTIFICATE_PASSWORD=myCertificatePassword

If the truststore is mounted in a different location than /opt/bitnami/kafka/config/certs/kafka.truststore.jks, /opt/bitnami/kafka/config/certs/kafka.truststore.pem, /bitnami/kafka/config/certs/kafka.truststore.jks or /bitnami/kafka/config/certs/kafka.truststore.pem, set the KAFKA_TLS_TRUSTSTORE_FILE variable.

The following script can help you with the creation of the JKS and certificates:

$kafka-generate-ssl.sh\]([https://raw.githubusercontent.com/conflue⁠](https://raw.githubusercontent.com/conflue "https://raw.githubusercontent.com/conflue⁠")$

Docker--Bitnami/kafka

Bitnami package for Apache Kafka

What is Apache Kafka?

TL;DR

Why use Bitnami Images?

How to deploy Apache Kafka in Kubernetes?

Why use a non-root container?

Only latest stable branch maintained in the free Bitnami catalog

Supported tags and respective Dockerfile links

Get this image

Persisting your data

Connecting to other containers

Using the Command Line

Using a Docker Compose file

Configuration

Environment variables

Apache Kafka development setup example

Kafka with Zookeeper

Accessing Apache Kafka with internal and external clients

Security

Supported tags and respective `Dockerfile` links