Scaling

Scaling is the process of adjusting the capacity of your Event Streams cluster to handle changing workloads. You can scale your cluster in two ways:

  • Manual scaling: You can scale your Event Streams components by modifying their configuration. For more information, see manual scaling.
  • Automatic scaling: In Event Streams 12.2.0 and later, you can enable auto-scaling, which is supported only for Kafka brokers.

If you want partitions to rebalance automatically whenever brokers are added or removed, enable auto-rebalancing before scaling your brokers. After scaling, you can verify partition reassignment to ensure that new brokers received partition assignments and that workloads are balanced.

Before you scale your Event Streams cluster, review your current configuration. The prerequisite guidance provides examples of production configurations on which you can base your deployment. To verify that your configuration meets your requirements, test the system with a workload that is representative of the expected throughput. For this purpose, Event Streams provides a workload generator application to test different message loads.

If this testing shows that your system does not have the capacity needed for the workload, whether the symptoms are excessive lag or delays, or more severe errors such as OutOfMemory errors, you can incrementally apply the increases detailed in the following sections, re-testing after each change to identify a configuration that meets your specific requirements.

A performance report based on example case studies is also available to provide guidance for setting these values.

Note: Although the testing for the report was based on Apache Kafka version 2.3.0, the performance numbers are broadly applicable to current versions of Kafka as well.

Modifying the settings

Scaling operations require updates to the EventStreams custom resource, where broker counts, resource allocations, and related configuration are defined under the spec.strimziOverrides property. For more information about modifying these settings, see modifying installation.
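
For example, one way to edit the custom resource in place is to use kubectl edit (the resource name eventstreams and the placeholders are assumptions; you can also apply an updated YAML file, as described in modifying installation):

kubectl edit eventstreams <instance-name> -n <namespace>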

Manual scaling for Event Streams components

You can scale the number of Kafka brokers, modify CPU, memory, and disk resources for broker nodes, adjust resource settings for supporting components, or change JVM options for brokers to meet workload requirements.

The following sections describe the available manual scaling options.

Increase the number of Kafka brokers in the cluster

The number of Kafka brokers is defined in the EventStreams custom resource in the spec.strimziOverrides.nodePools section.

You can scale your Kafka cluster by either:

  • Increasing the number of brokers in an existing pool by updating the replicas value.
  • Adding a new Kafka node pool by defining a new entry under spec.strimziOverrides.nodePools and specifying a unique name and the desired number of replicas.

For example, to configure Event Streams to use 6 Kafka brokers in a pool:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    nodePools:
      - name: kafka
        replicas: 6
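
To use the second option, add another entry to the nodePools list. The following sketch is illustrative only: the pool name kafka-2, the replica count, and the broker role are example values, and other settings such as storage are omitted:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    nodePools:
      - name: kafka
        replicas: 6
      - name: kafka-2
        replicas: 3
        roles:
          - broker

If you want to control which broker IDs are assigned to the new pool, you can also set the eventstreams.ibm.com/next-node-ids annotation on the pool, as shown in the storage migration example later in this section.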

Note: Controller node pools cannot be scaled or modified dynamically in KRaft mode. For more information, see KRaft limitations.

For more information about scaling up Kafka node pools, see the Strimzi documentation.

Increase the CPU request or limit settings for the Kafka brokers

The CPU settings for the Kafka brokers are defined in the EventStreams custom resource in the requests and limits properties under spec.strimziOverrides.kafka.resources (for all brokers in the Kafka cluster) or under spec.strimziOverrides.nodePools[].resources (for brokers within a node pool).

Example 1: Configure the same CPU settings for all node pools

To configure Event Streams Kafka brokers in all pools to have a CPU request set to 2 CPUs and limit set to 4 CPUs:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    kafka:
      # ...
      resources:
        requests:
          cpu: 2000m
        limits:
          cpu: 4000m
      # ...

This configuration applies the same CPU settings across all pools. A description of the syntax for these values can be found in the Kubernetes documentation.

Example 2: Configure different CPU settings for specific node pools

If you need to set different resource configurations for individual Kafka node pools, you can define the resource properties under spec.strimziOverrides.nodePools[].resources. For example:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    nodePools:
      - name: kafka
        resources:
          requests:
            cpu: 2000m
          limits:
            cpu: 4000m
        # ...
      - name: kafka-2
        resources:
          requests:
            cpu: 4000m
          limits:
            cpu: 8000m
        # ...

Increase the memory request or limit settings for the Kafka brokers

The memory settings for the Kafka broker nodes are defined in the EventStreams custom resource in the requests and limits properties under spec.strimziOverrides.nodePools[].resources, based on the assigned roles.

Alternatively, to apply the same memory settings to all Kafka nodes, you can configure them globally under spec.strimziOverrides.kafka.resources in the EventStreams custom resource. All Kafka node pools will automatically inherit these settings unless overridden at the node pool level.

For example, to configure a pool of Kafka brokers to have a memory request set to 4GB and limit set to 8GB:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    kafka:
      # ...
    nodePools:
      - name: kafka
        roles:
          - broker
        resources:
          requests:
            memory: 4096Mi
          limits:
            memory: 8192Mi

The syntax for these values can be found in the Kubernetes documentation.

Modifying the resources available to supporting components

The resource settings for each supporting component are defined in the EventStreams custom resource under the corresponding component key, in the requests and limits properties under spec.<component>.resources. For example, to configure the Apicurio Registry to have a memory request set to 4GB and limit set to 8GB:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  apicurioRegistry:
    # ...
    resources:
      requests:
        memory: 4096Mi
      limits:
        memory: 8192Mi

The syntax for these values can be found in the Kubernetes documentation.

Increase the disk space available to each Kafka broker

The Kafka brokers need sufficient storage to meet the retention requirements for all of the topics in the cluster. Disk space requirements grow with longer retention periods for messages, increased message sizes and additional topic partitions.

The amount of storage made available to brokers in Kafka node pools is defined at the time of installation in the EventStreams custom resource in the spec.strimziOverrides.nodePools[].storage.size property. For example:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    nodePools:
      - name: kafka
        storage:
          # ...
          size: 100Gi

Modifying the JVM settings for Kafka brokers

If you have specific requirements, you can modify the JVM settings for the Kafka brokers.

Note: Take care when modifying these settings as changes can have an impact on the functioning of the product.

Note: Only a selected subset of the available JVM options can be configured.

JVM settings for all Kafka brokers are defined in the EventStreams custom resource in the spec.strimziOverrides.kafka.jvmOptions property. Alternatively, you can configure JVM settings separately for each Kafka node pool by defining jvmOptions in the spec.strimziOverrides.nodePools[] entries.

For example, to set JVM options for all Kafka brokers:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    kafka:
      # ...
      jvmOptions:
        -Xms: 4096m
        -Xmx: 4096m
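
To set JVM options for a single node pool instead, you can define jvmOptions at the node pool level. The following is a sketch only, assuming a node pool named kafka and the same heap values as the previous example:

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    nodePools:
      - name: kafka
        jvmOptions:
          -Xms: 4096m
          -Xmx: 4096m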

Auto-scaling Kafka brokers

In Event Streams 12.2.0 and later, you can configure Event Streams to automatically scale Kafka brokers by creating a Kubernetes HorizontalPodAutoscaler (HPA). After you create an HPA resource for Kafka brokers, the HPA scales the number of broker pods based on the metrics that you specify in the resource (for example, CPU or memory utilization).

Auto-scaling handles traffic spikes by automatically increasing capacity when demand increases, or by reducing resource usage during low-load periods. It removes the need to manually adjust broker replicas and works with Cruise Control to redistribute partitions when brokers are added or removed.

Important: Auto-scaling is supported only for Kafka broker node pools. Node pools that have a controller role, including dual-role node pools, cannot be auto-scaled.

Prerequisites

Before you begin, ensure the following:

  • Cruise Control is enabled (see the sketch after this list for a minimal example of enabling it).
  • Optional: Consider enabling tiered storage, which reduces the amount of data moved during rebalancing, and can improve performance when brokers are added or removed.
  • Auto-rebalancing is enabled. Ensure that all required topics and partitions exist in the cluster so that partitions are distributed correctly. You can verify this by running the kubectl get kafkatopic -n <namespace> command.
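
If Cruise Control is not yet enabled, the following minimal sketch enables it with default settings by adding a cruiseControl entry under strimziOverrides (additional Cruise Control configuration, if your environment needs it, is omitted):

apiVersion: eventstreams.ibm.com/v1beta2
kind: EventStreams
# ...
spec:
  # ...
  strimziOverrides:
    # ...
    cruiseControl: {}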

Enabling auto-scaling for Kafka brokers

After you enable auto-rebalancing, you can configure an HPA for Kafka broker node pools to automatically scale based on resource usage.

Complete the following steps to enable auto-scaling for Kafka brokers:

  1. Add the following annotation to the broker node pool configuration in your EventStreams custom resource to enable HPA support:

    strimziOverrides:
      nodePools:
        - name: kafka
          replicas: 3
          annotations:
            "eventstreams.ibm.com/hpa-enabled": "true"
    

    Apply your updated EventStreams custom resource. The Event Streams operator no longer manages the replica count for this broker node pool and the HPA takes control of scaling.
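
    For example, if you maintain the EventStreams custom resource in a YAML file (the file name es-prod.yaml is an assumption), you can apply the change by running:

    kubectl apply -f es-prod.yaml -n <namespace>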

  2. Create a HorizontalPodAutoscaler resource for the broker node pool and define your scaling rule.

    For example, the following configuration scales brokers when their average CPU utilization exceeds 90%.

    apiVersion: autoscaling/v2
    kind: HorizontalPodAutoscaler
    metadata:
      name: kafka-hpa
    spec:
      scaleTargetRef:
        apiVersion: eventstreams.ibm.com/v1beta2
        kind: KafkaNodePool
        name: <node-pool-name>
      minReplicas: 3
      maxReplicas: 5
      metrics:
      - type: Resource
        resource:
          name: cpu
          target:
            type: Utilization
            averageUtilization: 90
      behavior:
        scaleDown:
          stabilizationWindowSeconds: 300
          policies:
            - type: Pods
              value: 2
              periodSeconds: 300
        scaleUp:
          stabilizationWindowSeconds: 300
          policies:
            - type: Pods
              value: 2
              periodSeconds: 300
    

    Apply the HorizontalPodAutoscaler resource. After it is created, the HPA monitors the broker node pool and scales brokers automatically based on the defined rules.
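
    For example, if you saved the resource in a file (the file name kafka-hpa.yaml is an assumption), apply it by running:

    kubectl apply -f kafka-hpa.yaml -n <namespace>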

    In this example, when the average CPU utilization across broker pods exceeds the averageUtilization threshold (set to 90 percent), the HPA scales up the number of broker pods, up to the specified maxReplicas (5).

Verifying auto-scaling

After auto-scaling is enabled, you can verify whether scaling occurred and the new broker pods received partition assignments.

  • Run the following command to check the HPA status:

    kubectl get hpa -n <namespace>
    

    Example output:

    NAME        REFERENCE              TARGETS        MINPODS   MAXPODS   REPLICAS   AGE
    kafka-hpa   KafkaNodePool/kafka    cpu: 100%/90%   3         5         5          5h8m
    

    In this example, the HPA output indicates that the CPU usage exceeded the target, and the brokers scaled up to 5 pods, confirming that auto-scaling happened.

  • Optional: Check the current broker pods. Run the following command to see how many broker pods are currently running after auto-scaling:

    kubectl get pods -l eventstreams.ibm.com/broker-role=true -n <namespace>
    

    Example output:

    NAME                         READY   STATUS    RESTARTS   AGE
    min-prod-scram-kafka-3       1/1     Running   0          5h33m
    min-prod-scram-kafka-4       1/1     Running   0          5h33m
    min-prod-scram-kafka-5       1/1     Running   0          5h33m
    min-prod-scram-kafka-6       1/1     Running   0          1h30m
    min-prod-scram-kafka-7       1/1     Running   0          1h30m
    
  • Verify partition reassignment to ensure that new brokers received partitions after scaling.

Configuring auto-rebalancing for Kafka brokers

Auto-rebalancing ensures that partitions remain evenly distributed across brokers as the cluster scales. Cruise Control uses KafkaRebalance custom resources as templates to determine how partitions should be redistributed when brokers are added or removed.

Complete the following steps to configure Cruise Control to automatically rebalance the cluster during scaling:

  1. Create KafkaRebalance templates that define the optimization goals Cruise Control must use when adding and removing brokers. For example:

    apiVersion: eventstreams.ibm.com/v1beta2
    kind: KafkaRebalance
    metadata:
      name: rebalance-template
      annotations:
        eventstreams.ibm.com/rebalance-template: "true"
    spec:
      skipHardGoalCheck: true
      goals:
        - LeaderReplicaDistributionGoal
        - LeaderBytesInDistributionGoal
        - DiskUsageDistributionGoal
        - CpuUsageDistributionGoal
        - ReplicaDistributionGoal
        - NetworkInboundUsageDistributionGoal
        - NetworkOutboundUsageDistributionGoal    
    
  2. Enable auto-rebalancing by specifying the KafkaRebalance templates in the autoRebalance section of your EventStreams custom resource under the appropriate modes:
    apiVersion: eventstreams.ibm.com/v1beta2
    kind: EventStreams
    metadata:
      name: prod-3
    spec:
      strimziOverrides:
        cruiseControl:
          # ...
          autoRebalance:
            - mode: add-brokers
              template:
                name: rebalance-template
            - mode: remove-brokers
              template:
                name: rebalance-template
    
  3. After enabling auto-rebalance, verify that your Kafka pods have rolled and the Cruise Control pod is running.

    kubectl get kafka <instance-name> -o jsonpath='{.metadata.generation} {.status.observedGeneration}'
    

    If the metadata.generation and status.observedGeneration values are the same, the Kafka broker pods and the Cruise Control pod have rolled.

If you want to check the current state of the auto-rebalance operation after scaling, run the following command:

kubectl get kafka <instance-name> -o jsonpath='{.status.autoRebalance.state}'

The state of the operation can be one of the following values:

  • Idle: When no auto-rebalance operation is running.
  • RebalanceOnScaleDown: When the auto-rebalance operation for scaling down the brokers is running.
  • RebalanceOnScaleUp: When the auto-rebalance operation for scaling the brokers up is running.

Verifying partition reassignment

After brokers are added or removed, verify that partitions have been redistributed across the updated set of brokers.

After you initialize the Event Streams CLI, run the following command to verify that new brokers received partition assignments after scaling:

kubectl es topic <topic_name> | awk '/Partition details.../,/Configuration parameters.../'

Example output:

Partition ID   Leader   Replicas        In-sync
0              3        [3,4,5]         [3,4,5]
1              6        [6,7,8]         [6,7,8]
2              4        [4,5,7]         [4,5,7]

This command lists the broker IDs in the Leader, Replicas, and In-sync columns. The new broker IDs shown in the Replicas and In-sync columns confirm that auto-rebalancing happened after scaling.

Migrating Kafka broker storage

You can migrate the storage used by your Kafka brokers without creating a new cluster. By using Kafka node pools and Cruise Control, you can move broker workloads from one storage configuration to another while maintaining cluster availability and data integrity.

Important: The migration involves moving large amounts of data between brokers. Plan the rebalance operation carefully, as the rebalance process adds workload to the cluster and might temporarily impact cluster performance.

Typically, you migrate Kafka broker storage when you want to update or optimize the existing storage configuration. For example, you might want to:

  • Migrate to a faster or more cost-efficient storage class.
  • Resize or replace existing broker disks.

Before you begin, ensure that:

  • Node pools that have a broker role are configured in your cluster.
  • Cruise Control is enabled.

The following steps describe how to migrate broker storage without recreating your Kafka cluster.

  1. Update your EventStreams custom resource to add a new Kafka node pool with the required storage configuration.

    For example, the following configuration includes an existing node pool (brokers-gp2) with the gp2-ebs storage class and a new node pool (brokers-gp3) with the gp3-ebs storage class:

    apiVersion: eventstreams.ibm.com/v1beta2
    kind: EventStreams
    metadata:
      name: my-cluster
    spec:
      strimziOverrides:
        nodePools:
          - name: brokers-gp2
            roles:
              - broker
            replicas: 3
            storage:
              type: persistent-claim
              size: 1Ti
              class: gp2-ebs
          - name: brokers-gp3
            annotations:
              eventstreams.ibm.com/next-node-ids: '[5-10]'
            roles:
              - broker
            replicas: 3
            storage:
              type: persistent-claim
              size: 1Ti
              class: gp3-ebs
    

    Apply the configuration and wait for the new broker pods to become ready. Event Streams creates the new broker pods alongside the existing ones that use the previous storage.

    Note: This step adds new Kafka broker pods to the cluster. Ensure that your cluster has sufficient resources to run the new and existing pods in parallel.

  2. When the new brokers are ready, define a KafkaRebalance custom resource (for example, kafkarebalance.yaml) to instruct Cruise Control to migrate partition replicas from the previous brokers to the new node pool. For example:

    apiVersion: eventstreams.ibm.com/v1beta2
    kind: KafkaRebalance
    metadata:
      name: migrate-storage
      labels:
        eventstreams.ibm.com/cluster: my-cluster
      annotations:
        eventstreams.ibm.com/rebalance-auto-approval: "true"
    spec:
      mode: remove-brokers
      brokers: [0, 1, 2]   
    

    The values 0, 1, 2 in the brokers field represent the IDs of the previous broker nodes that are being removed from the cluster. You can find the IDs of the previous brokers to remove in the nodePools[*].nodeIds section of the EventStreams custom resource of your cluster.
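
    For example, one way to view the node IDs that are reported for each pool (the resource name eventstreams and the grep filter are assumptions; you can also inspect the full YAML output directly) is:

    kubectl get eventstreams <instance-name> -n <namespace> -o yaml | grep -A 3 nodeIds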

  3. Run the following command to apply the KafkaRebalance configuration and start the migration.

    kubectl apply -f kafkarebalance.yaml
    

    Cruise Control automatically moves partition replicas from previous brokers to the new brokers in the new node pool.

    For more information about how Cruise Control optimizes Kafka clusters and handles rebalancing, see optimizing Kafka clusters with Cruise Control.

  4. Run the following command to check the Cruise Control rebalance status and verify migration progress:

    kubectl get kafkarebalances
    

    When the status shows Ready, Cruise Control has completed the migration and redistributed all partition replicas to the new brokers. For example:

    NAME               CLUSTER        STATUS
    migrate-storage    my-cluster     Ready
    
  5. After the migration completes and the cluster is stable, remove the previous broker node pool from your EventStreams custom resource.

    For example, remove the definition of the previous broker node pool (brokers-gp2):

    apiVersion: eventstreams.ibm.com/v1beta2
    kind: EventStreams
    metadata:
      name: my-cluster
    spec:
      strimziOverrides:
        nodePools:
          - name: brokers-gp2
            roles:
              - broker
            replicas: 3
            storage:
              type: persistent-claim
              size: 1Ti
              class: gp2-ebs
    

    After removal, the updated configuration includes only the new node pool (brokers-gp3):

    apiVersion: eventstreams.ibm.com/v1beta2
    kind: EventStreams
    metadata:
      name: my-cluster
    spec:
      strimziOverrides:
        nodePools:
          - name: brokers-gp3
            roles:
              - broker
            replicas: 3
            storage:
              type: persistent-claim
              size: 1Ti
              class: gp3-ebs
    
  6. Apply the updated configuration to remove the previous broker nodes from the cluster.

    Note: The corresponding KafkaNodePool custom resource is not automatically deleted when the node pool is removed from the EventStreams custom resource. Run the following command to delete it manually:

    kubectl delete kafkanodepool <node-pool-name> -n <namespace>