Switching Kafka

LogScale uses Kafka to queue incoming messages and to store shared state when running in a cluster setup. LogScale can snapshot its state and continue running on a new Kafka cluster. This is useful when you want to change infrastructure or when there are problems with the current Kafka/ZooKeeper cluster. One example is when all ZooKeeper machines have filled their disks and ZooKeeper then fails to start because of file inconsistencies. This section describes the procedure for switching Kafka.

Danger

All LogScale processes must be completely stopped before performing this action.

Stop Sending Data to LogScale

If possible, stop sending data to LogScale, then wait for LogScale to process all data on the ingest queue. To monitor progress, the LogScale Stats dashboard in the LogScale repository has an Events Processed After Ingest Queue graph (per host, per second) and an Ingest Latency graph. Any data still on the ingest queue when LogScale is shut down will be lost, because the queue is either reset or replaced by a new queue.
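The "wait until the ingest queue is empty" check can be sketched as a small helper. This is only an illustration: the function name, the sample lists, the latency threshold, and the window size are all assumptions, and the samples are expected to be values you read off the two dashboard graphs yourself (no LogScale API is called here).

```python
from typing import Sequence


def ingest_queue_drained(
    latency_seconds: Sequence[float],
    events_per_second: Sequence[float],
    max_latency: float = 1.0,  # assumed threshold, tune for your cluster
    window: int = 5,           # assumed number of most recent samples to inspect
) -> bool:
    """Return True when the most recent samples suggest the queue is empty.

    Both sequences are hypothetical readings taken from the Ingest Latency
    and Events Processed After Ingest Queue graphs, oldest first.
    """
    if len(latency_seconds) < window or len(events_per_second) < window:
        return False  # not enough samples to decide
    recent_latency = latency_seconds[-window:]
    recent_events = events_per_second[-window:]
    # Drained: latency stays low and no events are being processed any more.
    return max(recent_latency) <= max_latency and all(e == 0 for e in recent_events)
```

Requiring several consecutive quiet samples, rather than a single one, reduces the chance of shutting down during a brief pause while data is still queued.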

You'll need to stop all LogScale processes on all machines. You'll also need to stop all Kafka processes and all ZooKeeper processes on all machines.
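As a rough sanity check that the services are down, you can confirm that nothing is accepting TCP connections on their ports. The sketch below uses only the Python standard library. The host names and the ports in the usage comment (8080 for LogScale, 9092 for Kafka, 2181 for ZooKeeper) are assumptions based on common defaults, so substitute the values from your own configuration. A closed port does not prove that a process has exited, so treat this as a supplement to checking the process list on each machine.

```python
import socket
from typing import Iterable, List, Tuple


def is_listening(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


def still_running(hosts: Iterable[str], ports: Iterable[int]) -> List[Tuple[str, int]]:
    """Return every (host, port) pair that still accepts connections."""
    ports = list(ports)
    return [(h, p) for h in hosts for p in ports if is_listening(h, p)]


# Example usage (hypothetical host names, assumed default ports):
#   leftovers = still_running(["node1", "node2", "node3"], [8080, 9092, 2181])
#   if leftovers:
#       print("Still listening:", leftovers)
```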

The next step depends on the deployment model:

When using Kafka deployed with ZooKeeper:
- Switch Kafka and ZooKeeper
- Restarting Kafka, ZooKeeper and LogScale

When using Kafka deployed with KRaft:
- Switch Kafka using KRaft Mode
- Restarting Kafka using KRaft and LogScale

Switching Kafka Recap

It's worth reviewing the steps above again. In short, a Kafka switch consists of the following steps, in order:

1. Stop all LogScale processes on all nodes.
2. Stop all Kafka processes on all nodes.
3. Stop all ZooKeeper processes on all nodes (up to LogScale 1.107).
4. Delete Kafka data (or use new Kafka queues).
5. Delete ZooKeeper data (if using ZooKeeper).
6. Start all ZooKeeper processes on all nodes (up to LogScale 1.107).
7. Verify the ZooKeeper cluster (up to LogScale 1.107).
8. Start all Kafka processes on all nodes.
9. Verify the Kafka cluster.
10. Start one LogScale node and let it change epoch.
11. Verify the epoch has changed.
12. Start the other LogScale processes on all nodes.
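The ordering in the recap can be captured as data, which is handy for a runbook or a checklist script. The following is a minimal sketch, not an official tool: the function name `kafka_switch_plan` and the `uses_zookeeper` flag are made up for illustration, and the function only returns the step descriptions. It does not stop, start, or verify anything itself.

```python
from typing import List


def kafka_switch_plan(uses_zookeeper: bool) -> List[str]:
    """Return the recap steps in order, omitting ZooKeeper steps for KRaft."""
    steps = [
        "Stop all LogScale processes on all nodes",
        "Stop all Kafka processes on all nodes",
    ]
    if uses_zookeeper:
        steps.append("Stop all ZooKeeper processes on all nodes")
    steps.append("Delete Kafka data (or use new Kafka queues)")
    if uses_zookeeper:
        steps += [
            "Delete ZooKeeper data",
            "Start all ZooKeeper processes on all nodes",
            "Verify the ZooKeeper cluster",
        ]
    steps += [
        "Start all Kafka processes on all nodes",
        "Verify the Kafka cluster",
        "Start one LogScale node and let it change epoch",
        "Verify the epoch has changed",
        "Start the other LogScale processes on all nodes",
    ]
    return steps


if __name__ == "__main__":
    # Print a numbered checklist for a ZooKeeper-based deployment.
    for number, step in enumerate(kafka_switch_plan(uses_zookeeper=True), start=1):
        print(f"{number}. {step}")
```

Keeping the plan as a plain list makes the one hard ordering constraint easy to see: a single LogScale node is started and its epoch change verified before any other LogScale node is started.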
