Cluster Management

A LogScale cluster has some additional requirements for configuration, monitoring, and management. This section explains common monitoring tasks when running a cluster.

Understanding cluster health

Cluster health depends on multiple factors working together. Monitor these key indicators to ensure your cluster operates correctly.

Data replication status — Verify data replication according to your configured replication factor. The Cluster Nodes page shows three replication states:
- Perfect — Data is fully replicated to the target replication factor.
- Low — Data has not yet reached full replication. This is normal during transfers but should resolve quickly.
- Absent — Data cannot be found on any node. This indicates node failures or availability issues that require immediate attention.
Kafka synchronization — Kafka manages data distribution across the cluster. The Kafka Cluster page shows in-sync partition counts. All partitions should remain synchronized for proper cluster operation.
Node availability — All nodes should be reachable and running the same LogScale version. The Health Checks provide programmatic health status with three states:
- OK — All health checks are within normal parameters.
- WARN — At least one check needs investigation.
- DOWN — The node is not functioning and should be removed from load balancers.
Resource utilization — Monitor disk usage, ingest latency, and query performance. Health checks trigger warnings when disk usage exceeds 90% or when ingest latency rises above 30 seconds by default.

Monitoring priorities

Different monitoring tools serve different purposes. Use this guidance to determine which tool to check when.

Daily health checks — Review the Cluster Nodes page to verify replication status and node availability. Check that Perfect replication percentage remains high and that no data shows as Absent.
During incidents — Use the Health Checks API for quick programmatic status checks. Integrate health checks with your monitoring and alerting systems.
Performance investigation — Check the Query Monitor to identify heavy queries. Review LogScale Metrics for detailed performance data.
Before upgrades — Verify that all nodes show the same version on the Cluster Nodes page. Ensure replication status shows Perfect before starting rolling upgrades.
Capacity planning — Monitor disk usage trends and transfer rates on the Cluster Nodes page. Watch for increasing Transfers values that might indicate the need for additional nodes.

Cluster management using GraphQL

To use the GraphQL API for retrieving information on a cluster, see the documentation pages on the cluster() and clusterManagementSettings() query fields. For checking connections, consider using the checkLocalClusterConnection() and checkRemoteClusterConnection() query fields. Related to that, look at the pages on the createRemoteClusterConnection(), deleteClusterConnection(), and similar mutation fields.

Self-Hosted Overview

Instance Administration

Query Administration

Configure Security

Authentication and identity providers

Cluster Management

Health Checks

Ingesting Data

Configuration Variables

LogScale URLs and Endpoints

Limits and Standards

Deployment Overview

Planning Your Deployment

Instance Sizing

Authentication and identity providers

Storage Architecture

Installing Using Containers

Installing On Bare Metal or Cloud Instance

Reference Architectures

Installing Load Balancers

Deploying Auxiliary Services

Configuration Settings

Managing Your Deployment

Testing Your Deployment

Humio Operator

Data Analysis Overview

LogScale Web Interface

Manage Repositories and Views

Manage Your LogScale Account

Parse Data

Search Data

Write Queries

Query Language Syntax

Query Joins and Lookups

Query Functions

Data Visualization

Automation

Template Language

Keyboard Shortcuts

Cluster Management

Understanding cluster health

Monitoring priorities

Cluster management using GraphQL

Enter search term