Making Back-Ups

Removed: Backups Removed v1.57.0

Built-in backups have been removed from the product. Bucket Storage can be used instead as a backup solution.

LogScale has a built-in backup facility. It only requires a separate directory to which LogScale will write; preferably, on another disk or network drive separate from the data directory. When configured, LogScale keeps a full backup of the current state in this directory. A LogScale node can start with no data and restore the previous state from such a backup.

Backup Encryption

The files written on the backup drive are encrypted using a secret provided during LogScale configuration, and allow you to store the backup on a network drive where others may have read access.

Third-Party Backup Software

LogScale is designed to support solutions like Amazon S3 for archiving. This is not a full backup, but will archive the data so that it can be re-ingested by another or the same LogScale server at some later date.

You can do a full backup using any backup software that is able to back up all the files in the LogScale data directory. Any chosen backup software needs to support "sparse files" to be efficient.

Configure Backup

You can back up your LogScale installation by adding a special mounted directory when you run the Docker container. LogScale writes its backup files to this directory. Each LogScale node can have its own backup directory that is not shared with the other nodes. This is possible since it is not always feasible to create one backup directory that can hold all the data in a LogScale cluster. In the backup directory, each LogScale node will create a sub-directory with its node ID as name and use this directory.

First, create an empty directory on the host machine to store data for LogScale. You would do that like so:

shell
$ mkdir /humio-backups-on-host

LogScale recommends creating the backup directory on a different disk from the main LogScale data directory. Make the directory a mount point for a network drive or other similar separation from the main data drive.

Next, edit the LogScale configuration file to set the backup parameters. Add these lines to that file:

ini
BACKUP_NAME=humio-backup
BACKUP_KEY=mysecretkey-myhost-+R+q(AB9QG86xZMCKGyj

LogScale encrypts all backups with a secret key that you provide. This means that you can safely store backups on an unencrypted disk, or send them over the Internet. Keep the secret key safe and store it in another place. You cannot recover the backup if you lose access to it. If you lose the secret, delete all the files in the backup, or provide a new location to backup to, and start over. LogScale will then write a fresh backup.

The next step is to run LogScale using the Docker run command with the following argument like so:

shell
$ run -v /humio-backups-on-host:/backup

This maps the backups directory on the host. In this example, /humio-backups-on-host will send the backup to the /backup directory in the container. LogScale will then start backing up data to the specified directory.

Without Docker

The procedure is similar. Instead of mounting the directory using -v, you specify the location using BACKUP_DIR. A full example configuration is as follows:

ini
BACKUP_NAME=humio-backup
BACKUP_KEY=mysecretkey-myhost-+R+q(AB9QG86xZMCKGyj
BACKUP_DIR=/mnt/my-net-server/humio-backup01

Delete Data in Backup

When is data deleted in backup? Right when retention kicks in and deletes data in LogScale. It is possible to configure a delay so that data is not deleted in the backup until some time has elapsed since it was deleted in LogScale. This is configured using DELETE_BACKUP_AFTER_MILLIS. By default LogScale will not delete data in the backup until seven days after the data was deleted in LogScale.

ini
DELETE_BACKUP_AFTER_MILLIS=604800000

Restore a Backup

LogScale can restore all events that were stored in segment files for a LogScale node in the humio-data directory from the backup. It can also help a node claim the nodeID of a lost node in a cluster.

If your node is lost, but you have reinstalled the OS on the existing hardware, or perhaps found a spare server that will now take on the role of the lost LogScale node, follow these steps.

Create the humio-data directory and place a copy of the uuid file from the lost node — this file is present in the folder.

shell
$ cp /backup/BACKUP-NAME/globaldata/cluster_membership-NODEID-UUID.uuid \
    /data/humio-data/cluster_membership.uuid

LogScale restores missing segment files when it discovers they are missing, and when they are present in the backup folder.