humio/insights Dashboards
Bucket Storage
Widget | Description | Type |
---|---|---|
Bytes read/written from/to bucket storage | logscale
| Time Chart |
Download queue cap size hits |
Shows the number of times the request download queue size cap is
hit
logscale
| Time Chart |
Searches fetching data from bucket storage |
Shows the queries that is fetching data from bucket storage. This
happens when data is not available on local disks in the cluster
logscale
| Table |
Download queue free slots |
Shows the number of free slots for segment downloads in the bucket
storage download queue
logscale
| Time Chart |
Bucket Storage Status | logscale
| Table |
Bucket storage upload latency (max per partition) |
Shows how long the oldest segment has been waiting after it has
been ingested.
logscale
| Time Chart |
Current segment downloads |
Shows the number of segments downloads currently in progress
logscale
| Time Chart |
Data transferred for files
Widget | Description | Type |
---|---|---|
File data sent between nodes |
This graph sums the amount of data transferred from nodes to
nodes. Based on uploaded-files that are mirrored to a node from
another node. This excludes any files mirrored from bucket
storage.
logscale
| Time Chart |
File data sent to nodes from buckets or nodes |
This graph sums the amount of data transferred to nodes either
from bucket or other nodes. Based on uploaded-files that are
mirrored to a node.
logscale
| Time Chart |
File data sent from buckets to nodes |
This graph sums the amount of data transferred from bucket storage
to nodes. Based on uploaded-files that are mirrored to a node from
bucket storage. This excludes any data mirrored from other nodes.
logscale
| Time Chart |
Sample of file data sent from buckets to nodes |
This graphs sums the data transferred from bucket storage to
individual nodes. Based on uploaded-files mirrored from bucket
storage to nodes, excluding data mirrored between nodes. There are
too many nodes to show them all. Instead this is meant to give a
general indication of how well the file transfer is distributed
among nodes.
logscale
| Time Chart |
Sample of file data sent between node pairs |
This graph sums the data transferred between specific node pairs.
Based on uploaded-files that are mirrored between nodes, excluding
data mirrored from bucket storage. There are too many node pairs
to show them all. Instead this is meant to give a general
indication of how well the data transfer is distributed among node
pairs.
logscale
| Time Chart |
Errors
Widget | Description | Type |
---|---|---|
Ingest Latency | logscale
| Time Chart |
Errors Grouped | logscale
| Table |
Slow Warnings | logscale
| Time Chart |
HotSpot Crashes |
This widget looks for fatal exits of the JVM, as these can
indicate the HotSpot compiler crashed. This is fairly rare, but
can at times happen when the LogScale regex engine hits a bug in
the JVM -- but other causes are also possible. This graph should
almost always be empty, and if it shows many incidents grouped
close together, it is likely related to a bug in the JVM. PLEASE
NOTE that this widget only shows relevant data if stdout logs from
the LogScale JVM instances are shipped to the repository being
inspected.
logscale
| Event List |
Aggregate Alerts With Other Errors |
Aggregate alerts with errors, which don't originate from invoking
actions
logscale
| Time Chart |
HTTP Errors 400s | logscale
| Time Chart |
Search Queue | logscale
| Time Chart |
Global Snapshot File Size | logscale
| Time Chart |
Scheduled Search Action Errors | logscale
| Time Chart |
Missing Nodes | logscale
| Time Chart |
Aggregate Alert Action Errors | logscale
| Time Chart |
Errors Over Time | logscale
| Time Chart |
Filter Alert Action Errors | logscale
| Time Chart |
Failed HTTP Checks | logscale
| Time Chart |
Legacy Alerts With Other Errors |
Legacy alerts with errors, which don't originate from invoking
actions
logscale
| Time Chart |
Global Transactions (Per Second) | logscale
| Time Chart |
FDR Ingest Errors | logscale
| Time Chart |
Missing Segments Reported By ClusterManagement | logscale
| Time Chart |
Node Shutdowns | logscale
| Time Chart |
HTTP Errors 500s | logscale
| Time Chart |
Legacy Alert Action Errors | logscale
| Time Chart |
Scheduled Searches With Other Errors |
Scheduled searches with errors, which don't originate from
invoking actions
logscale
| Time Chart |
Filter Alerts With Other Errors |
Filter alerts with errors, which don't originate from invoking
actions
logscale
| Time Chart |
CPU Usage In Percent | logscale
| Time Chart |
Ingest Errors | logscale
| Time Chart |
Hosts
Widget | Description | Type |
---|---|---|
JVM Hiccups (Millis) - Max Time For No-op Operation | logscale
| Time Chart |
Failed HTTP Checks | logscale
| Time Chart |
Humio Versions | logscale
| Time Chart |
Memory: Humio Process (RSS) | logscale
| Time Chart |
Memory: JVM Heap Usage in Percent | logscale
| Time Chart |
Logged events | logscale
| Time Chart |
Secondary Disk Usage (%) | logscale
| Time Chart |
Open File Descriptors | logscale
| Time Chart |
Mini-segment percentage of available primary disk | logscale
| Time Chart |
CPU Load | logscale
| Time Chart |
Memory: System Usage % | logscale
| Time Chart |
JVM Off Heap Memory Allocated | logscale
| Time Chart |
Memory: JVM Heap Usage | logscale
| Time Chart |
Node Shutdowns | logscale
| Time Chart |
Humio Version | logscale
| Table |
CPU Architecture | logscale
| Table |
Primary Disk Usage (%) | logscale
| Time Chart |
JVM Garbage Collection Time | logscale
| Time Chart |
Networking (Bytes per second) | logscale
| Time Chart |
CPU Usage: Thread Group Ticks | logscale
| Time Chart |
Temp disk usage (Bytes) | logscale
| Time Chart |
Missing Nodes | logscale
| Time Chart |
CPU Usage In Percent | logscale
| Time Chart |
Failed HTTP checks between nodes | logscale
| Sankey |
Cluster Time Skew (Millis) | logscale
| Time Chart |
Ingest
Widget | Description | Type |
---|---|---|
HTTP Ingest Endpoint Usage (Per Second) | logscale
| Time Chart |
Datasources Hitting Max Autoshards | logscale
| Time Chart |
Http Ingest Request With Status Code > 400 (Per Second) | logscale
| Time Chart |
Number Of Live Searches Per Host | logscale
| Time Chart |
Datasources Auto Sharding | logscale
| Table |
Ingest/Digest Per Repo (bytes per day after ingest queue) | logscale
| Time Chart |
Ingest Partition Changes | logscale
| Time Chart |
Parsers Using The Most Time (Millis) | logscale
| Table |
Ingest/Digest Per Host (bytes per day after ingest queue) | logscale
| Time Chart |
Ingest Per Host (events per second before ingest queue) | logscale
| Time Chart |
Ticks spend in Digest and Live searches (Millis Per Second) | logscale
| Time Chart |
Ingest Errors (Per Second) | logscale
| Time Chart |
Ingest Offset Progress | logscale
| Time Chart |
HTTP Ingest requests (Per Second) | logscale
| Time Chart |
Ingest Latency Per Host (Digest) | logscale
| Time Chart |
Events Put On Ingest Queue | logscale
| Time Chart |
Ingest Latency Per Partition (Digest) | logscale
| Time Chart |
CPU Usage In Percent | logscale
| Time Chart |
Time Spend In Parsers (Millis per second) | logscale
| Time Chart |
Datasources Increasing Or Decreasing Auto Sharding | logscale
| Time Chart |
Ingest Per Repo (events per second before ingest queue) | logscale
| Time Chart |
Node Shutdowns | logscale
| Time Chart |
Number Of Datasources Per Repo | logscale
| Time Chart |
HTTP Ingest Request Response Time (Millis) | logscale
| Time Chart |
Killed Live Queries Because Of Ingest Delay | logscale
| Time Chart |
HTTP Ingest Request Size (Bytes) | logscale
| Time Chart |
Kafka
Widget | Description | Type |
---|---|---|
Out Of Sync Queues | logscale
| Time Chart |
Ingest Queue: Out Of Sync Partitions | logscale
| Gauge |
Ingest Queue: Uncompressed Bytes Written (Bytes/Sec) | logscale
| Time Chart |
Kafka Topic Partitions | logscale
| Table |
Ingest Queue: Request Size 75th Percentiles (Bytes) | logscale
| Time Chart |
Global: Time Blocked Waiting For Write (P75) (Millis) | logscale
| Time Chart |
TransientChatter Queue: Out Of Sync Partitions | logscale
| Gauge |
Chatter: Bytes Per Second By Kind | logscale
| Time Chart |
Global: Request Per Second | logscale
| Time Chart |
Global Events Queue: Out Of Sync Partitions | logscale
| Gauge |
Ingest Queue: Puts per second (per host) | logscale
| Time Chart |
Chatter: Bytes Per Second | logscale
| Time Chart |
Chatter: Requests Per Second | logscale
| Time Chart |
Global: Transactions Per Second | logscale
| Time Chart |
Lag Reading Ingest Queue |
This is a Kafka metric. records-lag-max. The
lag in terms of number of records for each partition in this
window. An increasing value over time is your best indication that
the consumer group is not keeping up with the producers.
logscale
| Time Chart |
Ingest Queue: Put Response Times 75.th Percentile (Millis) | logscale
| Time Chart |
Lookup tables
Widget | Description | Type |
---|---|---|
Largest files loaded - original disk size |
Largest file (size on disk) loaded into into the file cache within
the given timespan
logscale
| Time Chart |
Number of cached lookup tables |
Number of lookup tables registered in the lookup table coordinator
logscale
| Time Chart |
Number of lookup files in cache |
Number of files registered in the uploaded file cache
logscale
| Time Chart |
Longest load time |
Longest load time for a file loaded into the file cache within the
given timespan
logscale
| Time Chart |
Number of cached index-structures |
Number of index-structures registered in the lookup table
coordinator
logscale
| Time Chart |
Uploaded file cache size |
Size of files in the uploaded file cache
logscale
| Time Chart |
Largest files loaded - size in memory |
Largest file (size in memory) loaded into into the file cache
within the given timespan
logscale
| Time Chart |
Number of queued loading tasks |
Maximum number of pending tasks queued for loading lookup tables
since latest sample for metrics
logscale
| Time Chart |
Number of queries with files in cache |
Number of queries registered in the lookup table coordinator
logscale
| Time Chart |
Overview
Widget | Description | Type |
---|---|---|
Ingest/Digest Per Host (bytes per day after ingest queue) | logscale
| Time Chart |
Ingest Per Host (events per second before ingest queue) | logscale
| Time Chart |
Search Queue | logscale
| Time Chart |
Errors Grouped | logscale
| Table |
Requests Per Second | logscale
| Time Chart |
Ingest Latency Per Host (Digest) | logscale
| Time Chart |
CPU Usage In Percent | logscale
| Time Chart |
Ingest Errors | logscale
| Time Chart |
Errors Over Time | logscale
| Time Chart |
JVM Hiccup (Millis) - Max Time For No-op Operation | logscale
| Time Chart |
Parsers
Widget | Description | Type |
---|---|---|
Parsers Using The Most Time (Millis) | logscale
| Table |
Parser Limiting Job / Top parsers | logscale
| Table |
Ingest/Digest Per Host (bytes per day after ingest queue) | logscale
| Time Chart |
Parser Limiting Job (blocks at 100) | logscale
| Time Chart |
503 - Ingest parsing exceeded the acceptable amount of time | logscale
| Time Chart |
Ingest Latency Per Host (Digest) | logscale
| Time Chart |
CPU Usage In Percent | logscale
| Time Chart |
Time Spend In Parsers (Millis per second) | logscale
| Time Chart |
Node Shutdowns | logscale
| Time Chart |
Parsers being throttled | logscale
| Event List |
Permissions
Widget | Description | Type |
---|---|---|
Permission cache hit vs. miss |
For each request stemming from a user, all the permissions for
this user is in some cases fetched. A cache exists to ensure that
these permissions are not fetched multiple times. This time chart
shows all permissions lookup vs. permission lookups resulting in a
cache miss. The chart is divided into each level of permission.
The purpose is to measure the value of the caches.
logscale
| Time Chart |
Average time spent on permissions checking (ms) |
This chart shows the average time spent on a permission check
across all permissions.
logscale
| Time Chart |
Number of permission checks pr. permission |
This chart shows the number of permission checks over time for
each permission.
logscale
| Time Chart |
Average time spent on permission check pr. permission (ms) |
This chart shows the time spent on average pr permission check for
each permission.
logscale
| Time Chart |
Replication
Widget | Description | Type |
---|---|---|
Segment bytes unassigned from host due to eviction |
Segment bytes unassigned from host due to eviction either through
EvictedHostRemoval or EvictedHostReplacement.
logscale
| Time Chart |
Segment bytes unassigned from host due to OR |
Segment bytes assigned to host due to over-replication.
logscale
| Time Chart |
Segment bytes assigned to host due to host eviction. |
Segment bytes assigned to host due to eviction of another host.
logscale
| Time Chart |
Segment bytes assigned to host due to zoning |
Segment bytes assigned to host to improve zoning.
logscale
| Time Chart |
Segment bytes assigned to host due to UR |
Segment bytes assigned to host due to under-replication.
logscale
| Time Chart |
Segment bytes assigned to host due to disk balance |
Segment bytes assigned to host due to disk imbalance.
logscale
| Time Chart |
Request-Response
Widget | Description | Type |
---|---|---|
Ingest endpoint Usage (Per Second) | logscale
| Time Chart |
Ingest Requests (Per Second) | logscale
| Time Chart |
Streaming Query submits (Per Minute) | logscale
| Time Chart |
External Query Requests (Per Second) | logscale
| Time Chart |
Internal Query Requests (Per Second) | logscale
| Time Chart |
Ingest Request Responsetime (Millis) | logscale
| Time Chart |
Query Submits (Per Minute) | logscale
| Time Chart |
Ingest Request Size (Bytes) | logscale
| Time Chart |
Statuscode >=400 (Per Second) | logscale
| Time Chart |
Ingest Requests With Statuscode >= 400 | logscale
| Time Chart |
Query Submits Per Repo (Per Minute) | logscale
| Time Chart |
Response Times (Millis) | logscale
| Time Chart |
Requests Per Second | logscale
| Time Chart |
Search
Widget | Description | Type |
---|---|---|
Starved Searches | logscale
| Time Chart |
Serialised Query State Max Size | logscale
| Time Chart |
HTTP External Query Requests (Per Second) | logscale
| Time Chart |
Queries created in QuerySessions (Per Minute) | logscale
| Time Chart |
CPU Usage In Percent | logscale
| Time Chart |
Query Memory Allocation Cost | logscale
| Time Chart |
Maximum Serialised State Sizes (Megabytes) | logscale
| Table |
Time Spend Reading segments | logscale
| Time Chart |
Top Cost queries By User | logscale
| Table |
Search Queue | logscale
| Time Chart |
Query CPU Ticks (Historical) | logscale
| Time Chart |
Query Restarts By Reason | logscale
| Time Chart |
HTTP Query Submits Per Repo (Per Minute) | logscale
| Time Chart |
Local Query Segment Queue | logscale
| Time Chart |
Query CPU Ticks (Live) | logscale
| Time Chart |
Read Segment Files Performance (bytes/sec) | logscale
| Time Chart |
CPU usage: Thread Group Ticks | logscale
| Time Chart |
Slow Warnings To Users | logscale
| Time Chart |
Live Queries Per Host | logscale
| Time Chart |
Top Cost queries | logscale
| Table |
Top Queries In Mapper Threads | logscale
| Table |
Query Threaddumps with Query IDs | logscale
| Time Chart |
Query Historical Cost | logscale
| Time Chart |
HTTP Query Submits (Per Minute) | logscale
| Time Chart |
HTTP Streaming Query Requests (Per Minute) | logscale
| Time Chart |
Streaming Query Submits Per Repo (Per Minute) | logscale
| Time Chart |
HTTP internal Query Requests (Per Second) | logscale
| Time Chart |
Query Total Cost | logscale
| Time Chart |
Query Live Cost | logscale
| Time Chart |
Segments And Datasources
Widget | Description | Type |
---|---|---|
Merged Segments Sizes (Bytes) | logscale
| Time Chart |
Time Spend Merging A Segment | logscale
| Time Chart |
Segment Merge Latency (ms) | logscale
| Time Chart |
Segment Merges Per Hour | logscale
| Time Chart |
Time Spend Merging (Seconds Per Second) | logscale
| Time Chart |
Blocks In Merged Segment | logscale
| Time Chart |
Global Snapshot File Size | logscale
| Time Chart |
Number Of Datasources In Repositories | logscale
| Time Chart |