collect()

`collect()`

Collects fields from multiple events into one event. It has a limit of 1Kb per key when used as part of a groupBy() operation. This limits the number of values you can index during the aggregation.

Parameter	Type	Required	Default Value	Description
`fields`^[a]	array of strings	required		Names of the fields to keep.
`limit`	integer	optional^[b]	`2000`	Limit to number of distinct values in collect.
		Minimum	`1`
`multival`	boolean	optional^[b]	`true`	Collects the resulting value as multivalue (a single field value using `separator`).
`separator`	string	optional^[b]	`\n`	Separator used for multiple values.
^[a]The parameter name `fields` can be omitted. ^[b]Optional parameters use their default value unless explicitly set.

Hide omitted argument names for this function

Show omitted argument names for this function

The collect() function is limited in the memory for while collecting data before the data is aggregated. The limit changes depending on whether collect() runs as a top level function — in which case its limit is 10 MiB:

logscale

#type = humio #kind=logs
| collect(myField)

or whether it runs in a subquery, or as a sub-aggregator to another function — in which case its limit is 1 MiB:

logscale

#type=humio #kind=logs
groupBy(myField, function=collect(myOtherField))

Warning

Collecting the @timestamp field currently only works when a single timestamp exists. You can work around this restriction by renaming or making another field and collecting that instead, for example:

logscale

timestamp := @timestamp
| collect(timestamp)

If you do not need more than a single value, consider using the selectLast() function or setting limit=1, if you experience that the @timestamp field not having a value.

`collect()` Examples

Click + next to an example below to get the full details.

Collect and Group Events by Specified Field - Example 1

Collect and group events by specified field using collect() as part of a groupBy() operation

Collect and Group Events by Specified Field - Example 2

Collect and group events by specified field using collect() as part of a groupBy() operation

LocalAddressIP4	RemoteAddressIP4	aipCount	aip
192.168.1.100	203.0.113.50	3	[10.0.0.1, 10.0.0.2, 10.0.0.3]
10.0.0.5	198.51.100.75	1	[172.16.0.1]
172.16.0.10	8.8.8.8	5	[192.0.2.1, 192.0.2.2, 192.0.2.3, 192.0.2.4, 192.0.2.5]

Sort Timestamps With `groupBy()`

Sorting fields based on aggregated field values

thread	timestamp
BootstrapInfoJob	10:09
DataSynchJob	10:09
Global event loop	10:10
LocalLivequeryMonitor	10:09
LogCollectorManifestUpdate	10:09
TransientChatter event loop	10:10
aggregate-alert-job	10:09
alert-job	10:09
block-processing-monitor-job	10:09
bloom-scheduler	10:09
bucket-entity-config	10:09
bucket-overcommit-metrics-job	10:09
bucket-storage-download	10:09
bucket-storage-prefetch	10:09
chatter-runningqueries-logger	10:09
chatter-runningqueries-stats	10:09

Data Analysis Overview

LogScale User Interface

Repositories & Views

Parsing Data

Searching Data

Writing Queries

Query Language Syntax

Query Functions

Dashboards & Widgets

Automation

Template Language

Keyboard Shortcuts

`collect()`

Warning

`collect()` Examples

Collect and Group Events by Specified Field - Example 1

Query

Introduction

Step-by-Step

Summary and Results

Collect and Group Events by Specified Field - Example 2

Query

Introduction

Step-by-Step

Summary and Results

Sort Timestamps With `groupBy()`

Query

Introduction

Step-by-Step

Summary and Results

Enter search term

Data Analysis Overview

LogScale User Interface

Repositories & Views

Parsing Data

Searching Data

Writing Queries

Query Language Syntax

Query Functions

Dashboards & Widgets

Automation

Template Language

Keyboard Shortcuts

Warning

collect() Examples

Collect and Group Events by Specified Field - Example 1

Query

Introduction

Step-by-Step

Summary and Results

Collect and Group Events by Specified Field - Example 2

Query

Introduction

Step-by-Step

Summary and Results

Sort Timestamps With groupBy()

Query

Introduction

Step-by-Step

Summary and Results

Enter search term

`collect()`

`collect()` Examples

Sort Timestamps With `groupBy()`