Version: 2.3.0-beta

Split

Split transform plugin

Description

A string cutting function is defined, which is used to split the specified field in the Sql plugin.

tip

This transform both supported by engine Spark and Flink.

Options

Spark
Flink

name	type	required	default value
separator	string	no	" "
fields	array	yes	-
source_field	string	no	raw_message
target_field	string	no	root
common-options	string	no	-

separator [string]

Separator, the input string is separated according to the separator. The default separator is a space (" ") . Note: If you use some special characters in the separator, you need to escape it. e.g. "\|"

source_field [string]

The source field of the string before being split, if not configured, the default is raw_message

target_field [string]

target_field can specify the location where multiple split fields are added to the Event. If it is not configured, the default is _root_ , that is, all split fields will be added to the top level of the Event. If a specific field is specified, the divided field will be added to the next level of this field.

name	type	required	default value
separator	string	no	,
fields	array	yes	-
common-options	string	no	-

separator [string]

The specified delimiter, the default is ,

fields [list]

In the split field name list, specify the field names of each character string after splitting in order. If the length of the fields is greater than the length of the separation result, the extra fields are assigned null characters.

common options [string]

Transform plugin common parameters, please refer to Transform Plugin for details

Examples

Spark
Flink

Split the message field in the source data according to &, you can use field1 or field2 as the key to get the corresponding value

split {
    source_field = "message"
    separator = "&"
    fields = ["field1", "field2"]
}

Split the message field in the source data according to , , the split field is info , you can use info.field1 or info.field2 as the key to get the corresponding value

split {
    source_field = "message"
    target_field = "info"
    separator = ","
    fields = ["field1", "field2"]
}

Use Split as udf in sql.

  # This just created a udf called split
  Split{
    separator = "#"
    fields = ["name","age"]
  }
  # Use the split function (confirm that the fake table exists)
  sql {
    sql = "select * from (select raw_message,split(raw_message) as info_row from fake) t1"
  }

Split

Description​

Options​

separator [string]​

source_field [string]​

target_field [string]​

separator [string]​

fields [list]​

common options [string]​

Examples​

Description

Options

separator [string]

source_field [string]

target_field [string]

separator [string]

fields [list]

common options [string]

Examples