Version: 2.3.0-beta

Druid

Druid sink connector

Description

Write data to Apache Druid.

Tip: supported engines and plugin names

  • Spark
  • Flink: Druid

Options

name                     type    required  default value
coordinator_url          String  yes       -
datasource               String  yes       -
timestamp_column         String  no        timestamp
timestamp_format         String  no        auto
timestamp_missing_value  String  no        -
parallelism              Int     no        -

coordinator_url [String]

The URL of the Coordinator service in Apache Druid.

datasource [String]

The DataSource name in Apache Druid.

timestamp_column [String]

The name of the timestamp column in Apache Druid. The default value is timestamp.

timestamp_format [String]

The timestamp format in Apache Druid. The default value is auto. Accepted values are:

  • iso

    • ISO8601 with 'T' separator, like "2000-01-01T01:02:03.456"
  • posix

    • seconds since epoch
  • millis

    • milliseconds since epoch
  • micro

    • microseconds since epoch
  • nano

    • nanoseconds since epoch
  • auto

    • automatically detects ISO (either 'T' or space separator) or millis format
  • any Joda DateTimeFormat string
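
As a rough illustration of what the named formats mean, the following Python sketch converts a raw value to a UTC datetime. It is not the connector's or Druid's implementation: the function name to_utc, the digits-only rule used for auto, and reading zone-less ISO strings as UTC are assumptions made for the example. Joda DateTimeFormat patterns are not covered.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

# Microseconds represented by one unit of each epoch-based format.
# "nano" is handled separately because it is finer than a microsecond.
MICROS_PER_UNIT = {"posix": 1_000_000, "millis": 1_000, "micro": 1}


def to_utc(value, fmt="auto"):
    """Interpret value according to fmt and return an aware UTC datetime.

    Hypothetical helper for illustration only.
    """
    if fmt == "auto":
        # Assumption: digits-only input is millis, anything else is ISO.
        fmt = "millis" if str(value).lstrip("-").isdigit() else "iso"
    if fmt == "iso":
        # fromisoformat accepts any single-character date/time separator,
        # so both 'T' and a space work here.
        parsed = datetime.fromisoformat(str(value))
        # Assumption: a value without a zone offset is read as UTC.
        return parsed if parsed.tzinfo else parsed.replace(tzinfo=timezone.utc)
    if fmt == "nano":
        # Python datetimes stop at microsecond precision, so truncate.
        return EPOCH + timedelta(microseconds=int(value) // 1_000)
    if fmt in MICROS_PER_UNIT:
        return EPOCH + timedelta(microseconds=int(value) * MICROS_PER_UNIT[fmt])
    raise ValueError(f"unsupported format in this sketch: {fmt}")


if __name__ == "__main__":
    # The same instant expressed in several formats.
    print(to_utc("2000-01-01T01:02:03.456", "iso"))
    print(to_utc(946688523, "posix"))
    print(to_utc(946688523456, "millis"))
    print(to_utc("2000-01-01 01:02:03.456"))  # auto, space separator
```

The posix value in the usage lines carries no sub-second part, so it prints the same instant without the .456 fraction.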

timestamp_missing_value [String]

The timestamp value to use for input records whose timestamp is null or missing. It should be in ISO 8601 format, for example "2022-02-02T02:02:02.222".
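
The fallback described above can be pictured with a small Python sketch. The function name resolve_timestamp and the dictionary-shaped record are hypothetical, and raising an error when no fallback is configured is an assumption for the example, not documented connector behavior.

```python
from datetime import datetime


def resolve_timestamp(record, column="timestamp", missing_value=None):
    """Return the record's timestamp, or missing_value when it is null or absent.

    Hypothetical helper for illustration only.
    """
    raw = record.get(column)
    if raw is not None:
        return raw
    if missing_value is None:
        # Assumption: without a fallback the record cannot be timestamped.
        raise ValueError(f"record has no usable '{column}' value")
    # Check that the fallback is ISO 8601 text; raises ValueError otherwise.
    datetime.fromisoformat(missing_value)
    return missing_value


if __name__ == "__main__":
    fallback = "2022-02-02T02:02:02.222"
    print(resolve_timestamp({"timestamp": "2000-01-01T01:02:03.456"}, missing_value=fallback))
    print(resolve_timestamp({"page": "Main"}, missing_value=fallback))
    print(resolve_timestamp({"timestamp": None}, missing_value=fallback))
```

The first call keeps the record's own timestamp; the other two fall back to the configured value.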

parallelism [Int]

The parallelism of the DruidSink operator.

Example

Simple

DruidSink {
  coordinator_url = "http://localhost:8081/"
  datasource = "wikipedia"
}

Specify the timestamp column and format

DruidSink {
  coordinator_url = "http://localhost:8081/"
  datasource = "wikipedia"
  timestamp_column = "timestamp"
  timestamp_format = "auto"
}

Specify the timestamp column, format, and missing value

DruidSink {
  coordinator_url = "http://localhost:8081/"
  datasource = "wikipedia"
  timestamp_column = "timestamp"
  timestamp_format = "auto"
  timestamp_missing_value = "2022-02-02T02:02:02.222"
}