Clickhouse
Clickhouse sink connector
Descriptionâ
Used to write data to Clickhouse.
Key featuresâ
The Clickhouse sink plug-in can achieve accuracy once by implementing idempotent writing, and needs to cooperate with aggregatingmergetree and other engines that support deduplication.
Write data to Clickhouse can also be done using JDBC
Optionsâ
name | type | required | default value |
---|---|---|---|
host | string | yes | - |
database | string | yes | - |
table | string | yes | - |
username | string | yes | - |
password | string | yes | - |
fields | string | yes | - |
clickhouse.* | string | no | |
bulk_size | string | no | 20000 |
split_mode | string | no | false |
sharding_key | string | no | - |
common-options | no | - |
host [string]â
ClickHouse
cluster address, the format is host:port
, allowing multiple hosts
to be specified. Such as "host1:8123,host2:8123"
.
database [string]â
The ClickHouse
database
table [string]â
The table name
username [string]â
ClickHouse
user username
password [string]â
ClickHouse
user password
fields [array]â
The data field that needs to be output to ClickHouse
, if not configured, it will be automatically adapted according to the sink table schema
.
clickhouse [string]â
In addition to the above mandatory parameters that must be specified by clickhouse-jdbc
, users can also specify multiple optional parameters, which cover all the parameters provided by clickhouse-jdbc
.
The way to specify the parameter is to add the prefix clickhouse.
to the original parameter name. For example, the way to specify socket_timeout
is: clickhouse.socket_timeout = 50000
. If these non-essential parameters are not specified, they will use the default values given by clickhouse-jdbc
.
bulk_size [number]â
The number of rows written through Clickhouse-jdbc each time, the default is 20000
.
split_mode [boolean]â
This mode only support clickhouse table which engine is 'Distributed'.And internal_replication
option
should be true
. They will split distributed table data in seatunnel and perform write directly on each shard. The shard weight define is clickhouse will be
counted.
sharding_key [string]â
When use split_mode, which node to send data to is a problem, the default is random selection, but the 'sharding_key' parameter can be used to specify the field for the sharding algorithm. This option only worked when 'split_mode' is true.
common optionsâ
Sink plugin common parameters, please refer to Sink Common Options for details
Examplesâ
sink {
Clickhouse {
host = "localhost:8123"
database = "default"
table = "fake_all"
username = "default"
password = ""
split_mode = true
sharding_key = "age"
}
}
sink {
Clickhouse {
host = "localhost:8123"
database = "default"
table = "fake_all"
username = "default"
password = ""
}
}
Changelogâ
2.2.0-beta 2022-09-26â
- Add ClickHouse Sink Connector
2.3.0-beta 2022-10-20â
- [Improve] Clickhouse Support Int128,Int256 Type (3067)