Skip to main content
Version: 1.x

Alluxio

Output plugin : Alluxio

Description

Write Rows to Alluxio.

Options

nametyperequireddefault value
optionsobjectno-
partition_byarrayno-
pathstringyes-
path_time_formatstringnoyyyyMMddHHmmss
save_modestringnoerror
formatstringnojson
options [object]

Custom parameters.

partition_by [array]

Partition the data based on the fields.

path [string]

File path on Alluxio. Start with alluxio://.

path_time_format [string]

If path contains time variables, such as xxxx-${now}, path_time_format can be used to specify the format of Alluxio path, default is yyyy.MM.dd. The commonly used time formats are listed below:

SymbolDescription
yYear
MMonth
dDay of month
HHour in day (0-23)
mMinute in hour
sSecond in minute

The detailed time format syntax:Java SimpleDateFormat.

save_mode [string]

Save mode, supports overwrite, append, ignore and error. The detail of save_mode see save-modes.

format [string]

format, supports csv, json, parquet and text.

Note

if use alluxio with zookeeper, please add below in start-seatunnel.sh

driverJavaOpts="-Dalluxio.user.file.writetype.default=CACHE_THROUGH -Dalluxio.zookeeper.address=your.zookeeper.address:zookeeper.port -Dalluxio.zookeeper.enabled=true"
executorJavaOpts="-Dalluxio.user.file.writetype.default=CACHE_THROUGH -Dalluxio.zookeeper.address=your.zookeeper.address:zookeeper.port -Dalluxio.zookeeper.enabled=true"

or you can also add below in spark{} in seatunnel configuration after 1.5.0

spark.driverJavaOpts="-Dalluxio.user.file.writetype.default=CACHE_THROUGH -Dalluxio.zookeeper.address=your.zookeeper.address:zookeeper.port -Dalluxio.zookeeper.enabled=true"
spark.executorJavaOpts="-Dalluxio.user.file.writetype.default=CACHE_THROUGH -Dalluxio.zookeeper.address=your.zookeeper.address:zookeeper.port -Dalluxio.zookeeper.enabled=true"

Example

alluxio {
path = "alluxio:///var/logs-${now}"
format = "json"
path_time_format = "yyyy.MM.dd"
}