Skip to main content
Version: 1.x

Repartition

Filter plugin : Repartition​

Description​

Adjust the number of underlying spark rdd partition to increase or decrease degree of parallelism. This filter is mainly to adjust the data processing performance.

Options​

nametyperequireddefault value
num_partitionsnumberyes-
num_partitions [number]​

Target partition number.

Examples​

repartition {
num_partitions = 8
}