This is the documentation for CDH 5.1.x. Documentation for other versions is available at Cloudera Documentation.

Apache Parquet (incubating) Known Issues

Parquet file writes run out of memory if (number of partitions) times (block size) exceeds available memory

The Parquet output writer allocates one block for each table partition it is processing and writes partitions in parallel, so every partition being written holds a full block in memory at the same time. The writer runs out of memory if (number of partitions) times (Parquet block size) exceeds the available memory.

Bug: None

Severity: Medium

Workaround: None. If necessary, reduce the number of partitions in the table.
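
The memory condition described in this issue can be checked with simple arithmetic before running a write. The following is a minimal sketch, not part of any Parquet or CDH tool: the function names are hypothetical, and the 128 MiB block size, 4 GiB of available memory, and 100 partitions are assumed values chosen only for illustration. Substitute the figures for your own cluster and table.

```python
MIB = 1024 * 1024


def estimated_write_buffer_bytes(num_partitions: int, block_size_bytes: int) -> int:
    """Rough estimate of writer memory: one block held per partition being written."""
    return num_partitions * block_size_bytes


def max_partitions_that_fit(available_bytes: int, block_size_bytes: int) -> int:
    """Largest partition count whose blocks fit in the available memory."""
    return available_bytes // block_size_bytes


# Assumed example values (not taken from the documentation above).
block_size = 128 * MIB    # assumed Parquet block size
available = 4096 * MIB    # assumed memory available to the writer (4 GiB)
partitions = 100          # assumed number of partitions in the table

needed = estimated_write_buffer_bytes(partitions, block_size)

# 100 * 128 MiB = 12800 MiB, which exceeds 4096 MiB.
print(needed > available)                               # True
print(max_partitions_that_fit(available, block_size))   # 32
```

With these assumed numbers the write would need about 12.5 GiB, so the table would have to be reduced to roughly 32 partitions or fewer to stay within memory. The estimate covers only the per-partition blocks and ignores any other memory the writer uses, so treat it as a lower bound.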

Page generated September 3, 2015.