Commit b85c19f

docs: Document Parquet write options
1 parent 29db67f commit b85c19f

1 file changed: +10 -1 lines changed

mkdocs/docs/configuration.md

@@ -62,7 +62,16 @@ You can also set the FileIO explicitly:
 | ---------- | -------------------------------- | ----------------------------------------------------------------------------------------------- |
 | py-io-impl | pyiceberg.io.fsspec.FsspecFileIO | Sets the FileIO explicitly to an implementation, and will fail explicitly if it can't be loaded |
 
-For the FileIO there are several configuration options available:
+General configuration:
+
+| Key | Options | Default | Description |
+| --------------------------------- | --------------------------------- | ------- | ------------------------------------------------------------------------------------------- |
+| `write.parquet.compression-codec` | `{uncompressed,zstd,gzip,snappy}` | zstd | Sets the Parquet compression codec. |
+| `write.parquet.compression-level` | Integer | null | Parquet compression level for the codec. If not set, the level is left up to PyIceberg |
+| `write.parquet.page-size-bytes` | Size in bytes | 1MB | Sets a target threshold for the approximate encoded size of data pages within a column chunk |
+| `write.parquet.dict-size-bytes` | Size in bytes | 1MB | Sets the dictionary page size limit per row group |
+
+The FileIO-specific configuration is shown below.
 
 ### S3
 
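
To illustrate how the four write options in the table above might be assembled, here is a minimal sketch in plain Python. The helper `parquet_write_properties` is hypothetical and not part of PyIceberg. It assumes that the properties are passed as string key/value pairs, that "1MB" means 1024 * 1024 bytes, and that leaving the compression level unset is expressed by omitting the key.

```python
# Hypothetical helper for illustration only; not a PyIceberg API.
ALLOWED_CODECS = {"uncompressed", "zstd", "gzip", "snappy"}
ONE_MB = 1024 * 1024  # assumption: the documented "1MB" default is 1 MiB


def parquet_write_properties(codec="zstd", level=None,
                             page_size_bytes=ONE_MB, dict_size_bytes=ONE_MB):
    """Build a dict of the Parquet write options listed in the table."""
    if codec not in ALLOWED_CODECS:
        raise ValueError(f"unsupported codec: {codec!r}")
    props = {
        "write.parquet.compression-codec": codec,
        "write.parquet.page-size-bytes": str(page_size_bytes),
        "write.parquet.dict-size-bytes": str(dict_size_bytes),
    }
    # Omit the level when unset so the writer can pick its own default.
    if level is not None:
        props["write.parquet.compression-level"] = str(int(level))
    return props


props = parquet_write_properties(codec="gzip", level=6)
```

A dict like `props` could then be supplied wherever table properties are accepted, for example the `properties` argument when creating a table through a catalog. Check the PyIceberg documentation for the exact call in your version.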
