MaterializeInc · mgree · Jun 4, 2025 · May 20, 2025 · May 21, 2025 · May 21, 2025
@@ -0,0 +1,337 @@
+---
+title: "EXPLAIN ANALYZE"
+description: "Reference page for `EXPLAIN ANALYZE`. `EXPLAIN ANALYZE` is used to understand the performance of indexes and materialized views."
+menu:
+  main:
+    parent: commands
+---
+
+`EXPLAIN ANALYZE`:
+
+- Reports on the performance of indexes and materialized views.
+- Provides the execution plan annotated with TopK hints. The TopK
+  query pattern groups by some key and returns the first K elements within each
+  group according to some ordering.
+
+{{< warning >}}
+`EXPLAIN` is not part of Materialize's stable interface and is not subject to
+our backwards compatibility guarantee. The syntax and output of `EXPLAIN` may
+change arbitrarily in future versions of Materialize.
+{{< /warning >}}
+
+## Syntax
+
+```mzsql
+EXPLAIN ANALYZE
+      CPU [, MEMORY] [WITH SKEW]
+    | MEMORY [, CPU] [WITH SKEW]
+    | HINTS
+FOR INDEX <name> | MATERIALIZED VIEW <name>
+[ AS SQL ]
+;
+```
+
+{{< tip >}}
+If you specify both `CPU` and `MEMORY`, they may be listed in any order;
+however, each may appear only once.
+{{</ tip >}}
+
+Parameter    | Description
+-------------|-----
+**CPU**      | Annotates the LIR plan with the consumed CPU time information `total_elapsed` for each operator (not inclusive of its child operators).
+**MEMORY**   | Annotates the LIR plan with the consumed memory information `total_memory` and number of records `total_records` for each operator.
+**WITH SKEW** | *Optional.* If specified, includes additional information about average and per-worker consumption and ratios (of `CPU` and/or `MEMORY`).
+**HINTS**    | Annotates the LIR plan with [TopK hints].
+**AS SQL**   | *Optional.* If specified, returns the SQL associated with the specified `EXPLAIN ANALYZE` command without executing it. You can modify this SQL as a starting point to create customized queries.
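+
+As an illustration of how these parameters combine, the following sketch applies
+the syntax above to a materialized view. The name `my_mview` is hypothetical and
+stands in for any materialized view in your environment:
+
+```mzsql
+-- Memory columns first, then CPU, with per-worker skew information.
+-- `my_mview` is a placeholder name, not an object defined on this page.
+EXPLAIN ANALYZE MEMORY, CPU WITH SKEW FOR MATERIALIZED VIEW my_mview;
+
+-- Return the underlying SQL for the TopK hints report instead of running it.
+EXPLAIN ANALYZE HINTS FOR MATERIALIZED VIEW my_mview AS SQL;
+```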
+
+## Privileges
+
+The privileges required to execute this statement are:
+
+- `USAGE` privileges on the schemas that all relations in the explainee are contained in.
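+
+As a minimal sketch, assuming the explained index and its view live in the
+`materialize.public` schema and that a role named `analyst` exists (the role
+name is hypothetical), the required privilege could be granted as follows:
+
+```mzsql
+-- `analyst` is a placeholder role; substitute the role that runs EXPLAIN ANALYZE.
+GRANT USAGE ON SCHEMA materialize.public TO analyst;
+```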
+
+## Examples
+
+The attribution examples in this
+section reference the `wins_by_item` index (and the underlying `winning_bids`
+view) from the [quickstart
+guide](/get-started/quickstart/#step-2-create-the-source):
+
+```sql
+CREATE SOURCE auction_house
+FROM LOAD GENERATOR AUCTION
+(TICK INTERVAL '1s', AS OF 100000)
+FOR ALL TABLES;
+
+CREATE VIEW winning_bids AS
+  SELECT DISTINCT ON (a.id) b.*, a.item, a.seller
+    FROM auctions AS a
+    JOIN bids AS b
+      ON a.id = b.auction_id
+   WHERE b.bid_time < a.end_time
+     AND mz_now() >= a.end_time
+   ORDER BY a.id, b.amount DESC, b.bid_time, b.buyer;
+
+CREATE INDEX wins_by_item ON winning_bids (item);
+```
+
+### `EXPLAIN ANALYZE MEMORY`
+
+The following example reports on the memory usage of the index `wins_by_item`:
+
+```mzsql
+EXPLAIN ANALYZE MEMORY FOR INDEX wins_by_item;
+```
+
+For the index, `EXPLAIN ANALYZE MEMORY` reports on the memory usage and the
+number of records for each operator in the dataflow:
+
+
+| operator                         | total_memory | total_records |
+| :------------------------------- | -----------: | ------------: |
+| Arrange 7                        | 1116 kB      | 39976         |
+|   Get::PassArrangements u274     | null         | null          |
+| **TopK::Basic 5**                | **100 MB**   | **1899733**   |
+|   Join::Differential 2 » 4       | null         | null          |
+|     Arrange 3                    | 5672 kB      | 220375        |
+|       Get::PassArrangements u271 | null         | null          |
+|     Arrange 1                    | 3501 kB      | 188785        |
+|       Get::Collection u270       | null         | null          |
+
+The results show the `TopK` operator is overwhelmingly responsible for memory
+usage.
+
+
+### `EXPLAIN ANALYZE CPU`
+
+The following example reports on the CPU usage of the index `wins_by_item`:
+
+```mzsql
+EXPLAIN ANALYZE CPU FOR INDEX wins_by_item;
+```
+
+For the index, `EXPLAIN ANALYZE CPU` reports on total time spent in each
+operator (not inclusive of its child operators) in the dataflow:
+
+| operator                         | total_elapsed   |
+| :------------------------------- | --------------: |
+| Arrange 7                        | 00:00:03.14266  |
+|   Get::PassArrangements u274     | null            |
+| TopK::Basic 5                    | 00:00:44.079592 |
+|   Join::Differential 2 » 4       | 00:00:06.372705 |
+|     Arrange 3                    | 00:00:21.49465  |
+|       Get::PassArrangements u271 | 00:00:00.066236 |
+|     Arrange 1                    | 00:00:11.212331 |
+|       Get::Collection u270       | 00:00:00.190449 |
+
+### `EXPLAIN ANALYZE CPU, MEMORY`
+
+You can report on both CPU and memory usage simultaneously:
+
+```mzsql
+EXPLAIN ANALYZE CPU, MEMORY FOR INDEX wins_by_item;
+```
+
+You can specify `CPU` and `MEMORY` in any order; however, each may appear
+only once. The order of `CPU` and `MEMORY` in the statement determines the order
+of the output columns.
+
+For example, in the statement above, `CPU` is listed before `MEMORY`, so
+the CPU time column (`total_elapsed`) is listed before the memory columns
+(`total_memory` and `total_records`).
+
+| operator                         | total_elapsed   | total_memory | total_records |
+| :------------------------------- | --------------: | -----------: | ------------: |
+| Arrange 7                        | 00:00:03.151386 | 1234 kB      | 42359         |
+|   Get::PassArrangements u274     | null            | null         | null          |
+| TopK::Basic 5                    | 00:00:44.347959 | 105 MB       | 2013547       |
+|   Join::Differential 2 » 4       | 00:00:06.389385 | null         | null          |
+|     Arrange 3                    | 00:00:21.558754 | 5431 kB      | 233533        |
+|       Get::PassArrangements u271 | 00:00:00.06644  | null         | null          |
+|     Arrange 1                    | 00:00:11.246103 | 3546 kB      | 191168        |
+|       Get::Collection u270       | 00:00:00.190935 | null         | null          |
+
+
+### `EXPLAIN ANALYZE ... WITH SKEW`
+
+In clusters with more than one worker, [worker
+skew](/transform-data/dataflow-troubleshooting/#is-work-distributed-equally-across-workers)
+can occur when data is unevenly distributed across workers. Extreme cases of
+skew can seriously impact performance. You can use `EXPLAIN ANALYZE ... WITH
+SKEW` to identify this scenario. The `WITH SKEW` option includes the per-worker
+and average worker performance numbers for each operator, along with each
+worker's ratio compared to the average.
+
+For the below example, assume there are 2 workers in the cluster.
+
+{{< tip >}}
+
+To determine how many workers a given cluster size has, you can query
+[`mz_catalog.mz_cluster_replica_sizes`](/sql/system-catalog/mz_catalog/#mz_cluster_replica_sizes).
+
+{{</ tip >}}
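+
+A query along the following lines lists the worker counts per size. It is a
+sketch that assumes the `size`, `processes`, and `workers` columns of
+`mz_cluster_replica_sizes`, with `workers` counted per process; check the linked
+catalog reference for the exact columns in your version:
+
+```mzsql
+-- Assumed columns: size, processes, workers (workers is per process).
+SELECT size, processes, workers
+FROM mz_catalog.mz_cluster_replica_sizes
+ORDER BY size;
+```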
+
+You can explain `MEMORY` and/or `CPU` with the `WITH SKEW` option. For example,
+the following runs `EXPLAIN ANALYZE MEMORY WITH SKEW`:
+
+```mzsql
+EXPLAIN ANALYZE MEMORY WITH SKEW FOR INDEX wins_by_item;
+```
+
+The results include the per-worker and average worker performance numbers for
+each operator, along with each worker's ratio compared to the average:
+
+| operator                         | worker_id | memory_ratio | worker_memory | avg_memory | total_memory | records_ratio | worker_records | avg_records | total_records |
+| :------------------------------- | --------: | -----------: | ------------: | ---------: | -----------: | ------------: | -------------: | ----------: | ------------: |
+| Arrange 7                        | 0         | 1.24         | 768 kB        | 619 kB     | 1238 kB      | 1.2           | 25485          | 21246       | 42492         |
+| Arrange 7                        | 1         | 0.76         | 470 kB        | 619 kB     | 1238 kB      | 0.8           | 17007          | 21246       | 42492         |
+|   Get::PassArrangements u274     | null      | null         | null          | null       | null         | null          | null           | null        | null          |
+| TopK::Basic 5                    | 0         | 1            | 53 MB         | 53 MB      | 105 MB       | 1             | 1011624        | 1010259.5   | 2020519       |
+| TopK::Basic 5                    | 1         | 1            | 52 MB         | 53 MB      | 105 MB       | 1             | 1008895        | 1010259.5   | 2020519       |
+|   Join::Differential 2 » 4       | null      | null         | null          | null       | null         | null          | null           | null        | null          |
+|     Arrange 3                    | 0         | 1            | 2726 kB       | 2724 kB    | 5448 kB      | 1             | 117297         | 117167.5    | 234335        |
+|     Arrange 3                    | 1         | 1            | 2722 kB       | 2724 kB    | 5448 kB      | 1             | 117038         | 117167.5    | 234335        |
+|       Get::PassArrangements u271 | null      | null         | null          | null       | null         | null          | null           | null        | null          |
+|     Arrange 1                    | 0         | 1            | 1779 kB       | 1778 kB    | 3556 kB      | 1             | 95955          | 95750       | 191500        |
+|     Arrange 1                    | 1         | 1            | 1778 kB       | 1778 kB    | 3556 kB      | 1             | 95545          | 95750       | 191500        |
+|       Get::Collection u270       | null      | null         | null          | null       | null         | null          | null           | null        | null          |
+
+The ratio columns (`memory_ratio` and `records_ratio`) tell you whether a worker
+is particularly over- or under-loaded:
+
+- a ratio below 1 indicates a worker doing a below average amount of work.
+
+- a ratio above 1 indicates a worker doing an above average amount of work.
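+
+As a worked check using the `Arrange 7` rows in the table above, each worker's
+ratio is its own value divided by the per-operator average. The literals below
+are copied from that table (in kB); the query is only illustrative arithmetic
+and does not read any introspection data:
+
+```mzsql
+SELECT
+    round(768.0 / 619.0, 2) AS worker_0_memory_ratio, -- about 1.24, as reported
+    round(470.0 / 619.0, 2) AS worker_1_memory_ratio; -- about 0.76, as reported
+```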
+
+While there will always be some amount of variation, very high ratios indicate a
+skewed workload. Here the memory ratios are close to 1, indicating there is very
+little worker skew.
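+
+Following the syntax above, the same option can be applied to `CPU`, alone or
+together with `MEMORY`. The statements below are a sketch of those variants;
+their output is not shown here:
+
+```mzsql
+-- Per-worker CPU time for each operator.
+EXPLAIN ANALYZE CPU WITH SKEW FOR INDEX wins_by_item;
+
+-- Per-worker CPU time and memory usage in a single report.
+EXPLAIN ANALYZE CPU, MEMORY WITH SKEW FOR INDEX wins_by_item;
+```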
+
+### `EXPLAIN ANALYZE HINTS`
+
+`EXPLAIN ANALYZE HINTS` can annotate your plan (specifically, each TopK
+operator) with suggested [TopK hints]; i.e., [`DISTINCT ON INPUT GROUP SIZE=`
+value](/transform-data/idiomatic-materialize-sql/top-k/#query-hints-1).
+
+For example, the following runs `EXPLAIN ANALYZE HINTS` on the `wins_by_item`
+index:
+
+```mzsql
+EXPLAIN ANALYZE HINTS FOR INDEX wins_by_item;
+```
+
+The result shows that the `wins_by_item` index has only one `TopK` operator and
+suggests the hint (i.e., the `DISTINCT ON INPUT GROUP SIZE=` value) of `255.0`.
+
+| operator                         | levels | to_cut | hint  | savings |
+| :------------------------------- | -----: | -----: | ----: | ------: |
+| Arrange 7                        | null   | null   | null  | null    |
+|   Get::PassArrangements u274     | null   | null   | null  | null    |
+| TopK::Basic 5                    | 8      | 6      | 255.0 | 75 MB   |
+|   Join::Differential 2 » 4       | null   | null   | null  | null    |
+|     Arrange 3                    | null   | null   | null  | null    |
+|       Get::PassArrangements u271 | null   | null   | null  | null    |
+|     Arrange 1                    | null   | null   | null  | null    |
+|       Get::Collection u270       | null   | null   | null  | null    |
+
+With the hint information, you can recreate the view and index to improve memory
+usage:
+
+```sql
+DROP VIEW winning_bids CASCADE;
+
+CREATE VIEW winning_bids AS
+    SELECT DISTINCT ON (a.id) b.*, a.item, a.seller
+      FROM auctions AS a
+      JOIN bids AS b
+        ON a.id = b.auction_id
+     WHERE b.bid_time < a.end_time
+       AND mz_now() >= a.end_time
+   OPTIONS (DISTINCT ON INPUT GROUP SIZE = 255) -- use hint!
+  ORDER BY a.id,
+    b.amount DESC,
+    b.bid_time,
+    b.buyer;
+
+CREATE INDEX wins_by_item ON winning_bids (item);
+```
+
+Re-running the `TopK`-hints query will show only `null` hints; i.e., there are
+no hints because our `TopK` is now appropriately sized.
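+
+For reference, the hints query to re-run is the same statement used earlier in
+this section, now applied to the recreated index:
+
+```mzsql
+-- Same statement as before; run it after recreating the view and index.
+EXPLAIN ANALYZE HINTS FOR INDEX wins_by_item;
+```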
+
+To see if the index's memory usage has improved with the hint, rerun the
+following `EXPLAIN ANALYZE MEMORY` command:
+
+```mzsql
+EXPLAIN ANALYZE MEMORY FOR INDEX wins_by_item;
+```
+
+The results show that the `TopK` operator uses `30MB` of memory, a third of the
+[~100MB of memory it was using before](#explain-analyze-memory):
+
+| operator                         | total_memory | total_records |
+| -------------------------------- | ------------ | ------------- |
+| Arrange 7                        | 1093 kB      | 42720         |
+|   Get::PassArrangements u286     | null         | null          |
+| **TopK::Basic 5**                | **30 MB**    | **625638**    |
+|   Join::Differential 2 » 4       | null         | null          |
+|     Arrange 3                    | 5447 kB      | 235570        |
+|       Get::PassArrangements u271 | null         | null          |
+|     Arrange 1                    | 3485 kB      | 191730        |
+|       Get::Collection u270       | null         | null          |
+
+### `EXPLAIN ANALYZE ... AS SQL`
+
+Under the hood:
+
+- For returning Memory/CPU information, `EXPLAIN ANALYZE` runs SQL queries that
+correlate [`mz_introspection` performance
+information](https://materialize.com/docs/sql/system-catalog/mz_introspection/)
+with the LIR operators in
+[`mz_introspection.mz_lir_mapping`](../../sql/system-catalog/mz_introspection/#mz_lir_mapping).
+
+- For TopK hints, `EXPLAIN ANALYZE` uses the
+[`mz_introspection.mz_expected_group_size_advice`](/sql/system-catalog/mz_introspection/#mz_expected_group_size_advice)
+introspection source to offer hints on sizing `TopK` operators.
+
+You can append `AS SQL` to any `EXPLAIN ANALYZE` statement to see the SQL that
+would be run (without running it). You can then customize this SQL to report
+finer grained or other information. For example:
+
+```mzsql
+EXPLAIN ANALYZE HINTS FOR INDEX wins_by_item AS SQL;
+```
+
+The results show the SQL that `EXPLAIN ANALYZE` would run to get the TopK hints
+for the `wins_by_item` index:
+
+```none
+SELECT
+    repeat(' ', nesting * 2) || operator AS operator,
+    megsa.levels AS levels,
+    megsa.to_cut AS to_cut,
+    megsa.hint AS hint,
+    pg_size_pretty(savings) AS savings
+FROM
+    mz_introspection.mz_lir_mapping AS mlm
+        JOIN
+            mz_introspection.mz_dataflow_global_ids AS mdgi
+            ON (mlm.global_id = mdgi.global_id)
+        LEFT JOIN
+            mz_introspection.mz_expected_group_size_advice AS megsa
+            ON
+                (
+                    megsa.dataflow_id = mdgi.id
+                        AND
+                    mlm.operator_id_start <= megsa.region_id
+                        AND
+                    megsa.region_id < mlm.operator_id_end
+                )
+        JOIN
+            mz_introspection.mz_mappable_objects AS mo
+            ON (mlm.global_id = mo.global_id)
+WHERE mo.name = 'materialize.public.wins_by_item'
+ORDER BY mlm.lir_id DESC;
+```
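+
+As one possible customization, the generated query can be narrowed to only the
+operators that carry a hint. The following is a sketch rather than output from
+`EXPLAIN ANALYZE`: it assumes the generated SQL in your version of Materialize
+matches the query shown above, and it adds a single `megsa.hint IS NOT NULL`
+filter:
+
+```mzsql
+SELECT
+    repeat(' ', nesting * 2) || operator AS operator,
+    megsa.levels AS levels,
+    megsa.to_cut AS to_cut,
+    megsa.hint AS hint,
+    pg_size_pretty(savings) AS savings
+FROM
+    mz_introspection.mz_lir_mapping AS mlm
+        JOIN
+            mz_introspection.mz_dataflow_global_ids AS mdgi
+            ON (mlm.global_id = mdgi.global_id)
+        LEFT JOIN
+            mz_introspection.mz_expected_group_size_advice AS megsa
+            ON
+                (
+                    megsa.dataflow_id = mdgi.id
+                        AND
+                    mlm.operator_id_start <= megsa.region_id
+                        AND
+                    megsa.region_id < mlm.operator_id_end
+                )
+        JOIN
+            mz_introspection.mz_mappable_objects AS mo
+            ON (mlm.global_id = mo.global_id)
+WHERE mo.name = 'materialize.public.wins_by_item'
+  -- Added filter: drop operators with no advice (rows where the LEFT JOIN found no match).
+  AND megsa.hint IS NOT NULL
+ORDER BY mlm.lir_id DESC;
+```
+
+Because the filter discards rows without advice, the result no longer shows the
+full operator tree, only the `TopK` operators that have a suggested hint.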
+
+[TopK hints]: /transform-data/idiomatic-materialize-sql/top-k/#query-hints-1
@@ -125,7 +125,7 @@ The following query hints are valid within the `OPTION` clause.
 Hint | Value type | Description
 ------|------------|------------
 `AGGREGATE INPUT GROUP SIZE` | `uint8` | How many rows will have the same group key in an aggregation. Materialize can render `min` and `max` expressions more efficiently with this information.
-`DISTINCT ON INPUT GROUP SIZE` | `uint8` | How many rows will have the same group key in a `DISTINCT ON` expression. Materialize can render [Top K patterns](/transform-data/idiomatic-materialize-sql/top-k/) based on `DISTINCT ON` more efficiently with this information.
+`DISTINCT ON INPUT GROUP SIZE` | `uint8` | How many rows will have the same group key in a `DISTINCT ON` expression. Materialize can render [Top K patterns](/transform-data/idiomatic-materialize-sql/top-k/) based on `DISTINCT ON` more efficiently with this information. To determine the query hint size, see [`EXPLAIN ANALYZE HINTS`](/sql/explain-analyze/#explain-analyze-hints).
 `LIMIT INPUT GROUP SIZE` | `uint8` | How many rows will be given as a group to a `LIMIT` restriction. Materialize can render [Top K patterns](/transform-data/idiomatic-materialize-sql/top-k/) based on `LIMIT` more efficiently with this information.
 
 For examples, see the [Optimization](/transform-data/optimization/#query-hints) page.

@@ -183,7 +183,7 @@ ORDER BY fieldA, fieldZ ... ;
 ```
 
 For more information on setting `DISTINCT ON INPUT GROUP SIZE`, see
-[Optimization](/transform-data/optimization/#query-hints).
+[`EXPLAIN ANALYZE HINTS`](/sql/explain-analyze/#explain-analyze-hints).
 
 ## Examples