|
14 | 14 | REPORT_MD_TEMPLATE = """# Benchmark Report {{ id }} |
15 | 15 | config_id: `{{ report_id }}` |
16 | 16 |
|
17 | | -We have detected **{{ status }}** in benchmark results for `{{ report_id }}` (id: `{{ id }}`). |
18 | | -(HUD benchmark regression page coming soon...) |
| 17 | +See Page https://hud.pytorch.org/benchmark/regression/report/{{ id }} for more details. |
19 | 18 |
|
20 | | -> **Status:** {{ status }} · **Frequency:** {{ frequency }} |
| 19 | +Report Status: **{{ status }}** |
21 | 20 |
|
22 | | -## Summary |
23 | | -| Metric | Value | |
24 | | -| :-- | --: | |
25 | | -| Total | {{ summary.total_count | default(0) }} | |
26 | | -| Regressions | {{ summary.regression_count | default(0) }} | |
27 | | -| Suspicious | {{ summary.suspicious_count | default(0) }} | |
28 | | -| No Regression | {{ summary.no_regression_count | default(0) }} | |
29 | | -| Insufficient Data | {{ summary.insufficient_data_count | default(0) }} | |
30 | | -
|
31 | | -## Data Windows |
32 | | -Baseline is a single reference value (e.g., mean, max, min, latest) aggregated from the previous few days, |
33 | | -used to detect regressions by comparing against metric values in the target window. |
34 | | -
|
35 | | -### Baseline window (used to calculate baseline value) |
36 | | -- **Start:** `{{ baseline.start.timestamp | default('') }}` (commit: `{{ baseline.start.commit | default('') }}`) |
37 | | -- **End:** `{{ baseline.end.timestamp | default('') }}` (commit: `{{ baseline.end.commit | default('') }}`) |
38 | | -
|
39 | | -### Target window (used to compare against baseline value) |
40 | | -- **Start:** `{{ target.start.timestamp | default('') }}` (commit: `{{ target.start.commit | default('') }}`) |
41 | | -- **End:** `{{ target.end.timestamp | default('') }}` (commit: `{{ target.end.commit | default('') }}`) |
42 | | -
|
43 | | -{% if regression_items and regression_items|length > 0 %} |
44 | | -## Regression Glance |
45 | | -{% if url %} |
46 | | -Use items below in [HUD]({{ url }}) to see regression. |
47 | | -{% endif %} |
48 | | -
|
49 | | -{% set items = regression_items if regression_items|length <= 10 else regression_items[:10] %} |
50 | | -{% if regression_items|length > 10 %} |
51 | | -… (showing first 10 only, total {{ regression_items|length }} regressions) |
52 | | -{% endif %} |
53 | | -{% for item in items %} |
54 | | -{% set kv = item.group_info|dictsort %} |
55 | | -{{ "" }}|{% for k, _ in kv %}{{ k }} |{% endfor %}{{ "\n" -}} |
56 | | -|{% for _k, _ in kv %}---|{% endfor %}{{ "\n" -}} |
57 | | -|{% for _k, v in kv %}{{ v }} |{% endfor %}{{ "\n\n" -}} |
58 | | -{% if item.baseline_point -%} |
59 | | -- **baseline**: {{ item.baseline_point.value}}, |
60 | | -- **startTime**: {{ item.baseline_point.timestamp }}, **endTime**: {{ target.end.timestamp }} |
61 | | -- **lcommit**: `{{ item.baseline_point.commit }}`, **rcommit**: `{{ target.end.commit }}` |
62 | | -{{ "\n" }} |
63 | | -{%- endif %} |
64 | | -{% endfor %} |
65 | | -{% endif %} |
| 21 | +- Total: {{ summary.total_count | default(0) }} |
| 22 | +- Regressions: {{ summary.regression_count | default(0) }} |
| 23 | +- Suspicious: {{ summary.suspicious_count | default(0) }} |
| 24 | +- No Regression: {{ summary.no_regression_count | default(0) }} |
| 25 | +- Insufficient Data: {{ summary.insufficient_data_count | default(0) }} |
66 | 26 | """ |
67 | 27 |
|
68 | 28 |
|
|
0 commit comments