Draft visualizer #448

Merged · 53 commits merged from draft-visualizer into draft on Apr 24, 2025
Conversation

@mzuenni (Collaborator) commented Apr 5, 2025

Todo input visualizer:

  • bt upgrade needs to read generators.yaml, and if it has a visualizer: key, move the target to input_visualizer/ and remove it from generators.yaml. It should then warn that the visualizer itself needs to be rewritten to no longer read from testcase.in, but from stdin. (A rough migration sketch follows after this list.)
  • rename Program.visualizer
  • move to visualize.py
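
A rough, standalone sketch of that bt upgrade migration, assuming a PyYAML-based rewrite and treating the visualizer: value as a single path relative to the problem root (the real implementation would preserve YAML comments and handle full commands; all names here are illustrative only):

#!/usr/bin/env python3
import shutil
import sys
from pathlib import Path

import yaml  # PyYAML, used here only for illustration

problem = Path(sys.argv[1])
generators_yaml = problem / "generators" / "generators.yaml"
config = yaml.safe_load(generators_yaml.read_text())

command = config.pop("visualizer", None)
if command:
    # Assumption: the command is a single path starting with "/", relative to the problem root.
    target = problem / command.lstrip("/")
    (problem / "input_visualizer").mkdir(exist_ok=True)
    shutil.move(str(target), str(problem / "input_visualizer" / target.name))
    generators_yaml.write_text(yaml.safe_dump(config))
    print("WARNING: the visualizer must be rewritten to read the test case from stdin instead of testcase.in")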

Todo output visualizers:

  • implement this
  • Allow enabling output visualizers for bt run (and bt test?) with a command-line argument
  • The resulting judgeimage.<ext> of the output visualizer can be copied back to data/ when bt generate runs it on the canonical jury solution, but only when there is no input visualizer, because there can be at most one image per test case.
  • Related to the above: should we do something when any input validator or generator happens to write image files? => they get overwritten if any visualizer is available
  • Related to the above: should we do something when the output validator or generator happens to write image files? => they get overwritten if any visualizer is available

@mzuenni (Collaborator, Author) commented Apr 6, 2025

Some notes:
My implementation uses a different InputVisualizer invocation: the visualizer is called with the input and answer files instead of only reading the input from stdin. On the one hand, this is closer to the old behaviour that BAPCtools has always used; on the other hand, the answer file might be necessary for the visualization, especially if it is not a valid output. (The name InputVisualizer might be confusing...) It also makes the invocation of input and output visualizers more similar.
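
To make that invocation concrete, here is a toy input visualizer matching the description; the argument order (input file, then answer file) and the idea of writing the image into a directory passed as a third argument are assumptions for illustration, not the interface fixed by this PR:

#!/usr/bin/env python3
# Toy sketch: argv[1] = test case input, argv[2] = test case answer.
# Writing judgeimage.svg into a directory passed as argv[3] is an assumption for this example,
# and the input format (whitespace-separated integers) is made up.
import sys
from pathlib import Path

values = list(map(int, Path(sys.argv[1]).read_text().split()))
answer = Path(sys.argv[2]).read_text().strip()

with open(sys.argv[3] + "/judgeimage.svg", "w") as f:
    f.write('<svg xmlns="http://www.w3.org/2000/svg" width="400" height="60">')
    for i, v in enumerate(values):
        f.write(f'<circle cx="{20 + 30 * i}" cy="30" r="{max(2, min(v, 12))}" />')
    f.write(f'<text x="5" y="55" font-size="10">answer: {answer}</text>')
    f.write("</svg>")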

@mzuenni requested a review from mpsijm on April 7, 2025 12:34
@thorehusfeldt (Collaborator) commented:

Thank you for doing this.

We currently have

	visualizer?:  #command & =~"^/" & !~"\\{count" | null

Should generators.yaml actually support something like this instead?:

visualizer?: ["in" | "out"]: #command | null

In particular, the requirement that visualizer invocations start with / is now void, because the directory structure is fixed. So the path should instead be relative to input_visualizer and output_visualizer.

But it’s not clear to me that this shall be specified in generators at all any more. When output_visualizer exists, it will be run, and with a narrowly specified invocation. The same may be true of input_visualizer. So what’s left to specify? (Honest question.) Sometimes one may want to switch visualization off, so there’s a case to be made for

visualizer?: ["in" | "out"]: *true | false

I can also see myself passing arguments to a visualizer, so maybe we want

visualizer?: ["in" | "out"]: string | false

instead?

@mpsijm (Collaborator) commented Apr 9, 2025

But it’s not clear to me that this shall be specified in generators at all any more. When output_visualizer exists, it will be run, and with a narrowly specified invocation. The same may be true of input_visualizer. So what’s left to specify?

Agreed, the current visualizer key in generators.yaml is obsolete now, because the mere existence of the visualizer signals that it should be run. If you want to switch off visualization, we may want to add command-line options like --no{,-input,-output}-visualizer. Or do you mean that you may want to disable the visualizer for certain test groups? And regarding passing arguments to the visualizer, shouldn't those live in test_group.yaml (currently still called testdata.yaml in BAPCtools, rename pending), just like the validator arguments? Perhaps a setting for enabling/disabling the visualizer for a test group should also live in test_group.yaml.

(still didn't have time to look at the code in detail or try it out... just replying to Thore's suggestions, thanks for looking at this! ❤️)

@mzuenni (Collaborator, Author) commented Apr 9, 2025

Should generators.yaml actually support something like this instead?:
[...]
So what’s left to specify?

I don't think we need to specify anything in generators.yaml: if you provide an input_visualizer, you already show that you want the visualization. And if you temporarily don't want to run it, there is bt generate --no-visualizer.

If you don't want to run it for some test group/test cases, you can implement this via <input, output>_visualizer_args keys (note that in my implementation, visualizers are not called with validator args). We could in theory also add to the spec that _visualizer_args is str | list[str] | False, where False indicates that the visualizer should not be run... But I don't know if we need that.
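
For illustration, a hypothetical test_group.yaml (still called testdata.yaml in BAPCtools today) using the keys mentioned above; the key names follow this discussion, but the values are made up:

# data/secret/group1/test_group.yaml (hypothetical example)
input_visualizer_args: --scale 2
output_visualizer_args: ["--no-grid"]
# A possible extension discussed above: `false` to disable the visualizer for this group.
# input_visualizer_args: false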

@thorehusfeldt (Collaborator) commented Apr 9, 2025

Even more basic question: what does data/sample/01.pdf mean? (Is it the result of running the input visualizer on 01.in?) That would make sense to me, but then the images in levellinglocks would violate this convention (because they show instances plus a solution). Do we really need data/sample/01.in.pdf now?

@mpsijm (Collaborator) commented Apr 9, 2025

If we'd strictly follow the spec, the resulting images of the input/output visualizer would not automatically end up in data/ at all; this is a choice that we make in BAPCtools. Also, according to the spec, the image in data/ is an "Illustration of the test case" (whether that's only input, or both input and answer, is the choice of the problem setter). I already thought about this in #438 (comment), and in summary, I think we should do something like this (EDIT: from @mzuenni's comment below, it looks like this is the current implementation):

  • If the input visualizer produces an image, we place that in data/.
  • Else, if the output visualizer produces an image with the canonical jury submission (i.e. the one from which we generate .ans files), we place that in data/.
  • If both produce an image, the result of the output visualizer only stays in feedback_dir (this probably already happens for the non-canonical jury submissions anyway).

So I don't think we need to disambiguate 1.{in,ans}.png, unless you think we need multiple images per test case in some cases (currently, the spec only allows one image file per test case, which I think is fine). A rough sketch of this copy-back decision follows below.
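
A minimal sketch of that copy-back decision (a hypothetical helper, not the actual BAPCtools code; the judgeimage name and the .svg extension are just examples):

import shutil
from pathlib import Path

def copy_visualization(data_dir: Path, testcase: str, in_vis_feedback: Path, out_vis_feedback: Path, ext: str = ".svg") -> None:
    # in_vis_feedback:  feedback dir of the input visualizer run (if any)
    # out_vis_feedback: feedback dir of the output visualizer run on the canonical jury submission
    input_image = in_vis_feedback / f"judgeimage{ext}"
    output_image = out_vis_feedback / f"judgeimage{ext}"
    if input_image.is_file():
        shutil.copy(input_image, data_dir / f"{testcase}{ext}")   # input visualizer takes precedence
    elif output_image.is_file():
        shutil.copy(output_image, data_dir / f"{testcase}{ext}")  # fallback: canonical output visualization
    # If both exist, the output visualizer's image simply stays in its feedback dir.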

@mzuenni (Collaborator, Author) commented Apr 9, 2025

Even more basic question: what does data/sample/01.pdf mean? (Is it the result of running the input visualizer on 01.in?)

Short answer (in terms of BAPCtools): yes.
More detailed answer: It's the visualization of the test case (what exactly that means is up to you/the problem). In terms of bt generate, it's the output of the input_visualizer, but if no input_visualizer exists, we try to use the output_visualizer as a fallback to generate a visualization of the test case.

Do we really need data/sample/01.in.pdf now?

I don't see the need for this.

@mzuenni (Collaborator, Author) commented Apr 12, 2025

@mpsijm will you have time to take a look?

@mzuenni marked this pull request as ready for review on April 12, 2025 21:26
@mpsijm (Collaborator) left a review:

Thank you so much for picking this up! ❤️ I see a lot has changed, and the majority is looking very good 😄 Some nit-pick comments or questions below 🙂

I didn't test super extensively, but I'm very happy that @thorehusfeldt also took the time to write some visualizers! ❤️

@mpsijm (Collaborator) commented:

In https://icpc.io/problem-package-format/spec/2023-07-draft.html#reporting-additional-feedback, it is mentioned that the output validator can also write image files to the feedback directory. Currently, bt generate only runs the answer validator, but I guess that we may also want to run the output validator on the .ans file, if an output validator exists? I think this could be added to TestcaseRule.validate_ans_and_out?

@mzuenni (Collaborator, Author) commented Apr 15, 2025:

This already happens, right?
If there is a .out, we run the output validator, and if ans_is_output is true, the output validator is used as the answer validator and is therefore also run.

@mpsijm (Collaborator) commented Apr 15, 2025:

Sure, but I'm talking about the case where a .out file does not exist, which is the case for all secret test cases.

Take for example Jib Job. Say that, as the implementer of that problem, I want to visualize what the output of submissions looks like, and use this as the test case image as well. This problem could have an output_visualizer like this:

output_visualizer
#!/usr/bin/env python3
import sys

# argv[1] is the test case input file; the submission's output comes from stdin;
# argv[3] is the feedback directory to which the image is written.
f = open(sys.argv[1]).readlines()
n = int(f[0])
team_jibs = list(map(int, sys.stdin.read().split()))
assert len(team_jibs) == n

# Bounding box over all cranes, used as the SVG viewBox.
xl, xh, yl, yh = 20000, -10000, 20000, -10000

cranes = []
for line, r in zip(f[1:], team_jibs):
    x, y, h = map(int, line.split())
    cranes.append((x, y, r))

    xl = min(xl, x - r)
    yl = min(yl, y - r)
    xh = max(xh, x + r)
    yh = max(yh, y + r)

# Scale the longer side to 1024 pixels, keeping the aspect ratio.
w = xh - xl
h = yh - yl
if w > h:
    h *= 1024 / w
    w = 1024
else:
    w *= 1024 / h
    h = 1024

with open(sys.argv[3] + "/judgeimage.svg", "w") as of:
    of.write(
        f'<svg xmlns="http://www.w3.org/2000/svg" viewBox="{xl} {yl} {xh - xl} {yh - yl}" width="{w}" height="{h}" style="fill:rgba(0,0,255,0.5)">'
    )
    for x, y, r in cranes:
        of.write(f'<circle cx="{x}" cy="{y}" r="{r}" />')
    of.write("</svg>")

I don't need an input_visualizer (or "test case visualizer"), because it would do exactly the same as the output visualizer, with the difference that it would read from a fixed .ans file, rather than reading a submission's output from stdin.

Now, I could also decide to write similar code in the output validator instead, rather than having a separate output visualizer. But, currently, the output validator is not run during bt generate for this problem.

(to test this, I currently just appended this to the main function of the output validator of Jib Job, because I didn't feel like actually writing the same visualizer in C++ 😛)

    ofstream vis(string(argv[3]) + "/judgeimage.svg");
    vis << n << endl;
    vis.close();

@mzuenni (Collaborator, Author) commented:

In the case where .out does not exist (and ans_is_output is set), the output validator should already be run (as well as any other answer validator)?

We just don't copy the image... at least that's how I understand the code right now.

@mzuenni (Collaborator, Author) commented:

Hmm, this is kind of ugly for hashing... we now have multiple programs that could create the same file, and we don't know which one is actually responsible ^^' This would mean that if either the validator or the visualizer changes, both need to be rerun?

@mzuenni (Collaborator, Author) commented:

I think even the current implementation of running the output_visualizer is broken with regard to caching...
The output visualizer can read files written by the output_validator, but the current implementation can't guarantee that these files still exist...

@mzuenni (Collaborator, Author) commented Apr 16, 2025:

I have been thinking about this a lot, and I think that for interactive or multi-pass problems, using the output_visualizer is a bad idea, since it visualizes the solution, which is not part of the test case.

For ans_is_output problems, or those with a .out file, we can use the feedback dir; we just have no guarantee in what order input, output, and answer validators were run...

@mpsijm (Collaborator) commented:

the output_visualizer [...] visualizes the solution, which is not part of the test case

But it makes sense that the output visualizer visualizes a possible output that represents a solution, right?

we just have no guarantee in what order input, output, and answer validators were run...

Aren't we the ones writing the code? 😛 Especially during bt generate: either all validators are run (and I assume in the order "input"-"answer"-"output"), or they are all suppressed with --no-validators. I guess we could make --no-validators imply --no-visualizer because it may be the case that the visualizer reads files from the feedback dir that were left behind by the validators, but to be fair, this will probably not happen very often, so I don't think we need to add this restriction. It may go wrong in some rare cases, but I guess that's fine?

@mzuenni (Collaborator, Author) commented Apr 17, 2025:

Aren't we the ones writing the code? 😛

Yes, but there are these weird cases... for example, if test cases are symlinked because of input:. In that case, some of the validators are rerun ^^

But it makes sense that the output visualizer visualizes a possible output that represents a solution, right?

Not sure about this ^^' It probably depends on the kind of problem. For guess, for example, I think it's quite strange to visualize a binary search?

@mpsijm (Collaborator) commented:

Yeah, I'm fine with not (fully) supporting visualizers for interactive test cases for now. Same as for the question of an output validator producing image files, we may want to document this somewhere 🙂

@mpsijm force-pushed the draft-visualizer branch from 608b5b1 to 9aa7cd4 on April 14, 2025 20:33
@mpsijm (Collaborator) commented Apr 20, 2025

Updated the spec in Kattis/problem-package-format#439 (and hopefully also correctly updated BAPCtools accordingly, I'll check tomorrow if the tests passed) 😄

@mpsijm force-pushed the draft-visualizer branch from 217eb03 to 0d2333f on April 21, 2025 08:37
@mzuenni merged commit baae8d8 into draft on Apr 24, 2025
6 checks passed
@mzuenni deleted the draft-visualizer branch on April 24, 2025 19:34
@mzuenni mentioned this pull request on Apr 24, 2025