Skip to content

Save rankings feature#49

Open
warreveys wants to merge 2 commits intotechwolf-ai:mainfrom
warreveys:save-rankings-feature
Open

Save rankings feature#49
warreveys wants to merge 2 commits intotechwolf-ai:mainfrom
warreveys:save-rankings-feature

Conversation

@warreveys
Copy link
Copy Markdown
Collaborator

Description

Add a save_rankings: bool = False flag to workrb.evaluate() that persists per-target ranking score arrays for each ranking-task dataset under <output_folder>/rankings/<model_name>/__<dataset_id>.json. Each artifact also records model_name in its payload so files remain self-describing if moved.

To enable this without recomputing the prediction matrix, RankingTask.evaluate is split into compute_prediction_matrix + compute_metrics_from_prediction_matrix, the default behaviour is unchanged.

Checklist

  • Added new tests for new functionality
  • Tested locally with example tasks
  • Code follows project style guidelines
  • Documentation updated
  • No new warnings introduced

warreveys added 2 commits May 6, 2026 11:40
Add a `save_rankings: bool = False` flag to `workrb.evaluate()` that
persists per-target ranking score arrays for each ranking-task dataset
under `<output_folder>/rankings/<model_name>/<task>__<dataset_id>.json`.
Each artifact also records `model_name` in its payload so files remain
self-describing if moved.

To enable this without recomputing the prediction matrix, `RankingTask.evaluate`
is split into `compute_prediction_matrix` + `compute_metrics_from_prediction_matrix`;
default behavior is unchanged.
@warreveys warreveys marked this pull request as draft May 6, 2026 10:07
@warreveys warreveys marked this pull request as ready for review May 6, 2026 10:24
@warreveys warreveys requested a review from Mattdl May 7, 2026 06:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant