
Establish Concrete Metrics for Pipeline Tasks #25

Open
Micky774 opened this issue Sep 13, 2021 · 2 comments

Comments

@Micky774 (Collaborator)

It may be worth codifying exactly what the goal of the pipeline is. Roughly speaking, it is to create a potent representation for downstream tasks; classification (e.g. a simple ML model using the representation as features) is one manifestation of such a task. What other tasks could serve as tests of representation power? What qualitative features are desirable? And what baseline classification scores can be achieved with "null models", e.g. a random forest or a simple conv-net? (A rough baseline sketch follows the TL;DR list below.)

TL;DR

  • How can we directly calculate "representation power"?
  • What downstream tasks can demonstrate "representation power"?
  • What baseline performance can we generate on such downstream tasks?
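As a concrete starting point for the last question, here is a minimal sketch of a "null model" baseline: a random forest scored on the representation features via cross-validation. The `reps` and `labels` arrays are placeholders (assumptions), standing in for whatever the pipeline actually produces:

```python
# Hypothetical sketch of a "null model" baseline on learned representations.
# `reps` and `labels` are placeholder arrays -- substitute the pipeline's real outputs.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
reps = rng.normal(size=(200, 64))      # placeholder representation matrix (n_samples, n_features)
labels = rng.integers(0, 2, size=200)  # placeholder binary labels

# Random-forest baseline: a "potent" representation should clear this score.
clf = RandomForestClassifier(n_estimators=100, random_state=0)
scores = cross_val_score(clf, reps, labels, cv=5, scoring="accuracy")
print(f"baseline accuracy: {scores.mean():.3f} +/- {scores.std():.3f}")
```

Whatever representation the pipeline produces should beat this cross-validated score by a meaningful margin before we call it "potent".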
Micky774 added this to the Create Baseline Models milestone on Sep 14, 2021
@rmattson1008 (Contributor)

We should check robustness to noise. I don't think it would be too hard to compare this across "downstream" tasks like the ones in the last SciPy paper. A rough sketch of what that comparison could look like is below.
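This is only an illustrative sketch, not part of the existing pipeline: `encode` is a hypothetical stand-in for whatever model produces the representations, and the inputs and labels are placeholder arrays.

```python
# Hypothetical sketch of a noise-robustness check on a downstream task.
# `encode` stands in for the actual representation model (an assumption here).
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

def encode(x):
    # placeholder encoder: flatten inputs into 2-D feature vectors
    return x.reshape(len(x), -1)

rng = np.random.default_rng(0)
inputs = rng.normal(size=(200, 8, 8))   # placeholder raw inputs
labels = rng.integers(0, 2, size=200)   # placeholder labels

clf = RandomForestClassifier(n_estimators=100, random_state=0)
for sigma in (0.0, 0.1, 0.5):
    noisy = inputs + rng.normal(scale=sigma, size=inputs.shape)
    acc = cross_val_score(clf, encode(noisy), labels, cv=5).mean()
    print(f"noise sigma={sigma}: downstream accuracy {acc:.3f}")
```

The idea is simply to track how the downstream score degrades as the input noise level grows.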

@Micky774 (Collaborator, Author)

Agreed, that would be a great addition.
