Supplementary material for "Trustworthy model evaluation on a budget"
[Paper]
Published at ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models
@inproceedings{fostiropoulos2023trustworthy,
  title={Trustworthy model evaluation on a budget},
  author={Fostiropoulos, Iordanis and Brown, Bowman Noah and Itti, Laurent},
  booktitle={ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models},
  year={2023}
}
Installation instructions are based on Ubuntu 18.04 with Python 3.10+:
- git clone https://github.com/fostiropoulos/trust_ml
- cd trust_ml
- pip install -e .
The raw data used for our experiments is from:
Dong, Xuanyi, et al. "NATS-Bench: Benchmarking NAS algorithms for architecture topology and size." IEEE Transactions on Pattern Analysis and Machine Intelligence 44.7 (2021): 3634-3646.
NOTE: We provide the preprocessed dataset in dataset.pickle.
https://github.com/D-X-Y/NATS-Bench
https://drive.google.com/file/d/1vzyK0UVH2D3fTpa1_dSWnp1gvGpAxRul/view?usp=sharing
You can download it from the command line:
- pip install gdown
- gdown https://drive.google.com/uc?id=1vzyK0UVH2D3fTpa1_dSWnp1gvGpAxRul
To run the evaluation for the second experiment:
- python -m trustml.exp2.evaluate
To regenerate the NATS-Bench dataset you can run:
- python -m trustml.exp1.data
The preprocessed dataset is saved at data/dataset.pickle.
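Once data/dataset.pickle is downloaded or regenerated, it can be loaded with Python's standard pickle module. The internal schema of dataset.pickle is not documented in this README, so the sketch below only demonstrates the loading step; the dummy round-trip at the bottom (file name demo.pickle and its keys are illustrative, not part of the real dataset) stands in for the actual file.

```python
import pickle
from pathlib import Path

def load_dataset(path="data/dataset.pickle"):
    # The schema of dataset.pickle is not documented in this README,
    # so inspect the returned object before relying on its structure.
    with open(path, "rb") as f:
        return pickle.load(f)

# Stand-in demonstration: write a small dummy pickle and load it back.
# Point load_dataset at data/dataset.pickle to load the real dataset.
dummy = {"arch_id": [0, 1, 2], "accuracy": [0.91, 0.88, 0.93]}
Path("demo.pickle").write_bytes(pickle.dumps(dummy))
data = load_dataset("demo.pickle")
print(sorted(data.keys()))  # -> ['accuracy', 'arch_id']
```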
To re-run the ablation of the sampler, delete data/results.pickle and run:
- python -m trustml.exp1.main
To re-train the model for the second experiment:
- python -m trustml.exp2.train