The Curse of Recursion: Training on Generated Data Makes Models Forget

This repository contains code for a publication "The Curse of Recursion: Training on Generated Data Makes Models Forget".

The paper can be found on here.

In case of questions please do not hesitate reaching out! To cite please use:

@misc{shumailov2023curse,
      title={The Curse of Recursion: Training on Generated Data Makes Models Forget}, 
      author={Ilia Shumailov and Zakhar Shumaylov and Yiren Zhao and Yarin Gal and Nicolas Papernot and Ross Anderson},
      year={2023},
      eprint={2305.17493},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

The codebase is not really a codebase, I cut out most things related to our specific hardware and slurm setup. Should be an easy backbone to replicate the experiments.

Our runner script is in the runme_base.py, dataset.py does data loading and main.py has all of the lightning specifics.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
Normals_GMM_experiments.ipynb		Normals_GMM_experiments.ipynb
README.md		README.md
VAE_experiments.ipynb		VAE_experiments.ipynb
dataset.py		dataset.py
main.py		main.py
plt_model.py		plt_model.py
runme_base.py		runme_base.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Curse of Recursion: Training on Generated Data Makes Models Forget

About

Releases

Packages

Languages

iliaishacked/curse_recurse

Folders and files

Latest commit

History

Repository files navigation

The Curse of Recursion: Training on Generated Data Makes Models Forget

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages