# Multiple architectures for generating bird sounds using the BirdCLEF 2023 dataset
- Clone the repository.
- Create and activate the environment (conda must be installed), then make the kernel visible in Jupyter (not needed if you always launch Jupyter from the `birdgen` env):

  ```shell
  conda env create -f env.yml
  conda activate birdgen
  python -m ipykernel install --user --name=birdgen
  ```
- Download the dataset from https://www.kaggle.com/competitions/birdclef-2023/data and put the `train_audio` folder in the working directory.
- Run `Selection_cris_via_energie.ipynb` if you wish to train models on a dataset more likely to contain bird sounds instead of simply the first 2 seconds of each file.
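The idea behind that selection step can be sketched as follows: slide a fixed-length window over the waveform and keep the window with the highest energy, on the assumption that loud segments are more likely to contain bird calls. This is a minimal, hypothetical illustration; the notebook's actual method may differ.

```python
# Hypothetical sketch of energy-based clip selection: find the
# 2-second window of a waveform with the highest total energy.
def best_window(samples, sample_rate, window_seconds=2.0, hop_seconds=0.5):
    """Return (start_index, end_index) of the highest-energy window."""
    win = int(window_seconds * sample_rate)
    hop = int(hop_seconds * sample_rate)
    if len(samples) <= win:
        return 0, len(samples)
    best_start, best_energy = 0, float("-inf")
    for start in range(0, len(samples) - win + 1, hop):
        energy = sum(x * x for x in samples[start:start + win])
        if energy > best_energy:
            best_start, best_energy = start, energy
    return best_start, best_start + win
```

For example, on a recording that is silent except for a burst in the middle, the returned window covers the burst rather than the quiet opening seconds.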
- Train the models by running the VAE, GAN and VAE-GAN notebooks. You can use the original dataset or the version where bird sounds are selected. To watch progress with TensorBoard, replace `[dir]` with `./vae`, `./gan` or `./vaegan` to get logs for a specific category of models, or simply `./` for everything (assuming you are in the `bird_soud_generation` folder):

  ```shell
  tensorboard --logdir=[dir]
  ```
- Run inference for each model with the corresponding notebooks to generate sound files; adapt the checkpoint paths to the models you trained.
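One way to adapt the checkpoint paths is to pick the most recently saved checkpoint programmatically instead of hard-coding a filename. The directory layout and `*.ckpt` extension below are assumptions; adjust them to however your training notebooks save weights.

```python
# Hypothetical helper: return the most recently modified checkpoint
# file in a directory, so inference notebooks pick up the latest run.
import glob
import os

def latest_checkpoint(checkpoint_dir, pattern="*.ckpt"):
    paths = glob.glob(os.path.join(checkpoint_dir, pattern))
    if not paths:
        raise FileNotFoundError(
            f"no checkpoints matching {pattern!r} in {checkpoint_dir}"
        )
    return max(paths, key=os.path.getmtime)
```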
- Compute the Fréchet Audio Distance (FAD) between the sounds generated by each model and the sounds in the validation set using the `FAD.ipynb` notebook.
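For intuition, the FAD is the Fréchet distance between two Gaussians fitted to audio embeddings (typically from a pretrained model such as VGGish). The sketch below reduces it to one dimension to show the formula; the real metric uses the full multivariate form with covariance matrices, which `FAD.ipynb` presumably relies on a library for.

```python
# 1-D illustration of the Frechet distance underlying FAD:
# (mu1 - mu2)^2 + sigma1 + sigma2 - 2 * sqrt(sigma1 * sigma2)
import math

def frechet_distance_1d(xs, ys):
    def mean_var(values):
        m = sum(values) / len(values)
        return m, sum((x - m) ** 2 for x in values) / len(values)

    m1, s1 = mean_var(xs)
    m2, s2 = mean_var(ys)
    return (m1 - m2) ** 2 + s1 + s2 - 2 * math.sqrt(s1 * s2)
```

Two identical sample sets give a distance of 0, and shifting one set by a constant `c` (same variance) gives exactly `c**2`.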
- `NDB.ipynb` is used to compute the Jensen-Shannon divergence and the Number of statistically Different Bins (NDB).
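The Jensen-Shannon divergence part can be sketched in a few lines: it is the average of two KL divergences against the mixture distribution. This is a generic illustration over discrete histograms (such as the bin-occupancy counts an NDB-style evaluation compares), not the notebook's exact code.

```python
# Jensen-Shannon divergence between two discrete distributions.
# With log base 2 the result lies in [0, 1].
import math

def js_divergence(p, q):
    def kl(a, b):
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)

    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)
```

Identical distributions give 0, and fully disjoint ones give the maximum value of 1.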
- `human_eval.ipynb` launches a website where users vote for the best audio from a random pair, to evaluate which model is better; `plot_human_eval.ipynb` will compute the win rate of each model against the others.
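The win-rate computation above can be sketched as follows: given pairwise votes, each model's win rate is its share of wins over the comparisons it appeared in. The `(winner, loser)` vote format is an assumption, not the notebook's actual data layout.

```python
# Compute each model's win rate from a list of (winner, loser) votes
# collected by the pairwise human-evaluation site.
from collections import Counter

def win_rates(votes):
    wins, appearances = Counter(), Counter()
    for winner, loser in votes:
        wins[winner] += 1
        appearances[winner] += 1
        appearances[loser] += 1
    return {model: wins[model] / appearances[model] for model in appearances}
```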