

Reinliu/ICGAN

ICGAN: AN IMPLICIT CONDITIONING METHOD FOR INTERPRETABLE FEATURE CONTROL OF NEURAL AUDIO SYNTHESIS

Generative models are typically conditioned on discrete labels, most often one-hot vectors when the conditions are classes. Such labels have limited expressiveness, lack continuity, and fail to capture the hierarchical semantic relationships between classes. We propose conditioning neural audio synthesis models on continuous vectors sampled from a Gaussian distribution whose mean and variance are learned by an encoder classifier. Once the model is trained, users can interpolate the conditioning vector to morph sounds between categories, even though the class labels are binary.
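The sampling and interpolation described above can be sketched in a few lines. This is a minimal illustration and not the code in this repository: the function names `sample_conditioning` and `interpolate`, the use of a log-variance, and the example mean and variance values are all assumptions made for the sketch. In the actual model the mean and variance would come from the trained encoder classifier.

```python
import math
import random

def sample_conditioning(mean, log_var, rng=random):
    """Draw z = mean + std * eps with eps ~ N(0, 1), element-wise.

    Assumes the encoder outputs a log-variance, so std = exp(0.5 * log_var).
    """
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mean, log_var)]

def interpolate(z_a, z_b, t):
    """Linear interpolation between two conditioning vectors, t in [0, 1]."""
    return [(1.0 - t) * a + t * b for a, b in zip(z_a, z_b)]

# Hypothetical encoder outputs for two classes (values made up for illustration).
mean_a, log_var_a = [1.0, -0.5, 0.2], [-2.0, -2.0, -2.0]
mean_b, log_var_b = [-0.8, 0.9, 0.1], [-2.0, -2.0, -2.0]

rng = random.Random(0)  # fixed seed so the sketch is repeatable
z_a = sample_conditioning(mean_a, log_var_a, rng)
z_b = sample_conditioning(mean_b, log_var_b, rng)

# Five conditioning vectors that move from class A to class B in equal steps.
morph_path = [interpolate(z_a, z_b, i / 4) for i in range(5)]
```

Each vector in `morph_path` would be passed to the generator in place of a one-hot label, so intermediate steps correspond to sounds between the two categories.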

Model architecture

[Figure: model architecture]

Environment configuration:

pip install -r requirements.txt

Preprocess:

Set the preprocessing and training parameters in config.json. Make sure your dataset folder contains subcategories (one subfolder per class), because the model uses the class labels as conditioning vectors. Run

python preprocess.py

to save the preprocessed features and data into a folder called 'preprocessed'.
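To check the dataset layout before preprocessing, a small helper like the one below can list the class subfolders and show the label each one would receive. This is a sketch under assumptions: the function names `class_labels_from_folders` and `one_hot` are hypothetical, and the sorted-name ordering of labels is an illustrative choice that may differ from what preprocess.py does.

```python
from pathlib import Path

def class_labels_from_folders(dataset_dir):
    """Map each subfolder name to an integer label, in sorted name order."""
    subfolders = sorted(p.name for p in Path(dataset_dir).iterdir() if p.is_dir())
    if not subfolders:
        raise ValueError(f"{dataset_dir} contains no class subfolders")
    return {name: index for index, name in enumerate(subfolders)}

def one_hot(index, num_classes):
    """Return the one-hot vector for a class index as a list of floats."""
    return [1.0 if i == index else 0.0 for i in range(num_classes)]
```

For a dataset folder with subfolders named, say, `footsteps` and `rain`, `class_labels_from_folders` would return `{"footsteps": 0, "rain": 1}`, and `one_hot(1, 2)` would give `[0.0, 1.0]`.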

Train:

Run

python train.py

to train your model.

Generate:

Run

python generate.py
