Training on my own dataset but getting black images after "sample-10" #101

Open
twentyfiveYang opened this issue Oct 4, 2022 · 2 comments

Thank you for sharing the wonderful code!
I was training on my own dataset but started getting black images after 9000 steps. Does anyone know how to fix this?
Here is my training script and results:

from denoising_diffusion_pytorch import Unet, GaussianDiffusion, Trainer

model = Unet(
    dim = 64,
    dim_mults = (1, 2, 4, 8)
).cuda()

diffusion = GaussianDiffusion(
    model,
    image_size = 64,
    timesteps = 1000,           # number of steps
    sampling_timesteps = 250,   # number of sampling timesteps (using ddim for faster inference [see citation for ddim paper])
    loss_type = 'l1'            # L1 or L2
).cuda()

trainer = Trainer(
    diffusion,
    'G:/code3/DDPM-main/data/train/',
    train_batch_size = 32,
    train_lr = 8e-5,
    train_num_steps = 700000,         # total training steps
    gradient_accumulate_every = 2,    # gradient accumulation steps
    ema_decay = 0.995,                # exponential moving average decay
    amp = True                        # turn on mixed precision
)

trainer.train()

[attached image: training sample results]

ruochiz commented Oct 4, 2022

Do you see NaN loss during training? Based on what I've tested, turning off amp=True resolves the NaN loss in certain cases (at the cost of slower training).
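
For reference, a minimal sketch of that change applied to the training script from the original post (only the amp flag differs; the model, diffusion, and every other Trainer argument are taken verbatim from the script above):

trainer = Trainer(
    diffusion,
    'G:/code3/DDPM-main/data/train/',
    train_batch_size = 32,
    train_lr = 8e-5,
    train_num_steps = 700000,         # total training steps
    gradient_accumulate_every = 2,    # gradient accumulation steps
    ema_decay = 0.995,                # exponential moving average decay
    amp = False                       # mixed precision off: slower, but avoids the NaN loss in the cases described above
)

trainer.train()

Watching the loss reported during training for nan values is the quickest way to confirm you are hitting this failure mode before the samples turn black.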

@twentyfiveYang (Author)

Thank you very much for your reply, it solved my problem.
