The CC3M training data has about 3M images. Did you use all of these images to train the model? According to the SDXL and SD1.5 configs, training seems to cover only 20000 max_train_steps * 10 train_batch_size * 8 GPUs = 1.6M images. I would like to know how many training samples were actually used. Thanks!
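
For reference, the estimate above can be reproduced with a short Python sketch. The step count, batch size, and GPU count are the values read from the configs. `gradient_accumulation_steps = 1` and the rounded dataset size of 3,000,000 are my assumptions and were not checked against the actual training setup.

```python
# Values as read from the SDXL / SD1.5 configs mentioned in the question.
max_train_steps = 20_000
train_batch_size = 10  # assumed to be the per-GPU batch size
num_gpus = 8

# Assumption: no gradient accumulation (not verified against the config).
gradient_accumulation_steps = 1

# Assumption: CC3M rounded to 3M image-text pairs.
dataset_size = 3_000_000

# Total samples processed over the whole run.
samples_seen = (
    max_train_steps * train_batch_size * num_gpus * gradient_accumulation_steps
)

# Fraction of the dataset covered if every sample were seen at most once.
epochs = samples_seen / dataset_size

print(f"samples seen: {samples_seen:,}")
print(f"approximate epochs over CC3M: {epochs:.2f}")
```

If gradient accumulation is larger than 1, or the batch size is already the global one, the real number would differ, which is why I am asking.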