About Encoding Annotations #24

jetd1 · 2024-11-14T22:44:46Z

Thanks for releasing this great work! Could you provide more details on how your encode the annotations (such as depth) with the pretrained VAE? Does the model require any fine-tuning / additional training in order to map the annotation into the latent space.

jingheya · 2024-12-09T13:28:04Z

Sorry for my late reply. We do not fine-tune the pre-trained VAE and only adapt the annotations into 3-channel input, similar to Marigold and GeoWizard. Depth maps are repeated to 3 channels, surface normal maps (which originally are 3-channels) remain unchanged.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About Encoding Annotations #24

About Encoding Annotations #24

jetd1 commented Nov 14, 2024

jingheya commented Dec 9, 2024

About Encoding Annotations #24

About Encoding Annotations #24

Comments

jetd1 commented Nov 14, 2024

jingheya commented Dec 9, 2024