The structure of Lotus-D #25

Cuirundi · 2024-11-16T09:38:59Z

In the training of Lotus-D, is only the image used as input to the UNet, or is the image concatenated with the ground truth depth as input?

jingheya · 2024-12-09T13:30:23Z

Sorry for my late reply. Only the image latent is used as input to the UNet model (4 channels).
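
To make the channel count concrete, here is a minimal shape-only sketch. It is not taken from the Lotus code. It assumes a Stable Diffusion style VAE with 4 latent channels and 8x spatial downsampling, and the function name and parameters are hypothetical. It contrasts the image-only input described above with the concatenated alternative raised in the question.

```python
def unet_input_shape(batch, height, width, concat_depth=False,
                     latent_channels=4, vae_downscale=8):
    """Return the (B, C, H, W) shape of the tensor fed to the UNet.

    Assumption: every VAE-encoded map (image or depth) has
    `latent_channels` channels at 1/`vae_downscale` resolution.
    """
    # Image-only input keeps 4 channels; concatenating a depth latent doubles it.
    channels = latent_channels * (2 if concat_depth else 1)
    return (batch, channels, height // vae_downscale, width // vae_downscale)


image_only = unet_input_shape(2, 512, 512)
with_depth = unet_input_shape(2, 512, 512, concat_depth=True)
print(image_only)  # (2, 4, 64, 64)
print(with_depth)  # (2, 8, 64, 64)
```

Under these assumptions, the 4-channel case corresponds to the reply above: the UNet sees only the image latent, and the depth map is not part of its input.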

Cuirundi · 2024-12-30T04:19:00Z

Dear Jingle He: Thank you for your reply. I have one more question. During training, should image reconstruction and depth estimation alternate between batches, for example depth estimation on one batch and image reconstruction on the next? Or should both tasks be run on the same batch in two separate UNet passes? Thank you again. I look forward to your reply.
