Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Train Model With Multiple Input Images #435

Open
Neltherion opened this issue Feb 18, 2023 · 3 comments
Open

Train Model With Multiple Input Images #435

Neltherion opened this issue Feb 18, 2023 · 3 comments

Comments

@Neltherion
Copy link

Is it possible to change the model to accept more than one image as the input?

If I'm not mistaken, CLIP takes an image and a text as the inputs, extracts the features of these two inputs and finally gives us the logits of the distance of the image to the text.

So, is it possible to give two (or more) input images and extract ONE feature from the input images (just like before)?

I want to somehow mix the two inputs. For example, inputting an image alongside it's semantic segmentation as the input to the model. If it's possible, what parts of the code should I change? Or is this already implemented and usable?

Thanks.

@rom1504
Copy link
Collaborator

rom1504 commented Feb 18, 2023 via email

@Neltherion
Copy link
Author

Thanks, I was really looking for approach #2😁 Is it possible and if it is, any hints on where to start?

@rom1504
Copy link
Collaborator

rom1504 commented Feb 18, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants