You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hello, thanks for your great work High Quality Segmentation.
I want to ask the following 2 questions
I don't see any difference in P position information between images. Even in an image, the information of the variable rel_cell is the same in all locations. So why can CRM generate detailed segmentation masks?
What is the meaning of calculating 3 features in P because it seems to me that it is hard-fixed and concatenated into each feature of the image?
below is the map that I printed out for the variables
The text was updated successfully, but these errors were encountered:
(1) For rel_cell, when the input resolution changes (the target resolution is constant), it is not same. That CRM generates detailed segmentation masks not only results from rel_cell. Other designs also contribute to the final results.
(2) The hard-fixed position information doesn't mean it's unsuitable for representation. Fixed and learnable position information is both used in the implicit representation.
Thanks.
hello, thanks for your great work High Quality Segmentation.
I want to ask the following 2 questions
below is the map that I printed out for the variables
The text was updated successfully, but these errors were encountered: