Description
Dear Author,
I really appreciate your work and find it fascinating, and I am thankful that you released your code.
I understand that CLIP4Clip + meanP performs best among the CLIP4Clip variants, ahead of seqTransf, seqLSTM, and tightTransf.
However, I found that the sh scripts in your repository always recommend seqTransf.
Is there any special reason why "sim_header == seqTransf" is the default setting?
I looked at your Table 2 on MSVD, where X-CLIP (ViT-B/32) records an R@1 score of 47.1.
Does this mean that X-CLIP with seqTransf performs better than with the other modes (meanP, tightTransf)?
I cannot find which sim_header was used to obtain the scores in that table.
If X-CLIP + seqTransf is the recommended setting anyway,
is there any special reason why seqTransf outperforms meanP here, unlike in CLIP4Clip?
Sincerely,