Strategy B3 #12
Comments
Hi @june6423, our B3 experiment on Swin Transformer was implemented in the original Swin-Transformer training code, so there is no B3 config in this repo. If you want to implement B3 in this repo instead, the strategy is similar to deit_tiny; you can use the following config for KD (T=4):
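For reference, vanilla KD with temperature T is commonly written as the KL divergence between the temperature-softened teacher and student distributions, scaled by T². Below is a minimal, framework-free sketch of that loss with T=4. The function names are illustrative and do not come from this repo, and the repo's actual loss implementation and its weighting against the cross-entropy term may differ.

```python
import math


def log_softmax(logits, temperature):
    """Numerically stable log-softmax of logits divided by the temperature."""
    scaled = [z / temperature for z in logits]
    max_scaled = max(scaled)
    log_norm = max_scaled + math.log(sum(math.exp(s - max_scaled) for s in scaled))
    return [s - log_norm for s in scaled]


def kd_loss(student_logits, teacher_logits, temperature=4.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Assumption: this is the common Hinton-style formulation, not
    necessarily the exact loss used in this repo.
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    kl = sum(
        math.exp(lt) * (lt - ls)
        for lt, ls in zip(log_p_teacher, log_p_student)
    )
    return temperature ** 2 * kl


if __name__ == "__main__":
    # Hypothetical logits for one sample with three classes.
    print(kd_loss([1.0, 2.0, 3.0], [3.0, 2.0, 1.0], temperature=4.0))
```

The T² factor keeps the gradient magnitude of the soft-target term roughly comparable across temperatures, which is why it usually accompanies the softened KL term.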
Thanks a lot! Now I want to reproduce the results of other KD methods, including RKD and CRD (I am working on Table 5 of the DIST_KD paper, CIFAR-100). However, I could not find the training configs and code for training from scratch or for the other KD methods. I am working on image_classification_sota at version d9662f7. Is there already published code for these settings, or should I implement them myself? Thanks for your effort.
Hi @june6423, how did you manage to get the data for the ImageNet meta folder?
I created the metadata files myself. Make train.txt and val.txt; here is an example of train.txt (file path and class number):
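One way to generate such a file is sketched below. It assumes an ImageNet-style layout with one sub-folder per class, class numbers assigned by sorted folder name, and a space-separated `path label` line format. All of these are assumptions for illustration; the exact path prefix and separator expected by this repo's data loader are not confirmed in this thread, and the helper names are hypothetical.

```python
import os

# Assumption: only these extensions count as images.
IMAGE_EXTENSIONS = (".jpeg", ".jpg", ".png")


def build_meta_lines(split_dir):
    """Return 'class_dir/file_name label' lines for a class-per-folder split."""
    class_names = sorted(
        d for d in os.listdir(split_dir)
        if os.path.isdir(os.path.join(split_dir, d))
    )
    lines = []
    # Class numbers follow the sorted order of the class folder names.
    for label, class_name in enumerate(class_names):
        class_dir = os.path.join(split_dir, class_name)
        for file_name in sorted(os.listdir(class_dir)):
            if file_name.lower().endswith(IMAGE_EXTENSIONS):
                lines.append(f"{class_name}/{file_name} {label}")
    return lines


def write_meta_file(split_dir, out_path):
    """Write the meta lines to out_path and return how many were written."""
    lines = build_meta_lines(split_dir)
    with open(out_path, "w") as f:
        f.write("\n".join(lines) + "\n")
    return len(lines)
```

train.txt and val.txt need to use the same class-to-number mapping. If the validation images are not organized in the same class folders, the labels must be mapped through the training class order rather than generated independently; a mismatch here is one possible cause of near-zero validation accuracy.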
Hi @june6423, thanks for the help. I did that, but my validation accuracy is 0. Did you run into this as well? I am trying to distill knowledge from resnet34 to resnet18.
Greetings!
I read your paper with great interest and am trying to reproduce some of your experiments.
I want to reproduce your vanilla KD setting with strategies B1, B2, and B3 from your DIST_KD paper.
I found the B1 and B2 strategies in your strategies folder, but I couldn't find the B3 setting.
configs/strategies/deit/deit_tiny.yaml appears to be B3, but I'm not sure. Could you give me the B3 setting for vanilla KD with temperature 4?