Is it better to use stratified sampling to divide the validation set? #4

nagisa-eevee · 2021-03-23T05:07:50Z

I noticed that the labels of the validation set are slightly unbalanced. With seed 0 under my environment settings I get: Counter({3: 113, 1: 112, 5: 107, 0: 100, 6: 99, 7: 98, 9: 98, 2: 94, 8: 90, 4: 89}). I haven't tested it yet, but would stratified sampling be a better way to split off the validation set?
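
To make the suggestion concrete, here is a minimal sketch of a stratified split using only the standard library. It is not this repository's code: the function name `stratified_split`, the toy `labels` list (10 classes, 500 samples each), and the validation size of 1000 are all assumptions for illustration. Each class contributes a share of the validation set proportional to its frequency, so a balanced dataset gives equal per-class counts.

```python
import random
from collections import Counter, defaultdict


def stratified_split(labels, val_size, seed=0):
    """Hypothetical helper: return (train_idx, val_idx) so that each class's
    share of the validation set matches its share of the full dataset."""
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)

    # Proportional quota per class, rounded with the largest-remainder rule
    # so the per-class counts sum exactly to val_size.
    n = len(labels)
    quotas = {c: len(ix) * val_size / n for c, ix in by_class.items()}
    counts = {c: int(q) for c, q in quotas.items()}
    leftover = val_size - sum(counts.values())
    by_remainder = sorted(quotas, key=lambda c: quotas[c] - counts[c], reverse=True)
    for c in by_remainder[:leftover]:
        counts[c] += 1

    # Shuffle within each class and take that class's quota for validation.
    val_idx = []
    for c, ix in by_class.items():
        rng.shuffle(ix)
        val_idx.extend(ix[:counts[c]])

    val_set = set(val_idx)
    train_idx = [i for i in range(n) if i not in val_set]
    return train_idx, sorted(val_idx)


# Toy data (assumed, not the real dataset): 10 balanced classes, 5000 samples.
labels = [i % 10 for i in range(5000)]
train_idx, val_idx = stratified_split(labels, val_size=1000, seed=0)
val_counts = Counter(labels[i] for i in val_idx)
print(val_counts)
```

If scikit-learn is already a dependency, `train_test_split(..., stratify=labels)` should give a similar per-class balance without a custom helper.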
