-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(w02_t03) Nemenyi Test Unclear #15
Comments
From Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." The Journal of Machine Learning Research 7 (2006): 1-30: Critical values q_\alpha are based on the Studentized range statistic divided by √2. I think this should be made more clear in Post-Hoc Test II page, as critical difference is actually what we are comparing against. So, we find a mean rank for each algorithm, and then connect them in the graph if their mean is less than the critical difference. If two algorithms are not connected, their performance is different. In the slides, it is stated that lower rank can be considered better, but I think "lower" rank is an ambiguous term as it is more intuitive to think rank 1 is better than rank 2. Maybe it should say rank closer to 1 is the better algorithm. |
I think the difference (or similarity?) between Nemenyi and Bonferroni-Dunn test should be explained in more detail. Again from Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." The Journal of Machine Learning Research 7 (2006): 1-30: The tests differ in the way they adjust the value of α to compensate for multiple comparisons. The Bonferroni-Dunn test (Dunn, 1961) controls the family-wise error rate by dividing α by the number of comparisons made (k−1, in our case). The alternative way to compute the same test is to calculate the CD using the same equation as for the Nemenyi test, but using the critical values for α/(k−1) (for convenience, they are given in Table 5(b)). |
Thanks for providing this valuable feedback. @larskotthoff @berndbischl is this already addressed in the new slides of w02_t03? Or can/should we point to further material here? |
This is not addressed -- can we do this for the next iteration? It doesn't sound like it's super urgent. |
Sure. |
Following things should be clear
q.alpha = qtukey(1 - 0.05, k, Inf) / sqrt(2L); cd.nemenyi = q.alpha * sqrt(k * (k + 1L) / (6L * n))
)The text was updated successfully, but these errors were encountered: