Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exp16: Evaluate NVP_6 #43

Closed
Micky774 opened this issue Mar 2, 2020 · 5 comments
Closed

Exp16: Evaluate NVP_6 #43

Micky774 opened this issue Mar 2, 2020 · 5 comments
Labels
appearance Appearance module experiment This is an experiment. High Priority

Comments

@Micky774
Copy link
Collaborator

Micky774 commented Mar 2, 2020

Using the same hyper-parameters as in experiment 15 (#40), evaluate NVP_6 in comparison to NVP_5 and NVP_4

@Micky774 Micky774 added experiment design For experiments that need to be designed. appearance Appearance module experiment This is an experiment. High Priority trial ready This experiment is ready to be run and removed experiment design For experiments that need to be designed. labels Mar 2, 2020
@Micky774
Copy link
Collaborator Author

Micky774 commented Mar 2, 2020

Scripted and ready to run.

@Micky774
Copy link
Collaborator Author

Micky774 commented Mar 3, 2020

Extending NVP_6 run to 120 epochs to achieve comparable training time to NVP_4

@Micky774 Micky774 removed the trial ready This experiment is ready to be run label Mar 4, 2020
@Micky774
Copy link
Collaborator Author

Micky774 commented Mar 4, 2020

Running

@Micky774
Copy link
Collaborator Author

Micky774 commented Mar 4, 2020

Results show NVP_6 being competitive with NVP_4 by number of weight updates, however NVP_6 performs more weight updates per unit time so it outperforms NVP_4 with respect to time.

However, note that in the extended trial both models run for similar times however NVP_6 does not beat out NVP_4 as was predicted -- it instead reaches a lower bound on loss that it fails to surpass. This may be alleviated by learning rate / batch size scheduling, and should be revisited later.

We should also consider extending NVP_4's run to match the extended NVP_6 run in terms of weight update count to determine whether NVP_4 suffers from the same or similar lower bound on loss convergence.

@Micky774 Micky774 closed this as completed Mar 4, 2020
@Micky774
Copy link
Collaborator Author

Micky774 commented Mar 5, 2020

Extending NVP_4 to run for an additional 30 epochs, reaching 120 epochs total to match NVP_6 to determine whether one has a lower floor of convergence for loss than the other with all else being equal.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
appearance Appearance module experiment This is an experiment. High Priority
Projects
None yet
Development

No branches or pull requests

1 participant