
Question on Reproducibility and Hyperparameter Settings #3

Open
zw-SIMM opened this issue Nov 2, 2024 · 0 comments
zw-SIMM commented Nov 2, 2024

I am trying to reproduce the results from the paper using the codebase and default settings in this repository, but I cannot reach the reported performance. Specifically, I trained with a 1:1000 negative-sampling ratio for both molecules and sequences. Across the different scripts (train.py, train_contra.py, train_rnn.py, and train_tfmr.py), the best Top-1-N accuracy I obtained is only around 0.10-0.20, which is significantly lower than the reported result of roughly 0.3.

For each model, I used the following parameters provided in your repository:

--mol_embedding_type unimol --pro_embedding_type esm --batch_size 1000
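
For clarity, a hypothetical invocation showing how I passed these flags (script name and flag values are as quoted above; every other hyperparameter was left at the repository default):

```shell
# Hypothetical command line reflecting the flags quoted in this issue.
# Learning rate, epochs, and optimizer settings were NOT overridden and
# therefore used whatever defaults the repository scripts define.
python train.py \
    --mol_embedding_type unimol \
    --pro_embedding_type esm \
    --batch_size 1000
```

The same flags were used for train_contra.py, train_rnn.py, and train_tfmr.py.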

Questions:
1. Did you use script-specific hyperparameters for each model (e.g., train.py, train_contra.py, train_rnn.py, train_tfmr.py)?
2. Could you please share the detailed hyperparameter settings (e.g., learning rate, batch size, number of epochs, optimizer parameters) used to achieve the reported results?
