Different Embeddings for [E1] [/E1] [E2] [/E2] tokens

Hi, I just trained my model locally and checked the results of my trained models against the ones on the `README`. I found that they are different. I believe this is due to the embeddings of the previously mentioned tokens change every time the model is instantiated. For instance, trying with the same phrase, if I instantiated the model and predicted, the output would be different from the next time I instantiated and predicted the same phrase.

I believe in the `__init__` method of the `infer_from_trained` class, with the method `resize_token_embeddings()`at line 83 of the `infer.py` file, the embeddings are being extended to have the 4 extra tokens, but the embeddings are being initialized randomly and this causes the results to vary.

Am I understanding it correctly? Or am I mistaken? Any help would be appreciated.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Different Embeddings for [E1] [/E1] [E2] [/E2] tokens #30

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Different Embeddings for [E1] [/E1] [E2] [/E2] tokens #30

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions