Training NER component using pre-trained tok2vec #13695
Unanswered
mikelgda
asked this question in
Help: Coding & Implementations
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I want to train a NER component from scratch and I suppose it would be faster to re-use the
tok2vec
component fromen_core_web_lg
. However, I'm having issues defining the configuration file for this.First I tried using the recommended configuration from the documentation but I think this also trains the
tok2vec
component, which includes the static vectors fromtok2vec
.I also tried modifying the configuration to source the
tok2vec
component fromen_core_web_lg
and then adding a listener for a NER component created from scratch like thiswhich is taking the recommended configuration and replacing the
tok2vec
by the pre-trained component. However, this shows a bug sincecomponents.tok2vec.model.encoder.width
is not accessible because I am sourcing thetok2vec
component.Last, I also tried using the recommended configuration and adding the
tok2vec
to thefrozen_components
andannotating_components
lists, but I also get a bug because thetok2vec
is not trained although it has the static vectors fromen_core_web_lg
.My main question is how to source the
tok2vec
while training a new NER from scratch and not training thetok2vec
.Thank you!
Beta Was this translation helpful? Give feedback.
All reactions