I am trying to use word-level features as described in the paper "Linguistic Input Features Improve Neural Machine Translation", but I couldn't find any clear documentation on how to use them.
I want to use different feature vector sizes for different features.
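To make this concrete, here is a minimal sketch of what I mean, in plain Python rather than any particular toolkit's API. As I understand the paper, each feature gets its own embedding table and the per-feature vectors are concatenated into one input vector. The feature names, vocabularies, and sizes below are placeholders I made up for illustration.

```python
import random

# Hypothetical features, vocabularies, and embedding sizes (illustrative only;
# not taken from the paper or from any toolkit's configuration).
FEATURE_SPECS = {
    "word":  {"vocab": ["the", "cat", "sat"], "dim": 8},
    "pos":   {"vocab": ["DET", "NOUN", "VERB"], "dim": 3},
    "lemma": {"vocab": ["the", "cat", "sit"], "dim": 5},
}

def build_tables(specs, seed=0):
    """Create one randomly initialised embedding table per feature."""
    rng = random.Random(seed)
    tables = {}
    for name, spec in specs.items():
        tables[name] = {
            symbol: [rng.uniform(-0.1, 0.1) for _ in range(spec["dim"])]
            for symbol in spec["vocab"]
        }
    return tables

def embed_token(token, tables):
    """Look up each feature's vector and concatenate them in table order."""
    vector = []
    for name, table in tables.items():
        vector.extend(table[token[name]])
    return vector

tables = build_tables(FEATURE_SPECS)
token = {"word": "sat", "pos": "VERB", "lemma": "sit"}
vec = embed_token(token, tables)
print(len(vec))  # 8 + 3 + 5 = 16
```

So the input vector for each token would have length equal to the sum of the per-feature sizes, and I would like to be able to set each of those sizes independently.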
Can anyone help me out?