Hi,
I'm using Hugging Face libraries to run LiLT.
How can I decrease inference time? Which code should I use?
I've already tried BetterTransformer (Optimum) and ONNX, but neither of them accepts the LiLT model.
- BetterTransformer:
NotImplementedError: The model type lilt is not yet supported to be used with BetterTransformer.
- ONNX:
KeyError: "lilt is not supported yet.
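To compare any suggested optimization against the current setup, a baseline latency measurement could be sketched roughly as follows. This is only a sketch: `measure_latency` and `placeholder_forward` are hypothetical names introduced here for illustration, and `placeholder_forward` is a stand-in that would be replaced by the actual LiLT forward pass.

```python
import statistics
import time


def measure_latency(fn, warmup=3, runs=20):
    """Return (mean, stdev) of wall-clock seconds per call of fn()."""
    # Warm-up calls are not timed, so one-off setup costs do not skew the result.
    for _ in range(warmup):
        fn()
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)
    return statistics.mean(timings), statistics.stdev(timings)


def placeholder_forward():
    # Hypothetical stand-in for the LiLT forward pass (assumption, not real model code).
    sum(i * i for i in range(10_000))


if __name__ == "__main__":
    mean_s, stdev_s = measure_latency(placeholder_forward)
    print(f"mean: {mean_s * 1000:.3f} ms, stdev: {stdev_s * 1000:.3f} ms")
```

Running the same harness before and after a change would show whether it actually reduces the per-call time.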
Thank you.
Note: I asked this question here, too: NielsRogge/Transformers-Tutorials#284