
Stars
DL-Server
2 repositories
ONNX Serving is a project written in C++ to serve onnx-mlir compiled models with gRPC and other protocols. Benefiting from its C++ implementation, ONNX Serving has very low latency overhead and high t…
A flexible, high-performance serving system for machine learning models