-
Notifications
You must be signed in to change notification settings - Fork 11
Issues: robertknight/rten
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Support prepacked weights for non-MatMul operators
performance
Issues that affect model inference or loading performance
#484
opened Dec 25, 2024 by
robertknight
5 tasks
Support Llama 3 tokenizer (implement
ignore_merges
behavior)
tokenizers
#453
opened Dec 8, 2024 by
robertknight
Align tokenizer pipeline and terminology with Hugging Face tokenizers
tokenizers
#427
opened Dec 2, 2024 by
robertknight
Support vector inputs in MatMulInteger
Spec compliance
Issues with RTen behavior not matching the ONNX specifications
#419
opened Nov 29, 2024 by
robertknight
Support fusing Transpose + MatMul where both inputs are transposed
performance
Issues that affect model inference or loading performance
#398
opened Oct 29, 2024 by
robertknight
Fuse pointwise operations into matmul / convolution operations
performance
Issues that affect model inference or loading performance
#371
opened Sep 21, 2024 by
robertknight
Implement better depthwise convolution kernels
performance
Issues that affect model inference or loading performance
#370
opened Sep 21, 2024 by
robertknight
Adjust default thread count on Apple Silicon systems
performance
Issues that affect model inference or loading performance
#342
opened Sep 2, 2024 by
robertknight
Align ReduceMin / ReduceMax etc. handling of empty tensors with spec
Spec compliance
Issues with RTen behavior not matching the ONNX specifications
#341
opened Sep 1, 2024 by
robertknight
Memoize or precompute subgraphs that depend only on input shapes
#270
opened Jul 5, 2024 by
robertknight
Share implementations for operators based on data type width
#244
opened Jun 21, 2024 by
robertknight
Make unary ops more efficient with non-contiguous inputs
performance
Issues that affect model inference or loading performance
#192
opened May 20, 2024 by
robertknight
1 of 2 tasks
Output a more helpful error if operator is unavailable due to build features
usability
#154
opened May 6, 2024 by
robertknight
Run tests under AddressSanitizer (and possibly other sanitizers)
qa
Quality / correctness checks
#151
opened May 5, 2024 by
robertknight
Validate operator input counts
tooling
Tools for debugging / profiling etc.
#133
opened Apr 29, 2024 by
robertknight
Enable re-using pool across graph executions
performance
Issues that affect model inference or loading performance
#122
opened Apr 26, 2024 by
robertknight
Run tests under WebAssembly in CI
qa
Quality / correctness checks
WebAssembly
#93
opened Apr 14, 2024 by
robertknight
Provide better APIs for working with models that have many inputs / outputs
usability
#71
opened Mar 30, 2024 by
robertknight
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.