-
Notifications
You must be signed in to change notification settings - Fork 141
Issues: microsoft/onnxruntime-genai
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
C examples refer to 0.4.0 in rel-0.5.2
documentation and samples
Improvements or additions to documentation
#1157
opened Dec 17, 2024 by
natke
MAIN BRANCH CONTAINS API CHANGES
0.6.0
bug
Something isn't working
question
Further information is requested
release
#1142
opened Dec 11, 2024 by
aciddelgado
Error while converting Llama 3B fp16 gguf model
bug
Something isn't working
#1137
opened Dec 10, 2024 by
rakshit2020
.Net How to free GPU memory after each inference
bug
Something isn't working
enhancement
New feature or request
performance
#1131
opened Dec 9, 2024 by
strikene
0.5.2 DML 2x to 4x Slower than 0.4.0 (Big regression)
ep:DML
performance
#1114
opened Dec 3, 2024 by
elephantpanda
0.5.2 GPU crashes if initial input is 360 zeros.
crash
ep:DML
#1113
opened Dec 3, 2024 by
elephantpanda
Bug DMLFusedNode_0_0 on second token in 0.5.2 (DML) (Wrong tensor shape)
ep:DML
#1112
opened Dec 3, 2024 by
elephantpanda
.Net After updating to .5, Phi3.5Mini outputs some meaningless characters
model quality
#1109
opened Nov 30, 2024 by
strikene
onnxruntime-genai
generation speed very slow on int4
performance
#1098
opened Nov 23, 2024 by
tarekziade
awq example runs into error with llama 3.2 3b due to embedding layer
documentation and samples
Improvements or additions to documentation
ep:DML
#1089
opened Nov 22, 2024 by
tranlm
Python Phi3Vision Sample Error when divided into multiple Functions
documentation and samples
Improvements or additions to documentation
#1068
opened Nov 17, 2024 by
nmoeller
The onnxruntime-genai.dll(0.5.1.0) crashes when the windows C++ application is closed.
crash
platform:windows
#1067
opened Nov 16, 2024 by
luomaojiang2016
DirectML Execution Provider Error: Unable to load D3D12Core.dll
ep:DML
#1054
opened Nov 9, 2024 by
sjpritchard
Previous Next
ProTip!
Adding no:label will show everything without a label.