Could you please add large buffer support for A7X and A8X gpus by default? https://github.com/ggml-org/llama.cpp/pull/20997
Could you please add large buffer support for A7X and A8X gpus by default?
ggml-org/llama.cpp#20997