Updating to latest llama.cpp #2

AshtonOhms · 2023-12-22T00:26:36Z

Hi! I updated the bindings to be compatible with the latest head of llama.cpp master - enough to get the Main.hs example working again.

Summary:

A few functions no longer exist in llama.h: Namely, the *FromModel functions
The old sampleRepetitionPenalty and sampleFrequencyAndPresencePenalties functions are now replaced with the sole sampleRepetitionPenalties function
New function in wrapper.c for new_context_with_model
llama.h's token_to_str has been replaced with token_to_piece. A corresponding wrapper has been added to the Main.hs example

Tested by running the Mixtral MoE model, and it seems to be working properly.
stack run examples -- -m ../mixtral/mixtral-8x7b-instruct-v0.1.Q2_K.gguf -p "Hello, my name is " -t 12

I probably should do a version bump of some kind, but I'm not sure - let me know your thoughts. Cheers!

AshtonOhms added 9 commits December 20, 2023 21:51

[WIP] Update for latest llama.cpp

2ebdf63

It works! Stops + leaks mem after ~2doz tokens though

dd98673

Fixed mem leak in tokenToPiece

ca94008

Populate ModelParams

4021e98

Cleanup, update doc comments

9409bbd

Refactor tokenToPiece as tokenToString in Main

1883545

Re-add test suite to cabal

ce4f92f

Remove sampleFrequencyAndPresencePenalties (no longer exists)

ea9d851

Fixing params

cdb3e39

AshtonOhms changed the title ~~Updating to~~ Updating to latest llama.cpp Dec 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updating to latest llama.cpp #2

Updating to latest llama.cpp #2

Uh oh!

AshtonOhms commented Dec 22, 2023 •

edited

Loading

Uh oh!

Uh oh!

Updating to latest llama.cpp #2

Are you sure you want to change the base?

Updating to latest llama.cpp #2

Uh oh!

Conversation

AshtonOhms commented Dec 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

AshtonOhms commented Dec 22, 2023 •

edited

Loading