Test New LLMs (Llama2, CodeLlama, etc.) on Chat-UI? #1

krrishdholakia · 2023-09-29T21:52:38Z

Notice you forked chat-ui. if you're trying to test other LLMs (codellama, wizardcoder, etc.) with it, I just wrote a 1-click proxy to translate openai calls to huggingface, anthropic, togetherai, etc. api calls.

code

$ pip install litellm

$ litellm --model huggingface/bigcode/starcoder

#INFO:     Uvicorn running on http://0.0.0.0:8000

>> openai.api_base = "http://0.0.0.0:8000"

Here's the PR on adding openai to chat-ui: huggingface#452

I'd love to know if this solves a problem for you

…gface#208)

Co-authored-by: Eliott C <[email protected]>

* fix deepestChild function throwing error * let -> const * remove unecessary var

close huggingface#203 * added svg favicon and updated link * included png favicon as fallback sizes 16 and 32 * removed sizes to avoid duplicates

@Grsmto

* replace EthicsModal by LoginModal and start binding auth logic * start implementing login modal + give a try to @auth/sveltekit using Github oauth as a test * start custom auth implementation instead of auth.js for consistency with moon-landing * fetch user data from provider * add migration from anonymous to user + bind frontend * add missing auth secret + only migrate conversations for pre-existing users * remove email scope as it's not needed Co-authored-by: Eliott C. <[email protected]> * no need to define .well-known path Co-authored-by: Eliott C. <[email protected]> * use env var for hf hub website url * move sessionId to signature on csrf token exchange * remove ethic modal check and set default settings on new users * anonymous users can read only + modal on write tries * refresh session cookie when existing user signin again rather than use the old one * allow users to keep using the app without loging-in if env var is not present * typo * handle denied login * do not use a form action for login as there is nothing post-ed * use requiresUser instead of env var to define login modal or not * move back to a form action for login so user can't be linked to /login directly * show login modal even for pre-existing users * fix logic of account creation/updates * settings insertOne instead of updateOne * fix missing userId to settings creation * fix missing updatedAt when updating settings of pre-existing users * show login modal for everyone + add comments * fix login modal condition for both required/not required login * 🔨 * bring back missing form values in login modal * refactor default settings spread around to a constant * missing default settings * typo * rename a bunch of things to remove SSO references * always migrate conversations * remove unneeded sha256 Node specific function, replace with browser crypto API * fix typings * Update src/lib/components/LoginModal.svelte * use authCondition() in callback * add logout * 🐛 Fix signout cc @Grsmto, because "path" of the cookie should be "/" * fixup! 🐛 Fix signout * sign out button ui * add hf logo in the button --------- Co-authored-by: Eliott C. <[email protected]> Co-authored-by: Victor Mustar <[email protected]>

Co-authored-by: Eliott C. <[email protected]>

@Grsmto

* replace EthicsModal by LoginModal and start binding auth logic * start implementing login modal + give a try to @auth/sveltekit using Github oauth as a test * start custom auth implementation instead of auth.js for consistency with moon-landing * fetch user data from provider * add migration from anonymous to user + bind frontend * add missing auth secret + only migrate conversations for pre-existing users * remove email scope as it's not needed Co-authored-by: Eliott C. <[email protected]> * no need to define .well-known path Co-authored-by: Eliott C. <[email protected]> * use env var for hf hub website url * move sessionId to signature on csrf token exchange * remove ethic modal check and set default settings on new users * anonymous users can read only + modal on write tries * refresh session cookie when existing user signin again rather than use the old one * allow users to keep using the app without loging-in if env var is not present * typo * handle denied login * do not use a form action for login as there is nothing post-ed * use requiresUser instead of env var to define login modal or not * move back to a form action for login so user can't be linked to /login directly * show login modal even for pre-existing users * fix logic of account creation/updates * settings insertOne instead of updateOne * fix missing userId to settings creation * fix missing updatedAt when updating settings of pre-existing users * show login modal for everyone + add comments * fix login modal condition for both required/not required login * 🔨 * bring back missing form values in login modal * refactor default settings spread around to a constant * missing default settings * typo * rename a bunch of things to remove SSO references * always migrate conversations * remove unneeded sha256 Node specific function, replace with browser crypto API * fix typings * Update src/lib/components/LoginModal.svelte * use authCondition() in callback * add logout * 🐛 Fix signout cc @Grsmto, because "path" of the cookie should be "/" * fixup! 🐛 Fix signout * add basic test setup with Vitest + tests of login DB updates * add test for cookie * try to run tests on CI * try to fix docker compose not able to ignore .env file * oopsy * moved back .env before running tests * add comment about docker compose .env workaround * try to run mongo directly from github actions instead of docker-compose * oopsy * do we need to wait for mongo? * move tested code back to relevant place + rename * fix linting * revert unnecessary change --------- Co-authored-by: Eliott C. <[email protected]>

* add thumb up/down voting system for messages * make like/dislike buttons toggle + bind to server * refactor vote API to better endpoint structure * set score to undefined rather than 0 when toggled * throw if message is not found + refactor retry dispatch * fix undefined class * Only make the buttons invisible if there's no score Co-authored-by: Eliott C. <[email protected]> * only allow thumb up/down if user is the author of the messages * always show thumbs up/down when voted * use MongoDB instead of mutating messages array in code * fix typings * fix linting issue * refactor code to throw before ifs * add auth logic to vote API endpoint * lint fix after merge conflict * on mobile only show thumbs on top + increase spacing between messages * fix thumbs always showing on mobile --------- Co-authored-by: coyotte508 <[email protected]>

* 🐛 Fix settings modal not resetting its state should not mutate passed prop * make variable dynamic in case it's updated from outside Co-authored-by: Eliott C. <[email protected]> * Revert "make variable dynamic in case it's updated from outside" This reverts commit ba1938e. --------- Co-authored-by: Eliott C. <[email protected]>

* ✨ add conversations deletion to settings modal huggingface#233 * remove inert attribute from app only if last modal closed * make the button an underlined text instead * prepare for no-js * ui --------- Co-authored-by: Victor Mustar <[email protected]>

* fix retry not working after vote changes * once assistant answer is received, invalidate the messages so we get the update id * pass response message id from client to server * remove url dependency

* fix stop btn * use media query

Co-authored-by: coyotte508 <[email protected]>

…gface#259) * Add incremental build + multi layer docker for size reduction * Update .dockerignore * fix cache miss * pr comment .gitignore --------- Co-authored-by: Eliott C <[email protected]>

…ce#302)

* web search retries * remove test error lol

…uggingface#332)

…ce#319) * Add support for HF endpoint for summary * add fail-safe for summarization

* add optional timestamp field to messages * Add a `hashConv` function that only uses a subset of the message for hashing

* Add ability to define custom model/dataset URLs * lint --------- Co-authored-by: Nathan Sarrazin <[email protected]>

* Update README.md * Update README.md Co-authored-by: Julien Chaumond <[email protected]> * Align with header * lint * fixed markdown table of content --------- Co-authored-by: Julien Chaumond <[email protected]> Co-authored-by: Nathan Sarrazin <[email protected]>

* disable login on first message * update banner here too * modal wording tweaks * prevent NaN --------- Co-authored-by: Victor Mustar <[email protected]>

This reverts commit 6183fe7.

* disable login on first message * update banner here too * modal wording tweaks * prevent NaN * fix login wall * fix flicker * lint * put modal text behind login check * fix bug with sending messages without login * fix misalignment between ui and api * fix data update on disable login --------- Co-authored-by: Nathan Sarrazin <[email protected]>

This reverts commit 7767757.

* Update README.md * Update README.md Co-authored-by: Julien Chaumond <[email protected]> * Update README.md --------- Co-authored-by: Julien Chaumond <[email protected]>

…face#374)

The userMessageToken, assistantMessageToken, messageEndToken, and parameters.stop settings in `MODELS` do not have to be a token. They can be any string.

* rm open assistant branding * Update SettingsModal.svelte * make settings work with a dynamic list of models * fixed types --------- Co-authored-by: Nathan Sarrazin <[email protected]>

The chat generation removes parameters.stop and <|endoftext|> from the generated text. And additionally trims trailing whitespace. This PR copies that behavior to the summarize functionality, when the summary is produced by a the chat model.

* allow different user and assistant end-token For models like Llama2, the EndToken is not the same for a userMessage and an assistantMessage. This implements `userMessageEndToken` and `assistantMessageEndToken` which overwrites the messageEndToken behavior. This PR also allows empty strings as userMessageToken and assistantMessageToken and makes this the default. This adds additional flexibility, which is required in the case of Llama2 where the first userMessage is effectively different because of the system message. Note that because `userMessageEndToken` and `assistantMessageToken` are nearly always concatenated, it is almost redundant to have both. The exception is `generateQuery` for websearch which have several consecutive user messages. * Make model branding customizable based on env var (huggingface#345) * rm open assistant branding * Update SettingsModal.svelte * make settings work with a dynamic list of models * fixed types --------- Co-authored-by: Nathan Sarrazin <[email protected]> * trim and remove stop-suffixes from summary (huggingface#369) The chat generation removes parameters.stop and <|endoftext|> from the generated text. And additionally trims trailing whitespace. This PR copies that behavior to the summarize functionality, when the summary is produced by a the chat model. * add a login button when users are logged out (huggingface#381) * add fallback to message end token if there's no specified tokens for user & assistant --------- Co-authored-by: Florian Zimmermeister <[email protected]> Co-authored-by: Nathan Sarrazin <[email protected]>

* Use modelUrl instead of building it from model name * Preserve compatibility with optional modelUrl config Use modelUrl if informed, else use the previous behavior.

symsmith and others added 30 commits May 12, 2023 10:38

Improve style and accessibility in the settings modal (huggingface#206)

17d2394

🧑‍💻 Explicit error message when MongoDB configuration lacking (huggin…

8bb1351

…gface#208)

Break long word in user message (huggingface#209)

dbd3d72

Co-authored-by: Eliott C <[email protected]>

📌 Specifiy packageManager to prevent yarn accidents (huggingface#211)

0db6ce6

Fix long word in assistant message (huggingface#214)

ca616cc

fix deepestChild function throwing error (huggingface#213)

64263c4

* fix deepestChild function throwing error * let -> const * remove unecessary var

added svg favicon (huggingface#205)

55a1bbc

close huggingface#203 * added svg favicon and updated link * included png favicon as fallback sizes 16 and 32 * removed sizes to avoid duplicates

💡 parameters is no longer required for a model

2ac7bd0

Update documentation on local inference

31ef570

Announcement banners Configurable with .env vars (huggingface#222)

3dbdd6a

Co-authored-by: Eliott C. <[email protected]>

Update PRIVACY.md

a26aaaa

Bump to v0.2 (huggingface#229)

0c599d2

🐛 Fix "signin with HF" within space + CSRF (huggingface#236)

767afa7

🐛 Add missing classes for pre tag (huggingface#220)

26ccb67

misc ui

fd5e4ef

🐛 Vote not working with new messages (huggingface#249)

a974db9

* fix retry not working after vote changes * once assistant answer is received, invalidate the messages so we get the update id * pass response message id from client to server * remove url dependency

Fix stop generating button (huggingface#244)

293ff91

* fix stop btn * use media query

⚡️ Improve docker incremental build time (huggingface#142)

ff2db2e

🔧 Add "directConnection" option to MongoDB (huggingface#260)

1b9697f

🥅 Display OIDC error properly (huggingface#261)

101f9ef

feat openid login with google (huggingface#250)

fa3b3b4

Co-authored-by: coyotte508 <[email protected]>

Export to parquet: also export score (huggingface#265)

9658717

Add incremental build + multi layer docker for size reduction (huggin…

74532d2

…gface#259) * Add incremental build + multi layer docker for size reduction * Update .dockerignore * fix cache miss * pr comment .gitignore --------- Co-authored-by: Eliott C <[email protected]>

🐛 Fix export of convos (huggingface#267)

aa125df

🩹 Make preferred_username optional

002a2a0

nsarrazin and others added 30 commits June 19, 2023 09:24

Fix README linting & add details about auth

e34af36

add a readme section about theming

7457e8c

Added Serper.dev API as an alternative web search provider (huggingfa…

6f7b315

…ce#302)

add details about websearch to README

b46dc11

very basic rate limiter (huggingface#320)

922b1b2

Add support for websearch retries (huggingface#318)

0aa57de

* web search retries * remove test error lol

loader dots fix

fb55900

feat: factor out HF_API_ROOT to allow different inference endpoints (h…

3baa389

…uggingface#332)

Add support for HF summarization endpoint in the websearch (huggingfa…

10d1ab5

…ce#319) * Add support for HF endpoint for summary * add fail-safe for summarization

Add optional timestamps to messages (huggingface#294)

1eff97d

* add optional timestamp field to messages * Add a `hashConv` function that only uses a subset of the message for hashing

Add ability to define custom model/dataset URLs (huggingface#347)

ce2231f

* Add ability to define custom model/dataset URLs * lint --------- Co-authored-by: Nathan Sarrazin <[email protected]>

bump version to 0.4 (huggingface#353)

a38cbb5

Update README.md (huggingface#354)

479dbfa

Option to disable login on first N messages (huggingface#352)

6183fe7

* disable login on first message * update banner here too * modal wording tweaks * prevent NaN --------- Co-authored-by: Victor Mustar <[email protected]>

Revert "Option to disable login on first N messages (huggingface#352)"

0a662b7

This reverts commit 6183fe7.

support rate limiting based on user IP (huggingface#342)

7767757

Revert "support rate limiting based on user IP (huggingface#342)"

ac291a6

This reverts commit 7767757.

Update README.md (huggingface#359)

569bde3

Added access token note (huggingface#360)

a935f0a

* Update README.md * Update README.md Co-authored-by: Julien Chaumond <[email protected]> * Update README.md --------- Co-authored-by: Julien Chaumond <[email protected]>

Update /privacy and other content following Llama v2 release (hugging…

7dd8724

…face#374)

Clarify that model 'tokens' are not actual tokens (huggingface#367)

19db9db

The userMessageToken, assistantMessageToken, messageEndToken, and parameters.stop settings in `MODELS` do not have to be a token. They can be any string.

Attempt to clarify how hosted API ≠ local endpoint (huggingface#373)

932ee7e

Make model branding customizable based on env var (huggingface#345)

54e8a52

* rm open assistant branding * Update SettingsModal.svelte * make settings work with a dynamic list of models * fixed types --------- Co-authored-by: Nathan Sarrazin <[email protected]>

add a login button when users are logged out (huggingface#381)

8fa7bd9

Leverage model link to modelUrl when informed (huggingface#385)

0ad340e

* Use modelUrl instead of building it from model name * Preserve compatibility with optional modelUrl config Use modelUrl if informed, else use the previous behavior.

Update README.md

49caedf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test New LLMs (Llama2, CodeLlama, etc.) on Chat-UI? #1

Test New LLMs (Llama2, CodeLlama, etc.) on Chat-UI? #1

krrishdholakia commented Sep 29, 2023

Test New LLMs (Llama2, CodeLlama, etc.) on Chat-UI? #1

Are you sure you want to change the base?

Test New LLMs (Llama2, CodeLlama, etc.) on Chat-UI? #1

Conversation

krrishdholakia commented Sep 29, 2023