Skip to content

llamafile v0.8.17

Latest
Compare
Choose a tag to compare
@jart jart released this 01 Dec 01:00
· 14 commits to main since this release
c88f2d3

llamafiler has a new web UI which supports two modes of operation:
chatbot and raw completion. Its syntax highlighting is just as advanced
as the CLI chatbot. It looks much nicer than the old web ui. In a future
release, llamafiler will be folded into llamafile to replace the old server.

  • 988c9ec Introduce raw completions web ui
  • 241bf21 Introduce /v1/completions endpoint in new server
  • 6d89f8f Add binary safety check to server
  • d18ddf1 Add redo button to new web ui
  • bc82424 Add settings modal to web ui
  • bb917bd Add vision model support to new server
  • 4c7b7d5 Implement data URI parser
  • fb4b3e6 Fix JSON parser bug
  • 9d6f89f Improve look and printability of new web ui
  • 25b6910 Make chatbot ui more printer friendly
  • 30518ca Respond to HTTP OPTIONS requests
  • 41abfa3 Work around multiple image handling
  • 35bc088 Make default system prompt configurable on web
  • 28c8e22 Scale and decimate images as needed in browser
  • 14713b5 Get basic chatbot web gui working in llamafiler
  • ef08074 Start porting syntax highlighter to JavaScript
  • fdfdb13 Port remaining highlighting code to javascript

The following improvements have been made to our terminal chatbot.

  • 12c3761 Make CLI chatbot work better with base models
  • e5c0921 Improve VT100 support
  • 4b61791 Fix VT102 support
  • d25c077 Introduce /upload and /forget commands to chatbot
  • 880ebc7 Handle empty system prompt better in cli chatbot

General improvements to this project.

  • f581c40 Fix futex prototype
  • 54d3c72 Make LLaVA fast again
  • 01b8d49 Remove n-gpu-layer limitation (#534)
  • 566cdc1 Improve Gemma system prompt generation
  • 46284fe Reduce attack surface of stb_image
  • 9bb262b Log CUDA kernel vs. runtime versions

Syntax highlighting improvements for chatbot and web ui.

  • d979a1c Add BNF syntax highlighting
  • 4a8311a Add cmake syntax highlighting
  • 40e92cf Add Ocaml syntax highlighting
  • 0995343 Add more Clojure keywords
  • 0068a37 Make D syntax highlighting better
  • 0965a4b Make some markdown improvements
  • 9b96502 Improve JS/HTML syntax highlighting
  • c0622da Put more work into markdown rendering
  • fa1c98f Improve markdown to html rendering
  • 8915432 Further improve markdown to html
  • d25fa3a Improve highlighting in new web ui
  • f5a0bd4 Fix JS regex highlighting issue
  • 2807ae6 Improve Ada syntax highlighting
  • d30da30 Syntax highlight D properly
  • 33a057e Improve Ruby some more
  • 5b0fff1 Improve Ruby syntax highlighting
  • 8413a21 Fix Ruby builtins in web gui

The latest cosmopolitan upgrade introduces a new more powerful syntax
for your .args files. They're now parsed more similarly to the shell,
with support for C style escaping in double-quoted strings. You can also
now add shell-style comments to .args files too. See tool/args/args2.c
in the cosmopolitan codebase for the definitive reference.

  • fb59488 Upgrade to Cosmo v3.9.7
  • 21af0bf Import upstream bestline changes

The following example of the new .args file syntax is provided:

# specify model
-m Qwen2.5-Coder-34B-Instruct.Q6_K.gguf

# prevent flags below from being changed
...

# specify system prompt
--system-prompt "\
you are a friendly ai assistant\n
your job is to be helpful and intelligent"

# hide some stuff from user interfaces
--nologo
--no-display-prompt

You can put .args files inside llamafile, llamafiler, and whisperfile
using the zipalign program.

The following screenshots are provided of the llamafiler web ui.

image

image

image

image