Activity
Rolling back previous change: optimum-quanto is not applicable to pub…
Rolling back previous change: optimum-quanto is not applicable to pub…
Replaced quanto with optimum-quanto so HF-Waitress does not error out…
Replaced quanto with optimum-quanto so HF-Waitress does not error out…
Updated transformers version from 4.44.0 to 4.45.2
Updated transformers version from 4.44.0 to 4.45.2
BUG FIX: default defined for use_flash_attention_2
BUG FIX: default defined for use_flash_attention_2
1) Changed encoding to UTF-8 2) Removed dependencies: nvidia-cuda-run…
1) Changed encoding to UTF-8 2) Removed dependencies: nvidia-cuda-run…
BUG FIX: Correctly displaying LLM name in InfoBox when swapping LLMs …
BUG FIX: Correctly displaying LLM name in InfoBox when swapping LLMs …
CUDA GPU cache emptied post response
CUDA GPU cache emptied post response
1) BUG FIX: LLM_CHANGE_RELOAD_TRIGGER_SET correctly reset for llama.c…
1) BUG FIX: LLM_CHANGE_RELOAD_TRIGGER_SET correctly reset for llama.c…
Force push
updated requirements to address deployment troubles
updated requirements to address deployment troubles
Major update: Refactored /completions_stream to only redirect TextStr…
Major update: Refactored /completions_stream to only redirect TextStr…
fetch_file_list_for_vector_db is now filepath agnostic
fetch_file_list_for_vector_db is now filepath agnostic
Major styling update: 1) New universal font-family: Inter 2) New glas…
Major styling update: 1) New universal font-family: Inter 2) New glas…
BUG FIX: HQQ quantization would error out if torch.dtype (dataType) w…
BUG FIX: HQQ quantization would error out if torch.dtype (dataType) w…
Removed unused imports in HF-Waitress server.
Removed unused imports in HF-Waitress server.
BUG FIX: Removed test-prints for document-chunking that could sometim…
BUG FIX: Removed test-prints for document-chunking that could sometim…
Refined health-check error reporting
Refined health-check error reporting
Enhanced HF-Waitress LLM Management: Add new model_ids, search-filter…
Enhanced HF-Waitress LLM Management: Add new model_ids, search-filter…
Removed troublesome and unnecessary dependency install>=1.3.5
Removed troublesome and unnecessary dependency install>=1.3.5