isaranto
  • Athens, Greece

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 312 45 Updated Feb 27, 2025

Make your functions return something meaningful, typed, and safe!

Python 3,790 124 Updated Feb 27, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 129,984 10,631 Updated Feb 27, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,432 2,741 Updated Feb 27, 2025

Rust tool that supports PVC snapshots across Kubernetes namespaces

Rust 7 1 Updated Oct 18, 2024

A next generation HTTP client for Python. 🦋

Python 13,745 876 Updated Feb 27, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 41,593 1,166 Updated Feb 28, 2025

KServe community docs for contributions and process

Python 12 5 Updated Jan 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,644 5,936 Updated Feb 27, 2025

Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)

Python 109 35 Updated Jun 10, 2024

The easiest repo for building GPT applications.

Python 5 Updated Sep 11, 2023

Self-hosted AI coding assistant

Rust 30,087 1,383 Updated Feb 27, 2025

[Not Actively Maintained] Whitebox is an open source E2E ML monitoring platform with edge capabilities that plays nicely with kubernetes

Python 183 5 Updated Jul 11, 2023

Source code for Twitter's Recommendation Algorithm

Scala 62,959 12,174 Updated Jul 10, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 101,764 16,505 Updated Feb 28, 2025

Transform your pythonic research to an artifact that engineers can deploy easily.

Go 152 13 Updated Apr 21, 2024

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,166 818 Updated Feb 25, 2025

Examples and guides for using the OpenAI API

MDX 62,031 9,997 Updated Feb 20, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,461 2,781 Updated Aug 15, 2024

Github mirror of "machinelearning/liftwing/inference-services" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)

Python 3 Updated Feb 27, 2025

concurrent, cache-efficient, and Dockerfile-agnostic builder toolkit

Go 8,506 1,211 Updated Feb 27, 2025

The Fast Cross-Platform Package Manager

C++ 7,162 383 Updated Feb 27, 2025

Standardized Serverless ML Inference Platform on Kubernetes

Python 3,925 1,118 Updated Feb 26, 2025

A curated list of awesome actions to use on GitHub

25,746 1,514 Updated Sep 1, 2024

Beautiful ridgeline plots in Python

Python 220 8 Updated Feb 24, 2025

The little ASGI framework that shines. 🌟

Python 10,645 971 Updated Feb 22, 2025

Bias Auditing & Fair ML Toolkit

Python 707 120 Updated Sep 11, 2024

Hydra is a framework for elegantly configuring complex applications

Python 9,080 663 Updated Feb 18, 2025

Redis Python client

Python 12,899 2,562 Updated Feb 27, 2025

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

27,751 3,732 Updated Jul 18, 2024