Pinned

  1. multi-gpu-llms Public

    Repository for deploying LLMs across multiple GPUs on distributed Kubernetes nodes

    Jupyter Notebook, 30 stars, 13 forks

  2. gpu-partitioning-guide Public

    Repository demonstrating GPU sharing with time slicing, MPS, MIG, and other techniques

    Jupyter Notebook, 62 stars, 14 forks

  3. litemaas Public

    LiteMaaS is a proof-of-concept application for managing LLM subscriptions, API keys, and usage tracking. It seamlessly integrates with LiteLLM to provide a unified interface for accessing multiple …

    TypeScript, 56 stars, 29 forks

  4. sardeenz Public

    Sardeenz is a proof-of-concept application that lets you load more than one model on a given GPU. You can keep adding models to a GPU until it is fully utilized.

    TypeScript, 50 stars, 7 forks

  5. dynamic-model-autoscaling Public

    Dynamic Model Autoscaling

    Shell, 3 stars, 1 fork

  6. s4 Public

    Super Simple Storage Service

    TypeScript, 51 stars, 12 forks

Repositories

Showing 10 of 145 repositories

People

This organization has no public members.
