Orca-bit
  • Chengdu, China

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

Go 135,201 11,219 Updated Mar 29, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 12,612 1,388 Updated Mar 29, 2025

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 853 165 Updated Dec 30, 2024

Intel® Extension for TensorFlow*

C++ 336 43 Updated Mar 18, 2025

A personal experimental C++ Syntax 2 -> Syntax 1 compiler

C++ 5,696 258 Updated Mar 3, 2025

Go 14 1 Updated Dec 20, 2021

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,073 678 Updated Mar 28, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,771 2,966 Updated Mar 29, 2025

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Python 2,571 505 Updated Mar 29, 2025

Development repository for the Triton language and compiler

MLIR 15,017 1,892 Updated Mar 29, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,526 308 Updated Oct 19, 2024

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,053 528 Updated Mar 29, 2025

An implementation of a deep learning recommendation model (DLRM)

Python 3,847 853 Updated Oct 11, 2024

mal - Make a Lisp

Assembly 10,225 2,600 Updated Dec 23, 2024

LLM training in simple, raw C/CUDA

Cuda 26,165 3,007 Updated Oct 2, 2024

Zerocopy makes zero-cost memory manipulation effortless. We write `unsafe` so you don’t have to.

Rust 1,837 110 Updated Mar 28, 2025

🤖 Just a command runner

Rust 24,546 527 Updated Mar 25, 2025

A Zig language server supporting Zig developers with features like autocomplete and goto definition

Zig 3,635 338 Updated Mar 24, 2025

The build system and package manager for MoonBit

Rust 263 30 Updated Mar 28, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 959 178 Updated Mar 27, 2025

RobustMQ is a next-generation, high-performance, cloud-native, converged message queue that is compatible with multiple mainstream message queuing protocols and has complete Serverless capabilities.

Rust 416 71 Updated Mar 28, 2025

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

Rust 1,538 45 Updated Mar 22, 2025

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on h…

Cuda 141 27 Updated Mar 25, 2025

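AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.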

Jupyter Notebook 13,030 1,873 Updated Mar 26, 2025

Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.

Rust 338 37 Updated Mar 19, 2025

HugeCTR is a high-efficiency GPU framework designed for Click-Through-Rate (CTR) estimation training

C++ 982 200 Updated Mar 23, 2025

Bridging LLM and Recommender System.

Jupyter Notebook 739 68 Updated Mar 22, 2025

Tool for safe ergonomic Rust/C++ interop driven from existing C++ headers

Rust 2,399 156 Updated Mar 5, 2025

Write safer FFI code in Rust without polluting it with unsafe code

Rust 975 43 Updated Mar 26, 2025