Skip to content
View 1duo's full-sized avatar
:octocat:
:octocat:
  • San Fransisco Bay Area

Organizations

@ucdavis @gunrock @conda-forge

Block or report 1duo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 507 64 Updated Mar 24, 2025

LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.

C++ 328 31 Updated Mar 25, 2025
C++ 22 Updated Mar 16, 2025

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 831 335 Updated Mar 24, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 16,991 1,886 Updated Feb 23, 2025

ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.

Python 327 59 Updated Mar 25, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,014 614 Updated Aug 18, 2024
Python 1,331 102 Updated Feb 15, 2025

ModernBERT model optimized for Apple Neural Engine.

Python 23 1 Updated Jan 10, 2025

General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.

Zig 38,210 2,758 Updated Mar 25, 2025

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 274 27 Updated Jan 15, 2025

A Simple Virtual Computer

Rust 332 14 Updated Mar 11, 2025

CUDA/Metal accelerated language model inference

C 532 23 Updated Mar 9, 2025

JSON for Classic C++

C++ 706 27 Updated Dec 7, 2024

Book in Progress

C 334 17 Updated Dec 27, 2024

A playbook for systematically maximizing the performance of deep learning models.

28,344 2,332 Updated Jun 18, 2024

Modern C++ Programming Course (C++03/11/14/17/20/23/26)

HTML 13,015 894 Updated Feb 28, 2025

Making the community's best AI chat models available to everyone.

Swift 1,939 82 Updated Feb 3, 2025

Making C Look ✨Pretty✨and Lua/Lisp/Python-esque

C 620 11 Updated Nov 21, 2024

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 247 7 Updated Jan 22, 2025

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,645 138 Updated Feb 26, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,781 227 Updated Mar 25, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,484 244 Updated Feb 20, 2025

Official inference framework for 1-bit LLMs

C++ 12,847 907 Updated Feb 18, 2025

The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.

Python 44 Updated Oct 18, 2024

Swift implementation of Flux.1 using mlx-swift

Swift 78 9 Updated Dec 12, 2024

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 473 31 Updated Oct 16, 2024

Modern, extensible Python project management

Python 6,443 325 Updated Mar 4, 2025

llama3.np is a pure NumPy implementation for Llama 3 model.

Python 977 80 Updated Jun 2, 2024

llama3.cuda is a pure C/CUDA implementation for Llama 3 model.

Cuda 330 24 Updated Jun 4, 2024
Next
Showing results