1duo

Yuduo Wu 1duo

ML Systems & Compilers, LLMs @ .

174 followers · 79 following

San Fransisco Bay Area

Achievements

x2 x2

Achievements

x2 x2

Organizations

Starred repositories

google-ai-edge / ai-edge-torch

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 507 64 Updated Mar 24, 2025

google-ai-edge / LiteRT

LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.

C++ 328 31 Updated Mar 25, 2025

mk1-project / quickreduce

C++ 22 Updated Mar 16, 2025

onnx / onnx-mlir

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 831 335 Updated Mar 24, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 16,991 1,886 Updated Feb 23, 2025

microsoft / onnxscript

ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.

Python 327 59 Updated Mar 25, 2025

adam-maj / tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,014 614 Updated Aug 18, 2024

Om-Alve / smolGPT

Python 1,331 102 Updated Feb 15, 2025

smpanaro / ModernBERT-AppleNeuralEngine

ModernBERT model optimized for Apple Neural Engine.

Python 23 1 Updated Jan 10, 2025

ziglang / zig

General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.

Zig 38,210 2,758 Updated Mar 25, 2025

andrewkchan / yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 274 27 Updated Jan 15, 2025

JanNeuendorf / SVC16

A Simple Virtual Computer

Rust 332 14 Updated Mar 11, 2025

zeux / calm

CUDA/Metal accelerated language model inference

C 532 23 Updated Mar 9, 2025

jart / json.cpp

JSON for Classic C++

C++ 706 27 Updated Dec 7, 2024

mohitmishra786 / myJourneyOfBuildingOS

Book in Progress

C 334 17 Updated Dec 27, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

28,344 2,332 Updated Jun 18, 2024

federico-busato / Modern-CPP-Programming

Modern C++ Programming Course (C++03/11/14/17/20/23/26)

HTML 13,015 894 Updated Feb 28, 2025

huggingface / chat-macOS

Making the community's best AI chat models available to everyone.

Swift 1,939 82 Updated Feb 3, 2025

aartaka / pretty.c

Making C Look ✨Pretty✨and Lua/Lisp/Python-esque

C 620 11 Updated Nov 21, 2024

mit-han-lab / vila-u

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 247 7 Updated Jan 22, 2025

usefulsensors / moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,645 138 Updated Feb 26, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,781 227 Updated Mar 25, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,484 244 Updated Feb 20, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,847 907 Updated Feb 18, 2025

sail-sg / SimLayerKV

The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.

Python 44 Updated Oct 18, 2024

mzbac / flux.swift

Swift implementation of Flux.1 using mlx-swift

Swift 78 9 Updated Dec 12, 2024

mit-han-lab / hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 473 31 Updated Oct 16, 2024

pypa / hatch

Modern, extensible Python project management

Python 6,443 325 Updated Mar 4, 2025

likejazz / llama3.np

llama3.np is a pure NumPy implementation for Llama 3 model.

Python 977 80 Updated Jun 2, 2024

likejazz / llama3.cuda

llama3.cuda is a pure C/CUDA implementation for Llama 3 model.

Cuda 330 24 Updated Jun 4, 2024

Yuduo Wu 1duo

Organizations

Starred repositories

convolutional-networks

federated-learning