We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Tile primitives for speedy kernels
Cuda 2k 105
Convolutions for Sequence Modeling
Assembly 876 70
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
Assembly 631 88
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
Aioli: A unified optimization framework for language model data mixing
Understand and test language model architectures on synthetic tasks.
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
Creative interactive views of any dataset.
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
train with kittens!
Loading…