    Repositories list

    • open_clip

      Public
      An open source implementation of CLIP.
      Python
      1.2k12k2529 Updated Aug 6, 2025
    • dclm

      Public
      DataComp for Language Models
      HTML
      1211.3k162 Updated Aug 3, 2025
    • evalchemy

      Public
      Automatic evals for LLMs
      HTML
      595091412 Updated Jun 27, 2025
    • HTML
      2600 Updated Jun 15, 2025
    • open_lm

      Public
      A repository for research on medium sized language models.
      Python
      725103535 Updated Jun 6, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k000 Updated May 18, 2025
    • datacomp

      Public
      DataComp: In search of the next generation of multimodal datasets
      Python
      61731273 Updated Apr 28, 2025
    • rtfm

      Public
      Research on Tabular Foundation Models
      Python
      1254120 Updated Dec 13, 2024
    • MixEval

      Public
      The official evaluation suite and dynamic data release for MixEval.
      Python
      41000 Updated Sep 20, 2024
    • An open-source framework for training large multimodal models.
      Python
      3104k455 Updated Aug 31, 2024
    • tabliblib

      Public
      A Python library for processing and filtering TabLib
      Python
      31100 Updated Aug 24, 2024
    • MINT-1T

      Public
      MINT-1T: A one trillion token multimodal interleaved dataset.
      1982110 Updated Jul 31, 2024
    • Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
      Python
      4647780 Updated Jul 15, 2024
    • A benchmark for distribution shift in tabular data
      Python
      1455121 Updated Jun 6, 2024
    • scaling

      Public
      Language models scale reliably with over-training and on downstream tasks
      Jupyter Notebook
      59720 Updated Apr 2, 2024
    • Python
      72600 Updated Mar 21, 2024
    • Editing Models with Task Arithmetic
      Python
      4449090 Updated Jan 11, 2024
    • Python
      25000 Updated Oct 29, 2023
    • patching

      Public
      Patching open-vocabulary models by interpolating weights
      Python
      89100 Updated Sep 28, 2023
    • Python
      2200 Updated Aug 22, 2023
    • LLM training code for MosaicML foundation models
      Python
      573100 Updated Aug 10, 2023
    • CSS
      0300 Updated Jun 2, 2023
    • Simple large-scale training of stable diffusion with multi-node support.
      Python
      913320 Updated May 8, 2023
    • Efficiently process webdatasets
      Python
      0410 Updated Apr 5, 2023
    • Release of ImageNet-Captions
      55000 Updated Jan 20, 2023
    • 0000 Updated Jan 17, 2023
    • Jupyter Notebook
      4710 Updated Nov 3, 2022
    • Python
      12910 Updated Oct 18, 2022
    • wise-ft

      Public
      Robust fine-tuning of zero-shot models
      Python
      7372590 Updated Apr 29, 2022
    • au21

      Public
      Jupyter Notebook
      0100 Updated Nov 8, 2021