Skip to content
Change the repository type filter

All

    Repositories list

    • lmms-engine

      Public
      A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
      Python
      35757120Updated Apr 9, 2026Apr 9, 2026
    • lmms-eval

      Public
      One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
      Python
      Other
      5584k229Updated Apr 9, 2026Apr 9, 2026
    • OneVision-Encoder

      Public
      Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence
      Python
      Apache License 2.0
      1432172Updated Apr 9, 2026Apr 9, 2026
    • multimodal-search-r1

      Public
      [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
      Python
      Apache License 2.0
      2242130Updated Apr 7, 2026Apr 7, 2026
    • EASI

      Public
      Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
      Python
      Apache License 2.0
      79811Updated Apr 3, 2026Apr 3, 2026
    • SimpleStream

      Public
      A simple video streaming baseline that outperforms SOTAs.
      Python
      38000Updated Apr 3, 2026Apr 3, 2026
    • OpenMMReasoner

      Public
      [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
      Python
      Apache License 2.0
      715650Updated Mar 30, 2026Mar 30, 2026
    • LongVT

      Public
      [CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
      Python
      Apache License 2.0
      1321640Updated Mar 27, 2026Mar 27, 2026
    • NEO

      Public
      NEO Series: Native Vision-Language Models from First Principles
      Python
      Apache License 2.0
      2570011Updated Mar 23, 2026Mar 23, 2026
    • free_openclaw

      Public
      Notes for using openclaw
      PowerShell
      0100Updated Mar 12, 2026Mar 12, 2026
    • For AI Agents to post ideas on their owns.
      TypeScript
      MIT License
      0500Updated Mar 1, 2026Mar 1, 2026
    • .github

      Public
      1102Updated Mar 1, 2026Mar 1, 2026
    • Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing
      TypeScript
      MIT License
      1110960Updated Feb 23, 2026Feb 23, 2026
    • An open-source evaluation toolkit to evaluate MLLMs on Spatial Intelligence using the EASI protocol
      Python
      Apache License 2.0
      01800Updated Feb 13, 2026Feb 13, 2026
    • engram

      Public
      Privacy-first AI memory layer - Signal for AI Memory. E2EE, local-first, works with Claude, Cursor, and any MCP-compatible AI.
      TypeScript
      Other
      21900Updated Feb 13, 2026Feb 13, 2026
    • Homebrew tap for LMMs-Lab applications
      Ruby
      0100Updated Jan 29, 2026Jan 29, 2026
    • opencode

      Public
      The open source coding agent.
      TypeScript
      MIT License
      16k100Updated Jan 20, 2026Jan 20, 2026
    • Fully Open Framework for Democratized Multimodal Training
      Python
      Apache License 2.0
      61788397Updated Dec 27, 2025Dec 27, 2025
    • Fully Open Framework for Democratized Multimodal Reinforcement Learning.
      Python
      Apache License 2.0
      44710Updated Dec 19, 2025Dec 19, 2025
    • [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
      Python
      Other
      1119340Updated Sep 26, 2025Sep 26, 2025
    • VideoMMMU

      Public
      Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
      Python
      Other
      36931Updated Sep 5, 2025Sep 5, 2025
    • sglang

      Public
      SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
      Python
      Apache License 2.0
      5.3k300Updated Aug 26, 2025Aug 26, 2025
    • Enjoy the magic of Diffusion models!
      Python
      Apache License 2.0
      1.2k000Updated Aug 23, 2025Aug 23, 2025
    • Deploying High-Performance Lean 4 Server in One Click
      Python
      MIT License
      0901Updated Aug 14, 2025Aug 14, 2025
    • MGPO

      Public
      High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
      05440Updated Jul 23, 2025Jul 23, 2025
    • sae

      Public
      A framework that allows you to apply Sparse AutoEncoder on any models
      Python
      25230Updated Jul 11, 2025Jul 11, 2025
    • Open-source implementation of AlphaEvolve
      Python
      Apache License 2.0
      941200Updated Jun 20, 2025Jun 20, 2025
    • DeepEyes

      Public
      Python
      Apache License 2.0
      73300Updated Jun 16, 2025Jun 16, 2025
    • agent-rl

      Public
      A fork version of verl to support multi-turn tool use and many more agentic tasks.
      Python
      MIT License
      80100Updated Jun 14, 2025Jun 14, 2025
    • Aero-1

      Public
      Python
      Apache License 2.0
      67840Updated May 4, 2025May 4, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.