Skip to content

Popular repositories Loading

  1. Step-Audio Step-Audio Public

    Python 4.6k 366

  2. Step-Video-T2V Step-Video-T2V Public

    Python 3.1k 328

  3. Step1X-Edit Step1X-Edit Public

    A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

    Python 1.7k 81

  4. Step-Audio2 Step-Audio2 Public

    Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

    Python 1.2k 89

  5. Step1X-3D Step1X-3D Public

    Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

    Python 802 52

  6. Step-Audio-EditX Step-Audio-EditX Public

    A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

    Python 637 41

Repositories

Showing 10 of 20 repositories

Most used topics

Loading…