Popular repositories Loading
-
-
-
Step1X-Edit
Step1X-Edit PublicA SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
-
Step-Audio2
Step-Audio2 PublicStep-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
-
Step-Audio-EditX
Step-Audio-EditX PublicA powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
Repositories
- Step-Audio-R1 Public
stepfun-ai/Step-Audio-R1’s past year of commit activity - Step-Audio-Edit-Benchmark Public
stepfun-ai/Step-Audio-Edit-Benchmark’s past year of commit activity - Step-Audio-EditX Public
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
stepfun-ai/Step-Audio-EditX’s past year of commit activity - NextStep-1 Public
stepfun-ai/NextStep-1’s past year of commit activity - Step-Audio2 Public
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
stepfun-ai/Step-Audio2’s past year of commit activity - Step1X-Edit Public
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
stepfun-ai/Step1X-Edit’s past year of commit activity
Most used topics
Loading…