Skip to content
View simonrouard's full-sized avatar

Block or report simonrouard

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST

Python 26 3 Updated Dec 24, 2024

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 720 55 Updated Feb 9, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,480 597 Updated Feb 9, 2025

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 521 63 Updated Oct 26, 2024
Python 185 10 Updated Feb 14, 2024

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

Python 104 8 Updated Nov 20, 2021

The PyTorch-based audio source separation toolkit for researchers

Python 2,322 426 Updated Jan 11, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,628 1,121 Updated Apr 24, 2024

Open-Unmix - Music Source Separation for PyTorch

Python 1,322 197 Updated Jun 17, 2024

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,568 214 Updated Nov 29, 2022
Showing results