Starred repositories
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Tyk Open Source API Gateway written in Go, supporting REST, GraphQL, TCP and gRPC protocols
WhatsApp redesign made with Kivy/KivyMD
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
Library for translating between 200 languages. Built on 🤗 transformers.
Automagically synchronize subtitles with video.
Speed up a Laravel app by caching the entire response
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
A Golang framework for web artisans. Tribute to Laravel.
Half-Life Advanced Effects (HLAE) is a tool to enrich Source (mainly CS:GO) engine based movie making.
On-demand transcoding origin server for live inputs and static files in Go using ffmpeg. Also with NVIDIA GPU hardware acceleration.
The python library for real-time communication
Restoring old and blurry face photos with AI.
Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Simplified media playback for bigscreen devices
Responsive images while we wait for srcset to finish cooking
Apache Traffic Control is an Open Source implementation of a Content Delivery Network
imagor video thumbnail server in Go and ffmpeg C bindings
Fast, secure image processing server and Go library, using libvips
The core of Membrane Framework, multimedia processing framework written in Elixir
NPM library to generate HLS Live from HLS VOD