Stars
Official PyTorch implementation for "Large Language Diffusion Models"
Context parallel attention that accelerates DiT model inference with dynamic caching
A suite of image and video neural tokenizers
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
A customisable 3D platform for agent-based AI research
A collection of papers about transformers for vision. Awesome Transformer with Computer Vision (CV)
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Count the MACs / FLOPs of your PyTorch model.
Simple script for downloading YouTube comments without using the YouTube API
real time face swap and one-click video deepfake with only a single image
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
fenneishi / Deep-Live-Cam
Forked from hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
A feature-rich command-line audio/video downloader
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
A python script that discovers hidden YouTube API clients. Just a research project.
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
The fastest way to create an HTML app
Downloads videos and playlists from YouTube
Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era
Command-line program to download videos from YouTube.com and other video sites
Python Client for Google's Private InnerTube API. Works with YouTube, YouTube Music and more!
Invidious is an alternative front-end to YouTube