ARC Lab, Tencent PCG

All

79 repositories

VerseCrafter
Public
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
world-model
Python
•
Other
•0•31•0•0•Updated Jan 9, 2026Jan 9, 2026
DSR_Suite
Public
Jupyter Notebook
•
Apache License 2.0
•5•50•1•0•Updated Jan 7, 2026Jan 7, 2026
TimeLens
Public
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Python
•
Other
•3•82•4•0•Updated Dec 19, 2025Dec 19, 2025
ColorFlow
Public
The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization". ColorFlow：基于检索增强的图像序列上色
computer-vision image-colorization colorization automatic-colorization
Python
•
Other
•40•452•13•0•Updated Dec 10, 2025Dec 10, 2025
SEED-Voken
Public
SEED-Voken: A Series of Powerful Visual Tokenizers
Python
•
Apache License 2.0
•37•986•3•1•Updated Nov 25, 2025Nov 25, 2025
GenCompositor
Public
Official implementation of the paper "GenCompositor: Generative Video Compositing with Diffusion Transformer"
video-editing diffusion-models diffusion-transformer
Python
•
Other
•6•132•3•0•Updated Nov 24, 2025Nov 24, 2025
ARC-Chapter
Public
Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Apache License 2.0
•1•33•2•0•Updated Nov 19, 2025Nov 19, 2025
BlobCtrl
Public
[SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing
image-editing aigc
Python
•
Other
•3•25•1•0•Updated Nov 14, 2025Nov 14, 2025
RollingForcing
Public
Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
real-time long-context long-video-generation video-diffusion-model efficient-tuning
Python
•
Other
•12•295•8•0•Updated Oct 31, 2025Oct 31, 2025
MindOmni
Public
Python
•
Other
•2•140•2•0•Updated Oct 15, 2025Oct 15, 2025
vllm
Public
vllm for ARC-Hunyuan-Video-7B
Python
•
Apache License 2.0
•0•2•0•5•Updated Oct 6, 2025Oct 6, 2025
GeometryCrafter
Public
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
depth-estimation video-to-4d iccv2025
Python
•
Other
•17•420•3•0•Updated Oct 2, 2025Oct 2, 2025
Moto
Public
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
Python
•
Other
•5•159•6•0•Updated Oct 1, 2025Oct 1, 2025
ARC-Hunyuan-Video-7B
Public
Structured Video Comprehension of Real-World Shorts
Python
•
Other
•7•226•15•0•Updated Sep 21, 2025Sep 21, 2025
AudioStory
Public
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
video-to-audio diffusion-models text-to-audio audio-generation multimodal-large-language-models video-dubbing
Jupyter Notebook
•18•294•3•1•Updated Sep 21, 2025Sep 21, 2025
IC-Custom
Public
[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning
flux application image image-editing image-inpainting image-customization aigc
Python
•
Other
•3•158•1•0•Updated Sep 15, 2025Sep 15, 2025
BrushEdit
Public
[under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
image-editing image-inpainting diffusion-models
Python
•
Other
•29•586•11•0•Updated Sep 3, 2025Sep 3, 2025
ToonComposer
Public
Streamlining Cartoon Production with Generative Post-Keyframing
Python
•
Other
•46•519•8•0•Updated Aug 20, 2025Aug 20, 2025
TokLIP
Public
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
Python
•
Other
•5•233•8•0•Updated Aug 18, 2025Aug 18, 2025
FreeSplatter
Public
[ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
JavaScript
•
Other
•14•215•10•2•Updated Aug 4, 2025Aug 4, 2025
TencentARC.github.io
Public
HTML
•0•0•0•0•Updated Aug 1, 2025Aug 1, 2025
Video-Holmes
Public
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
Python
•
Apache License 2.0
•2•85•2•0•Updated Jul 13, 2025Jul 13, 2025
SEED-Bench-R1
Public
Python
•
Apache License 2.0
•2•96•2•0•Updated Jun 23, 2025Jun 23, 2025
GRPO-CARE
Public
Python
•
Apache License 2.0
•2•80•5•0•Updated Jun 23, 2025Jun 23, 2025
AnimeGamer
Public
[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
Python
•
Other
•28•342•5•1•Updated Apr 9, 2025Apr 9, 2025
VideoPainter
Public
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
video video-editing video-inpainting video-dataset
Python
•
Other
•39•545•15•0•Updated Apr 8, 2025Apr 8, 2025
DiTCtrl
Public
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
Python
•
Other
•9•320•8•0•Updated Mar 30, 2025Mar 30, 2025
DI-PCG
Public
Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
Python
•
Other
•3•133•3•0•Updated Mar 23, 2025Mar 23, 2025
Divot
Public
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
Python
•
Other
•2•86•3•0•Updated Feb 27, 2025Feb 27, 2025
MotionCtrl
Public
Official Code for MotionCtrl [SIGGRAPH 2024]
Python
•
Apache License 2.0
•78•1.5k•29•0•Updated Feb 19, 2025Feb 19, 2025