claude-shorts

Interactive longform-to-shortform video creator powered by Claude Code. Extracts viral-ready vertical clips from long videos using Claude as the intelligent orchestrator with Remotion-rendered premium animated captions.

How It Works

Claude Code guides you through a 10-step interactive pipeline:

Preflight - Validates input video, checks disk space, detects GPU
Transcribe - GPU-accelerated transcription with word-level timestamps (faster-whisper)
Detect Content - Auto-classifies: talking-head, screen recording, or podcast
Analyze - Claude reads the full transcript and scores 8-12 candidate segments
Present - Shows candidates in a formatted table with scores, hooks, and rationale
Approve - You pick segments, adjust timecodes, choose caption style and platform
Snap Boundaries - Aligns cut points to word boundaries, sentence endings, and audio silences
Prepare - Extracts clips (FFmpeg stream copy) and computes reframe coordinates
Render - Remotion renders 1080x1920 vertical video with animated captions
Export - Platform-optimized encoding (YouTube Shorts, TikTok, Instagram Reels)

Demo

Demo video/GIF coming soon — showing the full pipeline from input to rendered short with Bold-style captions.

Features

Claude-powered segment scoring - 5-dimension rubric (hook strength, coherence, emotion, value density, payoff) with weighted scoring. No heuristic keyword matching - Claude understands narrative arcs.
3 caption styles - Bold (ALL CAPS, yellow highlights), Bounce (bouncy spring, rotating colors), Clean (minimal fade-in)
Cursor tracking - For screen recordings, detects mouse cursor via frame differencing and smoothly pans the crop to follow it
Audio-aware boundary snapping - Never cuts mid-word or mid-sentence. Extends to natural sentence endings and silence points.
Remotion rendering - React-based single-pass rendering with spring animations, word-level karaoke highlighting, hook text overlays, and progress bars
GPU acceleration - CUDA for transcription, NVENC for export encoding (falls back to CPU gracefully)

Prerequisites

FFmpeg (system package)
Python 3.10+
Node.js 18+
Claude Code (CLI)
NVIDIA GPU recommended (for CUDA transcription + NVENC encoding)

Installation

# Clone the repository
git clone https://github.com/AgriciDaniel/claude-shorts.git
cd claude-shorts

# Install Python + Node.js dependencies
bash setup.sh

# Install as a Claude Code skill
bash install.sh

Windows

claude-shorts requires Unix tools (FFmpeg, bash). On Windows, use WSL 2:

# Inside WSL
git clone https://github.com/AgriciDaniel/claude-shorts.git
cd claude-shorts
bash setup.sh
bash install.sh

What `setup.sh` does

Creates a Python virtual environment at ~/.shorts-skill/ (or reuses ~/.video-skill/ if it exists)
Installs faster-whisper, mediapipe, numpy, opencv-python, and PyTorch (CUDA or CPU variant)
Runs npm install in the remotion/ directory
Checks for system dependencies (FFmpeg, jq)

Usage

In Claude Code, invoke the skill:

/shorts

Then provide your video file when prompted. Claude will:

Transcribe the video
Present scored segment candidates
Ask which segments to render, caption style, and target platform
Render and export the final shorts

Example Interaction

You: /shorts ~/Videos/my-talk.mp4
Claude: [Transcribes, detects content type, scores segments]

| # | Time          | Dur  | Score | Hook                           |
|---|---------------|------|-------|--------------------------------|
| 1 | 04:22 - 05:01 | 39s  | 87    | "Nobody talks about this..."  |
| 2 | 12:45 - 13:28 | 43s  | 82    | "Here's the exact framework." |
| 3 | 08:11 - 08:52 | 41s  | 79    | "I tested this for 6 months." |

Claude: Which segments? Caption style? Platform?
You: 1 and 3, bounce style, youtube

Claude: [Snaps boundaries, extracts clips, renders, exports]
Output: shorts/short_01_yt.mp4, shorts/short_03_yt.mp4

Project Structure

claude-shorts/
├── SKILL.md                           # 10-step interactive pipeline (Claude Code skill)
├── CLAUDE.md                          # Project-level instructions
├── install.sh                         # Install to ~/.claude/skills/
├── setup.sh                           # Python + Node dependency installer
│
├── scripts/
│   ├── transcribe.py                  # faster-whisper GPU transcription
│   ├── detect_content.py              # MediaPipe content type classifier
│   ├── compute_reframe.py             # Face tracking + cursor tracking + crop
│   ├── snap_boundaries.py             # Audio-aware boundary snapping
│   ├── preflight.sh                   # Input validation + disk space check
│   ├── detect_gpu.sh                  # NVIDIA NVENC detection
│   └── export.sh                      # Platform-specific FFmpeg encoding
│
├── remotion/
│   ├── package.json                   # Remotion v4 + React 19 + Zod
│   ├── render.mjs                     # Bundle-once-render-many orchestrator
│   ├── remotion.config.ts
│   └── src/
│       ├── Root.tsx                   # Composition registry
│       ├── ShortVideo.tsx             # Main composition
│       ├── types.ts                   # Zod schemas for props
│       ├── components/
│       │   ├── VideoFrame.tsx         # Reframed video with animated crop pan
│       │   ├── Captions.tsx           # Style dispatcher
│       │   ├── BoldCaptions.tsx        # Bold ALL CAPS, pop-in spring
│       │   ├── BounceCaptions.tsx     # Bouncy scale, bright colors
│       │   ├── CleanCaptions.tsx      # Minimal fade-in
│       │   ├── HookOverlay.tsx        # First 3.5s hook text
│       │   └── ProgressBar.tsx        # Bottom progress indicator
│       ├── hooks/
│       │   └── useCaptionPages.ts     # @remotion/captions TikTok-style pages
│       └── styles/
│           ├── fonts.ts               # @font-face declarations
│           └── theme.ts               # Color palettes per style
│
└── references/
    ├── scoring-rubric.md              # 5-dimension scoring criteria
    ├── caption-styles.md              # Visual specs + spring configs
    ├── platform-specs.md              # YouTube/TikTok/Instagram encoding
    └── remotion-patterns.md           # Remotion best practices

Caption Styles

Style	Font	Animation	Best For
Bold	Montserrat Bold	Pop-in spring, yellow active word	Business, education
Bounce	Bangers	Bouncy scale 70-120-100%, rotating colors	Entertainment, energy
Clean	Inter Bold	Fade-in opacity, white + shadow	Professional, interviews

Platform Export Specs

Platform	Codec	Bitrate	Audio
YouTube Shorts	H.264 High 4.2	12 Mbps	AAC 192k
TikTok	H.264	CRF 18, -preset slow	AAC 128k
Instagram Reels	H.264 High 4.2	4.5 Mbps (max 5000k)	AAC 128k

Content Type Strategies

Content Type	Reframe Strategy	Zoom
Talking-head	Face-tracked center crop (MediaPipe)	9:16 exact
Screen recording	Cursor-tracked pan with moderate zoom	55% of source width
Podcast	Dominant speaker tracking	9:16 exact

Dependencies

Python (installed via `setup.sh`)

faster-whisper - GPU-accelerated Whisper
mediapipe - Face detection for content classification + reframing
numpy - Array operations for cursor tracking smoothing
opencv-python - Frame differencing for cursor detection

Node.js (installed via `setup.sh`)

Remotion v4 - React-based video rendering
@remotion/captions - TikTok-style word-level captions
React 19 - Component framework
Zod - Runtime type validation for props

Fonts

Caption fonts are bundled from Google Fonts under the SIL Open Font License:

Montserrat Bold (Bold style)
Bangers Regular (Bounce style)
Inter Bold (Clean style)

System

FFmpeg - Audio extraction, segment cutting, export encoding
jq - JSON processing in shell scripts

How Segment Scoring Works

Claude scores each candidate on 5 weighted dimensions:

Dimension	Weight	What Claude Looks For
Hook Strength	0.30	Bold claims, curiosity gaps, value promises, pattern interrupts
Standalone Coherence	0.25	Makes complete sense without any context from the rest of the video
Emotional Intensity	0.20	Strong opinions, surprise reveals, humor, passion
Value Density	0.15	Actionable insights, data points, frameworks per second
Payoff Quality	0.10	Satisfying conclusion - punchline, reveal, call-to-action

Final score = weighted sum, scale 0-100. Minimum threshold: 60.

Support

Issues: GitHub Issues
Discussions: GitHub Discussions

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.claude-plugin		.claude-plugin
.github		.github
references		references
remotion		remotion
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SKILL.md		SKILL.md
claude-shorts-header.jpeg		claude-shorts-header.jpeg
install.sh		install.sh
requirements.txt		requirements.txt
setup.sh		setup.sh
uninstall.ps1		uninstall.ps1
uninstall.sh		uninstall.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

claude-shorts

How It Works

Demo

Features

Prerequisites

Installation

Windows

What `setup.sh` does

Usage

Example Interaction

Project Structure

Caption Styles

Platform Export Specs

Content Type Strategies

Dependencies

Python (installed via `setup.sh`)

Node.js (installed via `setup.sh`)

Fonts

System

How Segment Scoring Works

Support

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

claude-shorts

How It Works

Demo

Features

Prerequisites

Installation

Windows

What setup.sh does

Usage

Example Interaction

Project Structure

Caption Styles

Platform Export Specs

Content Type Strategies

Dependencies

Python (installed via setup.sh)

Node.js (installed via setup.sh)

Fonts

System

How Segment Scoring Works

Support

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

What `setup.sh` does

Python (installed via `setup.sh`)

Node.js (installed via `setup.sh`)

Packages