KEP-2401: Kubeflow LLM Trainer V2 #2401
/kind feature

@Electronic-Waste Thank you for creating a dedicated issue for each task! /area llm

@andreyvelich Thank you for this! I'm also wondering if we could pin this issue so that more people can track our progress. It has currently slipped to the second page of issues, which makes it easy for other folks to miss. :)

Yes, let me pin it.
This is the tracking issue for the Kubeflow LLM Trainer V2, a submodule of Kubeflow Training V2: #2170
We aim to solve:
However, the LLM Trainer V2 design is complex and needs further discussion, so we decided to open a separate issue to track it.
- `train()` API #2503
- `TorchTuneConfig` to `train()` API #2504
- `torch` plugin to support `torchtune` config mutation #2507
- `torch` plugin #2508
- `torchtune` trainer image #2511

KEP Updates:
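To make the task list above more concrete, here is a minimal sketch of how a `TorchTuneConfig` might be passed into a `train()` API. This is purely illustrative: the class, its fields (`dtype`, `batch_size`, `epochs`), the `train()` signature, and the runtime name `torchtune-llama3-8b` are all assumptions, not the actual Kubeflow Training V2 SDK, whose final shape is being designed in the linked issues.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TorchTuneConfig:
    """Hypothetical config object carrying torchtune-specific overrides.
    Field names are illustrative only; the real SDK may differ."""
    dtype: str = "bf16"
    batch_size: int = 1
    epochs: int = 1


def train(runtime: str, config: Optional[TorchTuneConfig] = None) -> dict:
    """Toy stand-in for a generic train() entry point: it would resolve the
    named training runtime and apply the user's torchtune config overrides."""
    config = config or TorchTuneConfig()
    return {
        "runtime": runtime,
        "overrides": {
            "dtype": config.dtype,
            "batch_size": config.batch_size,
            "epochs": config.epochs,
        },
    }


# Example: fine-tune with a non-default precision and epoch count.
job = train("torchtune-llama3-8b", TorchTuneConfig(dtype="fp32", epochs=3))
print(job["overrides"]["epochs"])  # -> 3
```

The design intent captured by the issues above is that the `torch` runtime plugin would mutate the `torchtune` recipe config from such user-supplied overrides, rather than requiring users to hand-edit recipe files.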
Initial Design (Google Doc): Kubeflow Training V2 LLM Trainer Design
/area runtime
/cc @kubeflow/wg-training-leads @deepanker13 @saileshd1402 @seanlaii @helenxie-bit @astefanutti @varshaprasad96 @franciscojavierarceo @thesuperzapper @rimolive @juliusvonkohout @jbottum @varodrig @Doris-xm @truc0