-
Notifications
You must be signed in to change notification settings - Fork 757
Pull requests: kubeflow/trainer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(runtimes): Support MLX Distributed Runtime with OpenMPI
size/XL
#2565
opened Mar 24, 2025 by
andreyvelich
Loading…
update contributing guide for trainer v2
size/L
#2563
opened Mar 23, 2025 by
burhanuddin6
Loading…
1 task
test(runtime): add UT for jobset runtime valid function.
size/L
#2562
opened Mar 23, 2025 by
Harshal292004
Loading…
1 task
docs: update CONTRIBUTING.md for Kubeflow Trainer V2
size/XS
#2561
opened Mar 23, 2025 by
muzzlol
Loading…
1 task done
test(runtime): add UT for torch runtime valid function.
size/L
#2560
opened Mar 22, 2025 by
IRONICBo
Loading…
1 task done
Implement CustomValidation UT for MPI plugin
size/L
#2555
opened Mar 21, 2025 by
tenzen-y
Loading…
1 task
Fix Prometheus metrics counter
ok-to-test
size/M
#2553
opened Mar 21, 2025 by
izuku-sds
Loading…
1 task
[feature]:add validatioons for MPIRuntime with RunLauncherAsNode
size/S
#2551
opened Mar 20, 2025 by
Harshal292004
Loading…
1 task
Updated base image to Debian image and changed install commands compatible with Debian image
lgtm
size/S
#2528
opened Mar 16, 2025 by
Debabrata47
Loading…
KEP-2170: Add manifest overlays for standalone installation
size/M
#2527
opened Mar 16, 2025 by
Doris-xm
Loading…
1 task
KEP-2401: Add
TorchTuneConfig
to train()
API
size/L
#2522
opened Mar 14, 2025 by
Electronic-Waste
Loading…
1 task
chore: Add unit tests for
pkg/apply
size/L
#2479
opened Mar 6, 2025 by
akagami-harsh
Loading…
1 task
Add Initialized and ComponentsCreated conditions to TrainJob API
do-not-merge/hold
size/M
#2464
opened Mar 1, 2025 by
dineshkolhe1
Loading…
Config API for Kubeflow Trainer controller manager
ok-to-test
size/L
#2428
opened Feb 9, 2025 by
chahatsagarmain
Loading…
1 task
Added an example Notebook to fine-tune Llama3 model using PyTorchJob
size/L
#2419
opened Feb 5, 2025 by
aishwaryaraimule21
Loading…
1 task
Use dictionary unpacking to pass trainer function arguments
size/XS
#2384
opened Jan 9, 2025 by
astefanutti
Loading…
1 task done
KEP-2170: Add the manifests overlay for Kubeflow Training V2
lgtm
ok-to-test
size/L
#2382
opened Jan 9, 2025 by
Doris-xm
Loading…
1 task
Fix read permission denied on train script when run as non-root
size/XS
#2373
opened Jan 7, 2025 by
astefanutti
Loading…
1 task done
Update workflow and docs for releasing Training Operator
lifecycle/stale
size/L
#2362
opened Dec 23, 2024 by
LogicalGuy77
Loading…
1 task done
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.