use official online container #21

Merged
TaekyungHeo merged 1 commit into NVIDIA:main from use_online_containe_for_NeMo on Jun 3, 2024

Conversation

jeffnvidia
Contributor

@jeffnvidia jeffnvidia commented May 19, 2024

Summary

Based on PR 31.
This PR uses the latest official NeMo version.

Test Plan

1. Test by @amaslenn
1.1 CI
2. Test by @jeffnvidia
2.1 Slurm command generation
$ cloudaix --mode run --system_config_path conf/v0.6/general/system/israel_1.toml --test_scenario_path conf/v0.6/general/test_scenario/llama/llama.toml
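
For anyone repeating the Slurm command generation check, here is a minimal sketch of how the invocation above could be assembled and sanity-checked from a script. The helper names `build_cloudaix_run_command` and `missing_configs` are hypothetical and not part of the repository. The flags and paths are copied verbatim from the command above, and the sketch only prints the command rather than running it.

```python
import shlex
from pathlib import Path


def build_cloudaix_run_command(system_config: str, test_scenario: str) -> list[str]:
    """Assemble the argv for the run-mode invocation quoted in the test plan.

    Hypothetical helper: only the flags shown in the test plan are used.
    """
    return [
        "cloudaix",
        "--mode", "run",
        "--system_config_path", system_config,
        "--test_scenario_path", test_scenario,
    ]


def missing_configs(*paths: str) -> list[str]:
    """Return the config paths that do not exist as files (hypothetical pre-check)."""
    return [p for p in paths if not Path(p).is_file()]


system_config = "conf/v0.6/general/system/israel_1.toml"
test_scenario = "conf/v0.6/general/test_scenario/llama/llama.toml"

cmd = build_cloudaix_run_command(system_config, test_scenario)

# Report missing files up front instead of letting the tool fail later.
for path in missing_configs(system_config, test_scenario):
    print(f"warning: config not found: {path}")

# Print the shell-quoted command; pass `cmd` to subprocess.run to execute it.
print(shlex.join(cmd))
```

Building the command as a list keeps each argument separate, so paths containing spaces would still be passed correctly if the list were handed to `subprocess.run`.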

Additional Notes

@jeffnvidia jeffnvidia force-pushed the use_online_containe_for_NeMo branch from 1c0f103 to cb937ef Compare May 19, 2024 14:11
@srinivas212
Contributor

Needs PR template populated.

@srinivas212
Contributor

Please rebase directly on main.

@jeffnvidia
Contributor Author

> Please rebase directly on main.

It was based on another PR of mine so that everything could be merged together.

@jeffnvidia jeffnvidia force-pushed the use_online_containe_for_NeMo branch 3 times, most recently from 762005b to 5a4aefb Compare May 22, 2024 15:38
@amaslenn amaslenn requested a review from srinivas212 May 23, 2024 06:11
@jeffnvidia jeffnvidia force-pushed the use_online_containe_for_NeMo branch from 5a4aefb to edbdf6a Compare May 23, 2024 14:31
tests/test_slurm_system.py Outdated (resolved)
@jeffnvidia jeffnvidia force-pushed the use_online_containe_for_NeMo branch from edbdf6a to 1525b55 Compare May 29, 2024 14:28
@jeffnvidia jeffnvidia force-pushed the use_online_containe_for_NeMo branch from 1525b55 to b610739 Compare May 30, 2024 09:10
@amaslenn amaslenn requested review from amaslenn and removed request for amaslenn May 30, 2024 09:31
@lappazos
Contributor

lappazos commented Jun 3, 2024

@srinivas212 @TaekyungHeo Could you please merge? This was tested completely.

@TaekyungHeo TaekyungHeo merged commit 3b91e7f into NVIDIA:main Jun 3, 2024
2 checks passed
@jeffnvidia jeffnvidia deleted the use_online_containe_for_NeMo branch July 30, 2024 14:32
5 participants