vsoch commented Apr 1, 2023

It is not yet clear how these machine learning runs can best integrate with Flux beyond submitting a job to it, and we will need to think about this further. One design that I think could work really nicely here is:

  1. Use Foundry to store data, and download a dataset via the broker pre command.
  2. Use flux filemap in the batch script (with batch: true and batchRaw: true) to map the data to the nodes.
  3. Run a job that uses the data across the nodes (e.g., MPI or similar).
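
To make steps 2 and 3 a bit more concrete, here is a rough sketch of a helper that only generates the text of such a batch script; it does not call Flux itself. Everything in it is an assumption for illustration: the function name `build_batch_script`, the directory paths, and the job command are hypothetical, and the exact `flux filemap` and `flux run` invocations should be checked against the installed flux-core version. The Foundry download in step 1 is assumed to have already happened in the broker pre command and is not shown.

```python
import shlex


def build_batch_script(dataset_dir, staging_dir, job_command, nodes):
    """Return the text of a raw batch script for steps 2 and 3.

    All arguments are hypothetical placeholders:
      dataset_dir  -- where the broker pre command left the dataset
      staging_dir  -- where each node should receive its copy
      job_command  -- the job to run, as a list of arguments
      nodes        -- number of nodes to run the job across
    """
    data = shlex.quote(dataset_dir)
    dest = shlex.quote(staging_dir)
    job = " ".join(shlex.quote(part) for part in job_command)
    lines = [
        "#!/bin/bash",
        "set -euo pipefail",
        "# Step 2: map the dataset downloaded by the broker pre command.",
        f"flux filemap map -C {data} .",
        "# Fetch the mapped files on every broker rank (assumed usage).",
        f"flux exec -r all mkdir -p {dest}",
        f"flux exec -r all flux filemap get -C {dest}",
        "# Step 3: run a job that reads the data across the nodes.",
        f"flux run -N {nodes} {job}",
        "# Drop the mapping once the job is done.",
        "flux filemap unmap",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical paths and training command, for illustration only.
    print(build_batch_script("/data/dataset", "/tmp/dataset",
                             ["python", "train.py"], 4))
```

The script text is generated rather than executed so that it can be reviewed first and then pasted into the batch command; with batchRaw: true, the intent is that the script is used as written instead of being wrapped in a submit command.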

vsoch and others added 2 commits March 31, 2023 19:40
Signed-off-by: vsoch <[email protected]>