[BOUNTY - $300] Support Multi-GPU #223

Open
AlexCheema opened this issue Sep 19, 2024 · 7 comments

@AlexCheema
Contributor

Currently you can only run one exo instance on each device.

There are some design decisions here:

  • Should we support running multiple exo instances on the same device, with one per GPU?
  • Or should we support running one exo instance that uses multiple GPUs?
@AlexCheema changed the title from "Support Multi-GPU" to "[BOUNTY - $300] Support Multi-GPU" on Sep 19, 2024
@Sean-fn
Contributor

Sean-fn commented Oct 15, 2024

I agree with supporting one exo instance that uses multiple GPUs.
This approach would allow finer-grained sharding when only one model is running inference.
What do you think?
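
A rough sketch of what that in-process layer sharding could look like (a hedged PyTorch example; the helper name and partitioning scheme are illustrative assumptions, not exo's actual API):

```python
# Hypothetical sketch: spread a model's layers across all locally visible
# GPUs from a single process. Assumes `layers` is a list of torch.nn.Modules.
import torch

def shard_layers_across_gpus(layers):
    """Assign contiguous runs of layers to each visible GPU."""
    n_gpus = torch.cuda.device_count()
    assert n_gpus > 0, "no CUDA devices visible"
    per_gpu = (len(layers) + n_gpus - 1) // n_gpus  # ceiling division
    placement = {}
    for i, layer in enumerate(layers):
        device = f"cuda:{min(i // per_gpu, n_gpus - 1)}"
        layer.to(device)
        placement[i] = device
    return placement
```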

@cmcmaster1

Chiming in as a new user with a multi-GPU setup. One instance is easiest. Users can simply control GPU selection with the CUDA_VISIBLE_DEVICES environment variable.
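
To illustrate, a minimal sketch of that selection mechanism (assuming the `exo` command is on PATH; the GPU indices are just an example):

```python
# Sketch: CUDA_VISIBLE_DEVICES controls which physical GPUs a process sees,
# so a single exo instance can be restricted to any subset of devices.
import os
import subprocess

env = dict(os.environ, CUDA_VISIBLE_DEVICES="0,1")  # expose GPUs 0 and 1 only
subprocess.run(["exo"], env=env)  # one instance, two visible GPUs
```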

@jorge123255

I was able to do this: I forked the repo on GitHub and added configurations to integration.py and integration_engine.py.

@jorge123255

The only remaining issue is getting the exo console page to show two GPUs instead of one; still testing.

@benjamin-asdf

benjamin-asdf commented Nov 25, 2024

Multiple instances per device, with a GPU assigned to each (approach 1):

pros:

  • leverage all the existing orchestration and model-splitting functionality; ideally you figure out how to parallelize layers (and only once)
  • aesthetics: uses the primitive exo functionality at a different scale
  • this approach seems to scale to different kinds of topologies (even ones not yet known)

cons:

  • node communication overhead?

Single instance per device, with multiple GPUs assigned (approach 2):

aspects:

  • nodes have to broadcast the summed RAM of their multi-GPU setup, and handle multi-GPU internally (see the sketch after this comment)

pros:

  • don't have to deal with overlapping system resources (ports, file locks, etc.)
  • inference engines already support multi-GPU? (but exo does, too - across devices)

cons:

  • two ways of doing multi-GPU
  • composability?

Case 1:
Multi-GPU (approach 2) is very easy to do.
In that case, one might go with approach 2 for now and keep approach 1 in mind for later.

Case 2:
Multi-GPU is not easy to do, and approaches 1 and 2 take roughly the same effort.
In that case, I would go with approach 1.
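
To make the "broadcast summed RAM" aspect of approach 2 concrete, here is a hedged sketch using pynvml (the function name and broadcast payload shape are assumptions for illustration, not exo's actual interface):

```python
# Sketch: a node sums total memory across its local GPUs so it can
# advertise the aggregate to peers. Requires `pip install nvidia-ml-py`.
import pynvml

def aggregate_gpu_memory_bytes() -> int:
    pynvml.nvmlInit()
    try:
        return sum(
            pynvml.nvmlDeviceGetMemoryInfo(
                pynvml.nvmlDeviceGetHandleByIndex(i)
            ).total
            for i in range(pynvml.nvmlDeviceGetCount())
        )
    finally:
        pynvml.nvmlShutdown()

# A node's capability broadcast might then look like (shape is illustrative):
# {"memory_bytes": aggregate_gpu_memory_bytes(), "gpu_count": ...}
```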

@freerainboxbox

I implemented a temporary workaround using approach 2 in #656.

@AlexCheema
Contributor Author

> I implemented a temporary workaround using approach 2 in #656.

I suppose this isn't a full solution for multi-GPU; it's just a wrapper around VISIBLE_DEVICES.
This will be fully supported in the rearchitecture I'm working on.
