Skip to content

Preview / Blocked by upstream: MLX on three clouds: Fal, Modal, Replicate#256

Draft
anthonywu wants to merge 7 commits intofilipstrand:mainfrom
anthonywu:mlx-on-clouds
Draft

Preview / Blocked by upstream: MLX on three clouds: Fal, Modal, Replicate#256
anthonywu wants to merge 7 commits intofilipstrand:mainfrom
anthonywu:mlx-on-clouds

Conversation

@anthonywu
Copy link
Collaborator

@anthonywu anthonywu commented Aug 23, 2025

Update: Jan 31, 2026

All three cloud GPU service examples have been updated with the release from #321

  1. modal run: nvproxy: Add support for UVM_ENABLE_READ_DUPLICATION and UVM_SET_ACCESSED_BY. google/gvisor#12436 unblocks the issue at the upstream gvisor project, but that runtime update has not been adopted by Modal yet (same error as August attempt)
  2. fal run fal_run_mflux.py will start a remote app, and even a playground GUI, the generation apparently works if you observe only the log text content – tqdm progress bars complete, but the Result does not return to the playground GUI. Will investigate whether this is just my skill issue with Fal Apps.
  3. Replicate cog predict - TBD

Update: Aug 23, 2025

This is WIP to try out mflux on Linux (CPU for now) on Fal, Replicate Cog and Modal

When I get these to work, we can merge these as tools/* resources that isn't part of the official project but can be copy/pasteable to deployable application layers.

@anthonywu anthonywu requested a review from filipstrand August 23, 2025 10:11
@anthonywu anthonywu changed the title Preview: MLX on three clouds: Fal, Modal, Replicate Preview / Blocked by upstream: MLX on three clouds: Fal, Modal, Replicate Aug 26, 2025
pyproject.toml Outdated
"huggingface-hub>=0.24.5,<1.0",
"matplotlib>=3.9.2,<4.0",
"mlx>=0.27.0,<0.28.0",
"mlx>=0.27.0,<0.29.0",
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested this locally too and works fine!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants