With Gemma 3n E2B/E4B just dropping, on-device inference on Android is genuinely good now. But serious tasks still warrant a larger cloud model.
Would love the ability to configure cloud API keys (OpenAI, Anthropic, etc.) and switch between local and cloud backends per message — same chat UI, shared conversation history. The goal is using local inference for most things to cut API costs, and reaching for cloud only when needed.
Essentially the same model-switching UX that Open WebUI offers, but built into PocketPal.
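To make the idea concrete, here's a minimal sketch of what per-message backend routing over a shared history could look like. All names here (`ChatBackend`, `LocalBackend`, `CloudBackend`, `sendMessage`) are hypothetical illustrations, not PocketPal's actual internals, and the backends just echo instead of running real inference:

```typescript
interface Message {
  role: "user" | "assistant";
  content: string;
  backend: string; // record which backend produced/received each message
}

// Common interface both local and cloud inference would implement.
interface ChatBackend {
  readonly id: string;
  complete(history: Message[], prompt: string): Promise<string>;
}

// Stand-in for on-device (e.g. llama.cpp-based) inference.
class LocalBackend implements ChatBackend {
  readonly id = "local";
  async complete(_history: Message[], prompt: string): Promise<string> {
    return `[local] ${prompt}`; // placeholder response
  }
}

// Stand-in for an OpenAI/Anthropic-style client configured with a user-supplied key.
class CloudBackend implements ChatBackend {
  readonly id: string;
  constructor(provider: string, private apiKey: string) {
    this.id = provider;
  }
  async complete(_history: Message[], prompt: string): Promise<string> {
    return `[${this.id}] ${prompt}`; // real impl would call the provider's API
  }
}

// One shared conversation history; the backend is chosen per message.
async function sendMessage(
  history: Message[],
  backend: ChatBackend,
  prompt: string
): Promise<string> {
  history.push({ role: "user", content: prompt, backend: backend.id });
  const reply = await backend.complete(history, prompt);
  history.push({ role: "assistant", content: reply, backend: backend.id });
  return reply;
}
```

The point is that the chat UI only ever talks to `ChatBackend`, so switching from local to cloud mid-conversation is just passing a different backend into the next `sendMessage` call.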