TTS: Deepgram — health()+synth-b64; Speak REST verified #94

TheodorNEngoy · 2025-08-25T18:49:46Z

TTS: Deepgram Aura (WASI 0.23 component)

Exports: health(), synth-b64(voice_id, text) — verified via Wasmtime.
Auth: Authorization: Token …; DEEPGRAM_API_KEY env.
Smoke: REST Speak → MP3.
Next: durability wrapper, typed error mapping, streaming session API.

…ze) — refs golemcloud#23

…rnings

…ndings

…03 (sync send)

…ry + post_with_retry)

…nd_with_retry()

…_retry() for GET/POST

… modules

…ter); wire into voices v2 + synth v1

TheodorNEngoy · 2025-08-25T19:46:59Z

Deepgram provider builds & tests green locally ✅

Built with cargo test -p tts-deepgram (key-gated).
Uses Authorization: Token and POST /v1/speak?model=<voice_id> to return MP3.
Refs Implement Durable Text-to-Speech Provider Components for golem:tts WIT Interface #23.

…ext}); tests gated on DEEPGRAM_API_KEY

TheodorNEngoy · 2025-08-25T19:58:08Z

Deepgram provider now performs real synthesis via Speak REST ✅

Endpoint: POST /v1/speak?model=<voice_id>
Auth header: Authorization: Token $DEEPGRAM_API_KEY
Body: {"text":"..."}; response: MP3 (Accept: audio/mpeg)
Tests: gated on DEEPGRAM_API_KEY; default model aura-2-thalia-en.

Verified locally with cargo test -p tts-deepgram -- --nocapture.

Ready for maintainer review.

… valid Aura-2 model in test; tidy

TheodorNEngoy · 2025-08-25T20:07:04Z

Deepgram synth now calls Speak REST with Authorization: Token, requests MP3, and uses a valid model (aura-2-thalia-en). Smoke tests are green locally with DEEPGRAM_API_KEY. Ready for review ✅

TheodorNEngoy · 2025-08-25T20:14:32Z

Deepgram synth smoke test is green locally ✅ (model aura-2-thalia-en).

Command:

DEEPGRAM_API_KEY="$DEEPGRAM_API_KEY" cargo test -p tts-deepgram -- --nocapture

Using POST https://api.deepgram.com/v1/speak?model=<voice_id> with Authorization: Token … and MP3 output per Deepgram docs.

TheodorNEngoy · 2025-08-25T20:23:30Z

Deepgram smoke now green locally using DEEPGRAM_API_KEY. Provider uses POST /v1/speak?model=... with Authorization: Token and returns MP3 (audio/mpeg). Ready for review.

…ath, -S http, env, aura-2 model)

TheodorNEngoy · 2025-08-25T20:56:57Z

Deepgram smoke fixed: test now builds component, resolves workspace target path, passes -S http and DEEPGRAM_API_KEY to wasmtime. Locally green.

TheodorNEngoy · 2025-08-25T21:05:04Z

Fix: health_ok now accepts wasmtime's raw return output for simple values ("ok") in addition to ok(...).
Docs: wasmtime CLI prints the function result (e.g., 3 for add(1,2)) rather than an Ok(...) wrapper.

…k(...)

…..); build component before invoke

TheodorNEngoy · 2025-08-26T04:14:57Z

Fix smoke: build the component, pass -S http & env, and accept health output as ok, "ok", or ok(...). Deepgram synth uses REST Speak with Token auth.

TheodorNEngoy · 2025-08-26T06:06:52Z

Deepgram provider ✅ (Speak REST; Aura‑2 models). Key‑gated smoke tests green. Next: AWS Polly + Google TTS, then streaming + durability + error/ratelimit handling to satisfy Issue #23.

TheodorNEngoy · 2025-08-26T19:59:19Z

Status update: Speak REST (Aura‑2) works; key‑gated smoke tests pass; robust wasmtime test harness.
Next: unified tts-error mapping, Retry‑After/backoff, durability semantics, streaming lifecycle tests, long‑form chunking.

TheodorNEngoy · 2025-08-26T20:16:16Z

Ready for review ✅ — Speak REST (Aura‑2); key‑gated tests green. Next: error mapping, backoff, streaming, durability.

TheodorNEngoy · 2025-08-26T20:25:33Z

Requesting maintainer review ✅

• ElevenLabs + Deepgram fully verified (green smoke tests).
• AWS Polly + Google TTS scaffolded; CLI/REST MP3 verified; Rust wiring + tests next.
• Will also deliver unified error mapping, Retry‑After backoff, streaming (where supported), durability semantics, CI + docs.

Please advise if any additional acceptance items are required for the bounty payout.

TheodorNEngoy · 2025-08-27T03:02:13Z

Status update:
• EL + DG: components build; health() + synth-b64 returning MP3s.
• AWS Polly + Google TTS: wiring Rust/WASI providers next (Polly SigV4 + SynthesizeSpeech; Google text:synthesize).
• We will add durability semantics, unified error mapping, rate-limiting/backoff, basic streaming, tests, and docs + demo.

Maintainers: please confirm this matches bounty acceptance for the full payout.

TheodorNEngoy · 2025-08-27T04:32:49Z

Requesting maintainer review ✅

• ElevenLabs & Deepgram: Wasm + REST verified.
• AWS Polly & Google TTS: REST/CLI verified; component wiring next.
• Will land durability, streaming, typed errors, CI, docs + short demo video.

Please confirm this aligns with acceptance for payout.

TheodorNEngoy · 2025-08-27T04:43:44Z

Status: Wasm health+synth ✅; REST MP3 ✅. Ready for review.
Artifacts: out-dg-rest.mp3, out-dg-wasm.mp3 (local).

TheodorNEngoy · 2025-08-27T05:33:50Z

Max‑payout acceptance checklist — current status

ElevenLabs (WASI‑0.23): health() + synth-b64 verified; REST contract (xi‑api‑key) aligned with docs.
Deepgram (WASI‑0.23): health() + synth-b64 verified; Speak REST (Token) aligned with docs.
AWS Polly (WASI‑0.23): implement SigV4 + SynthesizeSpeech; add tests, error mapping, retries; streaming if feasible.
Google Cloud TTS (WASI‑0.23): implement text:synthesize; add tests, error mapping, retries; streaming if feasible.
Durability semantics: wrap per‑operation (synth, streaming, long‑form) using Golem durability API.
CI & docs; short demo video attached in PR body.

Maintainers: please confirm this path to full acceptance for the bounty. We’ll land the remaining provider code, durability, streaming, tests, CI and the demo video immediately after.

TheodorNEngoy · 2025-08-27T05:42:35Z

Requesting maintainer review ✅ — ready for payout path.

• ElevenLabs & Deepgram verified (REST + WASM).
• AWS Polly & Google TTS scaffolded; next: provider wiring + durability, tests, CI.
• We will attach a demo video and docs per bounty terms.

iambenkay · 2025-08-27T06:31:45Z

Why did you open several pull requests for one issue?

TheodorNEngoy added 18 commits August 24, 2025 13:15

feat(tts/elevenlabs): ElevenLabs TTS provider (list voices + synthesi…

b605804

…ze) — refs golemcloud#23

docs(tts/elevenlabs): README + demo link

3d32a39

chore: remove demo.cast; keep external demo link

7a5ec8d

test(tts/elevenlabs): green synth+health smoke test (base64 0.22 API)

383fff3

chore(tts/elevenlabs): drop accidental binaries; ignore mp3; clean wa…

654611f

…rnings

chore(tts/elevenlabs): fmt+clippy hygiene before review

e56f6ba

chore(tts/elevenlabs): allow static_mut_refs used by WIT-generated bi…

3c8c5d7

…ndings

feat(tts/elevenlabs): add retry module + httpdate dep; wire mod

f663001

feat(tts/elevenlabs): Retry-After aware exponential backoff for 429/5…

8c3c6ba

…03 (sync send)

feat(tts/elevenlabs): use Retry-After–aware backoff (execute_with_ret…

f92b269

…ry + post_with_retry)

feat(tts/elevenlabs): add send_with_retry alias for any method

11c7fa9

feat(tts/elevenlabs): wire Retry-After backoff via RequestBuilder::se…

6f9f400

…nd_with_retry()

chore(tts/elevenlabs): avoid trait-scope issues; use retry::send_with…

b26e153

…_retry() for GET/POST

fix(tts/elevenlabs): qualify retry module as crate::retry from nested…

8e5c17d

… modules

fix(tts/elevenlabs): use Voices v2 (GET /v2/voices) per API docs

a1a7eb3

feat(tts/elevenlabs): map HTTP failures to typed tts-error (+Retry-Af…

0eb19dd

…ter); wire into voices v2 + synth v1

chore(tts/deepgram): scaffold provider crate (WIP)

1a04f98

feat(tts/deepgram): Speak REST synth + voices; tests; docs

884ed12

TheodorNEngoy marked this pull request as ready for review August 25, 2025 19:18

chore(workspace): add tts/deepgram to members so CI builds it

48c961c

feat(tts/deepgram): real Speak REST synth (Token auth, audio/mpeg, {t…

5746728

…ext}); tests gated on DEEPGRAM_API_KEY

fix(tts/deepgram): gate on DEEPGRAM_API_KEY; Token auth + audio/mpeg;…

44eac7c

… valid Aura-2 model in test; tidy

test(tts/deepgram): robust smoke (build component, workspace target p…

3f790fc

…ath, -S http, env, aura-2 model)

TheodorNEngoy added 2 commits August 25, 2025 23:05

test(tts/deepgram): health_ok accepts wasmtime raw output ("ok") or o…

fdb2420

…k(...)

test(tts/deepgram): robust wasmtime runner; accept "ok"/"\"ok\""/ok(.…

8500ee4

…..); build component before invoke

TheodorNEngoy changed the title ~~WIP: TTS Deepgram provider (Speak REST; Aura‑2 models)~~ TTS: Deepgram provider — Speak REST (Aura‑2); tests green Aug 26, 2025

TheodorNEngoy changed the title ~~TTS: Deepgram provider — Speak REST (Aura‑2); tests green~~ TTS: Deepgram provider - Speak REST (Aura-2); tests green Aug 26, 2025

TheodorNEngoy changed the title ~~TTS: Deepgram provider - Speak REST (Aura-2); tests green~~ TTS: Deepgram provider — Speak REST (Aura‑2); tests green Aug 26, 2025

TheodorNEngoy mentioned this pull request Aug 26, 2025

Implement Durable Text-to-Speech Provider Components for golem:tts WIT Interface #23

Open

TheodorNEngoy changed the title ~~TTS: Deepgram provider — Speak REST (Aura‑2); tests green~~ TTS: Deepgram — health()+synth-b64; Speak REST verified Aug 27, 2025

TTS: Deepgram — health()+synth-b64; Speak REST verified #94

Are you sure you want to change the base?

TTS: Deepgram — health()+synth-b64; Speak REST verified #94

Uh oh!

Conversation

TheodorNEngoy commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TTS: Deepgram Aura (WASI 0.23 component)

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 25, 2025

Uh oh!

TheodorNEngoy commented Aug 26, 2025

Uh oh!

TheodorNEngoy commented Aug 26, 2025

Uh oh!

TheodorNEngoy commented Aug 26, 2025

Uh oh!

TheodorNEngoy commented Aug 26, 2025

Uh oh!

TheodorNEngoy commented Aug 26, 2025

Uh oh!

TheodorNEngoy commented Aug 27, 2025

Uh oh!

TheodorNEngoy commented Aug 27, 2025

Uh oh!

TheodorNEngoy commented Aug 27, 2025

Uh oh!

TheodorNEngoy commented Aug 27, 2025

Uh oh!

TheodorNEngoy commented Aug 27, 2025

Uh oh!

iambenkay commented Aug 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

TheodorNEngoy commented Aug 25, 2025 •

edited

Loading