Voice Interface Pack
When your agent needs to accept spoken input or deliver spoken output in any workflow.
What It Does
This pack provides a complete voice I/O pipeline for agent workflows: recording microphone input, transcribing speech with Whisper (local or API), and synthesizing spoken responses with OpenAI TTS or ElevenLabs. It replaces roughly $2.60 in tokens that would otherwise go to STT integration research, TTS provider comparison, PyAudio recording patterns, and streaming playback setup.
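As a rough illustration of the record, transcribe, respond, synthesize flow described above, here is a minimal sketch. It is not the pack's actual code: `VoicePipeline`, the type aliases, and the `fake_*` backends are hypothetical names invented for this example. In a real setup the callables would presumably wrap PyAudio capture, a Whisper transcriber, and an OpenAI TTS or ElevenLabs client.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical type aliases for illustration; not taken from the pack.
Recorder = Callable[[], bytes]        # returns raw audio captured from the microphone
Transcriber = Callable[[bytes], str]  # speech-to-text, e.g. a Whisper wrapper
Responder = Callable[[str], str]      # the agent itself: text in, text out
Synthesizer = Callable[[str], bytes]  # text-to-speech, e.g. an OpenAI TTS or ElevenLabs wrapper


@dataclass
class VoicePipeline:
    record: Recorder
    transcribe: Transcriber
    respond: Responder
    synthesize: Synthesizer

    def run_turn(self) -> tuple[str, str, bytes]:
        """Run one voice turn and return (transcript, reply text, reply audio)."""
        audio_in = self.record()
        transcript = self.transcribe(audio_in).strip()
        if not transcript:
            # Nothing intelligible was heard, so skip the agent and TTS calls.
            return "", "", b""
        reply = self.respond(transcript)
        return transcript, reply, self.synthesize(reply)


# Stand-in backends so the sketch runs without a microphone or API keys.
def fake_record() -> bytes:
    return b"\x00\x01" * 8


def fake_transcribe(audio: bytes) -> str:
    return " what time is it " if audio else ""


def fake_respond(text: str) -> str:
    return f"You asked: {text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_record, fake_transcribe, fake_respond, fake_synthesize)
```

Keeping each stage behind a plain callable means a local Whisper model can be swapped for an API call, or one TTS provider for another, without touching the turn logic.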
#voice #tts #speech #whisper #audio
exec tool access can install directly.
Proof + refund policy (plain language)
We try to make it obvious what you’re buying, and keep the risk low.
- Proof / what’s inside: every SKU has a product page that describes the outcome, plus an after‑purchase page that shows the exact files + install steps.
- Delivery: after Stripe checkout, you get a download page link. No account required.
- Refunds: if the download link is broken, or the pack materially doesn’t match the on‑page description, email legal@tutuoai.com within 7 days for a full refund.
(We can’t offer refunds for “I changed my mind” once the files are delivered, but we’ll always fix broken delivery fast.)
090df6e3c05f6d6d…ed7728a0
Related Skills
- ElevenLabs Voice Synthesis Guide for Agents (FREE): Use when an agent needs to generate spoken audio from text: voice notifications...
- Whisper (Local STT) Skill for OpenClaw ($1.00): Use when an agent needs to transcribe audio or video files privately on-device w...
- Whisper API (STT) Skill for OpenClaw ($1.00): Use when an agent needs reliable cloud-based transcription of audio files and al...