Back to blog

BYOK vs. cloud: what really happens to your dictation privacy

BYOK or all-in-one cloud service for voice dictation? What actually changes for the privacy of your audio and transcribed text. A concrete comparison.

When you dictate on your Mac, your voice travels somewhere to be transcribed. The real question isn't “is it secure?” but “who sees my audio, and what do they do with it?” Depending on whether you use an all-in-one cloud service or a BYOK (Bring Your Own Key) approach, the answer is completely different. Here's what it means in practice.

BYOK vs. cloud: the difference in one sentence

  • All-in-one cloud service: you speak, the audio passes through the publisher's servers, which route it to an AI model, then send the text back to you. The publisher sits in the middle of everything.
  • BYOK: you supply your own API key (OpenAI, Gemini, Groq…). Your audio goes directly from your Mac to the AI provider. The app publisher is never on that path.

The distinction sounds technical, but it shifts your trust from one party to another. And when it comes to privacy, knowing who you're trusting changes everything.

What really happens with a cloud service

With an all-in-one service, the publisher manages the infrastructure for you. Convenient — but that comes with three things people often forget:

  1. Your audio travels through a third-party server. Even if the publisher promises to store nothing, the audio passes through their machine before reaching the model. You're taking their word for it.
  2. You depend on their retention policy. Logs, caching, “product improvement”: terms of service vary and can change without you noticing.
  3. Your data may be used for training. Many consumer offerings reserve the right to use your content to improve their models, unless you uncheck an option buried deep in the settings.

That's not necessarily dishonest. But it adds one more intermediary between your voice and the final text — an intermediary that sees everything go by.

What BYOK changes for your privacy

With BYOK, the app installed on your Mac talks to the AI provider using your key. The direct consequences:

  • One fewer party. The app publisher never receives your audio or your text. There's no longer an intermediary server you need to trust.
  • You choose your provider. OpenAI, Google, Groq… you keep the contractual relationship, so the privacy policy that applies is the one you agreed to.
  • Pro API terms are often better. Paid APIs from major providers generally don't train their models on submitted requests, unlike their free consumer apps. Always verify the terms for each provider, but the framework is clearer.

BYOK doesn't magically make your dictations invisible: your audio is still processed by an AI provider. But it removes the most opaque link in the chain.

Side-by-side comparison

CriterionAll-in-one cloudBYOK
Audio pathMac → publisher → AIMac → AI (direct)
Trusted intermediaryThe publisherNone beyond the AI provider
Applicable policyThe publisher'sYour chosen provider's
Choice of AI providerLocked inYour choice
CostSubscriptionYou pay the AI per use
Getting startedImmediatePaste an API key

BYOK requires a small upfront effort (create a key, paste it). In exchange, you gain transparency over where your data goes.

What about audio storage?

This is the most reassuring point, and the easiest to verify. A well-designed dictation app stores no audio files: your voice is transcribed on the fly, then the audio is discarded. No recording lands on any disk — not yours, not anyone else's. That's the approach Speech Flow takes: hold Ctrl, speak, the text is inserted at your cursor, and the audio is gone.

When comparing tools, always ask: “is the audio retained, and for how long?” Our comparison with Wispr Flow covers this architectural difference in detail.

FAQ

Is BYOK really more private than a cloud service?
Yes, on one key point: your audio no longer passes through the app publisher's servers. It goes directly to your AI provider. You replace two intermediaries with one — and that one is a provider whose terms you chose and accepted.

Are my API keys safe on my Mac?
With a native macOS app, the key is stored locally (ideally in the keychain) and is only used to talk to the AI provider. It never passes through any publisher server.

Is BYOK complicated to set up?
No. You create a key with OpenAI, Gemini, or Groq and paste it into the app. Once done, you forget about it. That's the only extra step compared to an all-in-one service.

In summary

If the privacy of your dictations matters, BYOK removes the most opaque intermediary and gives you back control over which policy applies. Speech Flow is a native macOS app (~50 MB) using BYOK at €69 lifetime: your keys, no audio stored, clean text inserted at your cursor. If you'd rather not manage keys, an all-inclusive option is available too. The details are on the pricing page — choose the trade-off that works for you.