Skip to content

Transcription Overview

FloWords uses state-of-the-art AI models to convert your voice into text. All processing happens locally on your Mac, ensuring complete privacy.

When you speak, FloWords:

  1. Captures audio from your selected microphone
  2. Processes it locally using AI models on your Mac
  3. Transcribes to text with high accuracy
  4. Pastes automatically where your cursor is

FloWords offers multiple transcription approaches:

Local Models

Whisper and Parakeet models run entirely on your Mac. No internet required, complete privacy.

Cloud Providers

Optional cloud services for enhanced accuracy or speed. Requires API keys and internet.


AspectLocal ModelsCloud Providers
Privacy100% privateData sent to servers
InternetNot requiredRequired
CostFree (included)Pay per usage
SpeedDepends on MacGenerally faster
AccuracyVery good to excellentExcellent

FloWords includes three local transcription engines. All run on your Mac and are multilingual.

ModelEngineDownloadRAMLatency
Whisper Turbo (default)OpenAI • Q5_0~547 MB~2 GB~200-800 ms
Parakeet V3NVIDIA • INT8~640 MB~2 GB~50-200 ms
Apple SpeechNative macOSNoneMinimal~100-500 ms
  • Whisper Turbo - recommended default, best balance of accuracy and speed
  • Parakeet V3 - fastest, lowest latency
  • Apple Speech - no download, on-device, great for quick drafts

For users who prefer cloud transcription:

  • OpenAI Whisper API - High accuracy, reliable
  • Groq - Ultra-fast transcription
  • Deepgram - Real-time streaming
  • Google Gemini - Multimodal capabilities
  • ElevenLabs - Speech recognition
  • Mistral - European AI provider
  • Soniox - Multilingual async transcription

FloWords includes an intelligent fallback system:

  1. Primary: Your selected model (local or cloud)
  2. Secondary: Alternative model if primary fails
  3. Tertiary: Apple’s native Speech Recognition

This ensures you always get a transcription, even if your preferred method encounters issues.


FloWords can transcribe from:

  • Live microphone - Real-time dictation
  • Audio files - WAV, MP3, M4A, AAC, FLAC, AIFF, CAF
  • Video files - MP4, MOV (extracts audio)

Whisper models support 99+ languages including:

  • English, Spanish, French, German
  • Chinese, Japanese, Korean
  • Arabic, Hindi, Portuguese
  • And many more…

Set your language in Settings > Model > Language or enable auto-detection.