Model Specifications
Detailed technical specifications for the transcription models included in FloWords.
Local Models Overview
FloWords includes three local transcription engines. All run entirely on your Mac and are multilingual.
| Model | Engine | Download | RAM | WER | Latency |
|---|---|---|---|---|---|
| Whisper Turbo (default) | OpenAI • Q5_0 | ~547 MB | ~2 GB | ~7-8% | ~200-800 ms |
| Parakeet V3 | NVIDIA • INT8 | ~640 MB | ~2 GB | ~6.34% | ~50-200 ms |
| Apple Speech | Native macOS | None | Minimal | ~8% | ~100-500 ms |
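The WER column reports word error rate: the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference text, divided by the number of reference words. The sketch below shows that standard definition using a word-level edit distance. It is only an illustration; the function name is hypothetical, and the text normalization (lowercasing, whitespace splitting) is an assumption that may differ from how the figures in the table were measured.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from the transcript
                curr[j - 1] + 1,     # insertion: extra word in the transcript
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("setting") and one deletion ("now") over 5 reference words.
    print(word_error_rate("open the settings panel now", "open the setting panel"))  # 0.4
```

Read this way, a WER of ~8% means roughly 8 word errors per 100 reference words.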
Model Details
Whisper Turbo
- Name: Whisper Turbo
- Engine: OpenAI Whisper (Large v3 Turbo, Q5_0)
- Backend: whisper.cpp (optimized for Apple Silicon)
- Download: ~547 MB
- Memory Usage: ~2 GB
- Accuracy: ~7-8% WER
- Latency: ~200-800 ms
- Languages: Multilingual

Best for:
- General daily use
- Best balance of accuracy and speed
- Recommended default engine
Parakeet V3
- Name: Parakeet V3
- Engine: NVIDIA Parakeet (via FluidAudio)
- Quantization: INT8
- Download: ~640 MB
- Memory Usage: ~2 GB
- Accuracy: ~6.34% WER
- Latency: ~50-200 ms
- Languages: Multilingual (English + European)

Best for:
- Fast, low-latency dictation
- When speed is the priority
- Good performance with moderate resources
Apple Speech
- Name: Apple Speech
- Engine: Native macOS Speech (SFSpeechRecognizer)
- Download: None (built into macOS)
- Processing: On-device
- Accuracy: ~8% WER (less accurate than Whisper)
- Latency: ~100-500 ms
- Languages: Multilingual

Best for:
- Quick drafts
- No download required
- Speed and privacy over precision
Language Support
- Whisper Turbo - multilingual, supports 99+ languages with automatic detection
- Parakeet V3 - English and European languages
- Apple Speech - multilingual (Arabic, German, English, Spanish, French, Italian, Japanese, Korean, Portuguese, Chinese, and more)
Model Performance by Hardware
Apple Silicon (Recommended)
- Hardware acceleration for all three models
- ~2-3x faster than Intel
- Minimal battery impact
- All three models run great
Intel Macs
- Slower than Apple Silicon, higher CPU usage
- Parakeet V3 or Apple Speech recommended for better speed
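The hardware guidance above can be expressed as a small lookup on the CPU architecture. This is a rough sketch, not part of FloWords: the function name is hypothetical, and it assumes the architecture string comes from Python's `platform.machine()`, which typically reports `arm64` on Apple Silicon and `x86_64` on Intel Macs (an x86_64 Python build running under Rosetta will also report `x86_64`).

```python
import platform


def suggested_engines(machine: str) -> list[str]:
    """Map a CPU architecture string to the engines suggested in the hardware notes."""
    if machine.lower() in ("arm64", "aarch64"):
        # Apple Silicon: all three engines are hardware accelerated.
        return ["Whisper Turbo", "Parakeet V3", "Apple Speech"]
    # Intel and anything else: prefer the engines recommended for speed.
    return ["Parakeet V3", "Apple Speech"]


if __name__ == "__main__":
    print(suggested_engines(platform.machine()))
```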
Audio Specifications
Input Requirements
| Specification | Value |
|---|---|
| Sample Rate | 16000 Hz |
| Bit Depth | 16-bit |
| Channels | Mono |
| Format | PCM |
FloWords automatically converts audio to these specifications.
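To make the target format concrete, here is a minimal sketch of the three steps such a conversion involves: downmixing to mono, resampling to 16,000 Hz, and quantizing to 16-bit PCM. It is not FloWords' actual pipeline. All function names are hypothetical, the input is assumed to be float samples in the range -1.0 to 1.0, and the linear-interpolation resampler has no anti-aliasing filter, so it is suitable for illustration only.

```python
import struct

TARGET_RATE = 16_000  # Hz, from the Input Requirements table


def to_mono(frames):
    """Average the channels of each frame into a single mono sample."""
    return [sum(frame) / len(frame) for frame in frames]


def resample_linear(samples, src_rate, dst_rate=TARGET_RATE):
    """Resample by linear interpolation between neighbouring samples."""
    if not samples or src_rate == dst_rate:
        return list(samples)
    out_len = int(len(samples) * dst_rate / src_rate)
    out = []
    for n in range(out_len):
        pos = n * src_rate / dst_rate  # position in the source signal
        i = int(pos)
        frac = pos - i
        nxt = samples[min(i + 1, len(samples) - 1)]
        out.append(samples[i] * (1 - frac) + nxt * frac)
    return out


def to_pcm16(samples):
    """Clamp floats to [-1.0, 1.0] and pack them as signed 16-bit little-endian PCM."""
    ints = [round(max(-1.0, min(1.0, s)) * 32767) for s in samples]
    return struct.pack(f"<{len(ints)}h", *ints)


def normalize(frames, src_rate):
    """Multi-channel float frames at src_rate -> 16 kHz mono 16-bit PCM bytes."""
    return to_pcm16(resample_linear(to_mono(frames), src_rate))
```

With this format, one second of audio is 16,000 samples of 2 bytes each, or 32,000 bytes, so a minute of speech occupies about 1.9 MB before transcription.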
Supported Input Formats
| Format | Extension | Notes |
|---|---|---|
| WAV | .wav | Native support |
| MP3 | .mp3 | Converted to WAV |
| M4A | .m4a | Converted to WAV |
| AAC | .aac | Converted to WAV |
| FLAC | .flac | Converted to WAV |
| AIFF | .aiff | Converted to WAV |
| CAF | .caf | Converted to WAV |
| MP4 | .mp4 | Audio extracted |
| MOV | .mov | Audio extracted |
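For scripts that pre-sort files before handing them to a transcription tool, the table above can be mirrored as a lookup from file extension to handling step. The mapping values and the function name below are hypothetical labels chosen for this sketch; only the extensions and their grouping come from the table.

```python
from pathlib import Path

# Handling per extension, mirroring the Notes column of the table above.
FORMAT_HANDLING = {
    ".wav": "native",
    ".mp3": "convert_to_wav",
    ".m4a": "convert_to_wav",
    ".aac": "convert_to_wav",
    ".flac": "convert_to_wav",
    ".aiff": "convert_to_wav",
    ".caf": "convert_to_wav",
    ".mp4": "extract_audio",
    ".mov": "extract_audio",
}


def handling_for(filename: str) -> str:
    """Return the handling step for a file, based on its extension (case-insensitive)."""
    ext = Path(filename).suffix.lower()
    if ext not in FORMAT_HANDLING:
        raise ValueError(f"unsupported input format: {filename!r}")
    return FORMAT_HANDLING[ext]
```

For example, `handling_for("Interview.MP3")` returns `"convert_to_wav"`, while a file such as `notes.ogg` raises `ValueError` because that extension is not in the table.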
Model Selection Guide
By Use Case
| Use Case | Recommended Model |
|---|---|
| Daily use | Whisper Turbo |
| Maximum speed | Parakeet V3 |
| No download / quick drafts | Apple Speech |
| Highest accuracy | Whisper Turbo |
By Situation
| Situation | Recommended |
|---|---|
| Clear speech | Any |
| Low latency needed | Parakeet V3 |
| Offline, no setup | Apple Speech |
| Technical terms | Whisper Turbo + dictionary |
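The situation table above also works as a simple lookup. The sketch below encodes it as data with a fallback to the default engine (Whisper Turbo). The names `BY_SITUATION` and `recommend` are hypothetical, and the fallback behavior is an assumption made for this example rather than something FloWords exposes.

```python
ALL_ENGINES = ("Whisper Turbo", "Parakeet V3", "Apple Speech")

# Keys and values mirror the "By Situation" table.
BY_SITUATION = {
    "clear speech": ALL_ENGINES,
    "low latency needed": ("Parakeet V3",),
    "offline, no setup": ("Apple Speech",),
    "technical terms": ("Whisper Turbo",),  # pair with a custom dictionary
}


def recommend(situation: str) -> tuple[str, ...]:
    """Return the engines recommended for a situation, or the default engine."""
    key = situation.strip().lower()
    return BY_SITUATION.get(key, ("Whisper Turbo",))
```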
Next Steps
- Download Models to get started
- Configure Settings for your needs
- Review Best Practices for optimal use