FloWords can transcribe existing audio and video files, not just live recordings. Perfect for meetings, interviews, podcasts, and lectures.
| Format | Extension | Notes |
|---|
| WAV | .wav | Uncompressed, best quality |
| MP3 | .mp3 | Most common, widely compatible |
| M4A | .m4a | Apple format, good compression |
| AAC | .aac | Advanced audio coding |
| FLAC | .flac | Lossless compression |
| AIFF | .aiff | Apple lossless format |
| CAF | .caf | Core Audio Format |
| Format | Extension | Notes |
|---|
| MP4 | .mp4 | Most common video format |
| MOV | .mov | Apple QuickTime format |
- Open FloWords (click menu bar icon or use shortcut)
- Drag your audio/video file onto the FloWords window
- Wait for transcription to complete
- Copy or save the result
- Open FloWords
- Click File > Transcribe Audio File (or press ⌘O)
- Select your file in the dialog
- Wait for transcription to complete
- Review and use the transcription
- Right-click your audio/video file in Finder
- Select Open With > FloWords
- FloWords opens and begins transcription
- Review results when complete
Choose your transcription model before processing:
Use: Tiny or Base models
- Faster processing
- Good for quick reviews
- Lower accuracy on challenging audio
Use: Medium or Large models
- Best accuracy
- Better with accents and noise
- Slower processing time
Use: Small model
- Good balance
- Reasonable speed
- Solid accuracy
For best results, set the correct language:
- Go to Settings > Model
- Set Language to match your audio
- Or use Auto-Detect for multilingual content
| Duration | Recommendation |
|---|
| < 5 minutes | Any model, fast processing |
| 5-30 minutes | Small or Medium model |
| 30-60 minutes | Consider splitting or using cloud |
| > 1 hour | Split file or use cloud provider |
For very long recordings:
- Use an audio editor (Audacity, GarageBand)
- Split into 30-minute segments
- Transcribe each segment
- Combine results
During transcription:
- Progress bar shows completion percentage
- Estimated time remaining displayed
- Cancel button available if needed
Clear Audio
Clean recordings transcribe better. Reduce background noise when possible.
Single Speaker
Single-speaker content is easier to transcribe accurately.
Standard Speed
Normal speaking pace provides best results.
Good Volume
Avoid audio that’s too quiet or clipping.
- Remove silence at the beginning/end
- Normalize audio to consistent volume
- Reduce noise if heavily distorted
- Convert unusual formats to WAV or MP3
After transcription:
- Review for obvious errors
- Use AI Enhancement to fix grammar
- Add punctuation if needed
- Format as required (paragraphs, lists, etc.)
- Record meeting with Voice Memos or similar
- Export as M4A or MP3
- Drag into FloWords
- Get full meeting transcript
- Share with attendees
- Record interview (external recorder or phone)
- Transfer file to Mac
- Open in FloWords
- Transcribe with Large model for accuracy
- Edit and format as needed
- Download podcast episode
- Transcribe in FloWords
- Use AI Enhancement to summarize
- Create show notes or highlights
- Record lecture (with permission)
- Transcribe after class
- Use as study notes
- Search for specific topics
Currently, FloWords processes files one at a time. For multiple files:
- Queue files by adding them one after another
- Wait for each to complete
- Review results individually
Click Copy button to copy transcription text to clipboard.
Click Save to export as:
- Plain text (.txt)
- Markdown (.md)
Click Enhance to process with AI enhancement:
- Fix grammar and spelling
- Add punctuation
- Format as desired
- Convert file to WAV or MP3
- Use VLC or Audacity for conversion
- Check file isn’t corrupted
- Try a smaller file first
- Check available memory
- Restart FloWords
- Try a different model
- Use a larger model
- Check audio quality
- Set correct language
- Consider cloud provider for difficult audio
- File may be very long
- Switch to smaller model
- Check Activity Monitor for issues
- Consider splitting the file
- File size: Very large files may take considerable time
- Audio quality: Poor quality audio = poor transcription
- Languages: Best results with supported languages
- Specialized content: Technical jargon may need dictionary additions