Download on iOS & Android
- Transcription – Turn voice messages into text instantly
- Summarization – Skip the "uhh" & "ahh" and get to the key points
- Translation – Break language barriers with a single tap
- Auto-detection – Get notified when new messages arrive
- Flutter (Hive, GoRouter, Riverpod, Pigeon for cleaner method channels)
- Groq (Whisper)
- DeepL for translation
- Fastlane for publishing
- Firebase (Analytics, Crashlytics, Storage for user feedback)
- The app checks for new voice message files when opened (manual trigger available).
- Transcriptions are displayed in a list with a summary option.
- Users can enable notification listening for automatic detection.
- If listening is enabled (or the app is in the background), a notification with the transcription is sent.
- On iOS, users must share the audio file manually since files are not programmatically accessible.
Detailed Version
WhatsApp stores two types of audio files:
- Voice Notes (standard WhatsApp voice messages)
- Audio Files (shared audio not recorded in WhatsApp)
Trim Talk focuses on voice notes, which are stored under:
`/storage/emulated/0/WhatsApp/Media/WhatsApp Voice Notes/`
Each week’s messages are stored in folders named `YEAR-WEEKNUMBER`.
However, WhatsApp does not follow ISO week numbering, requiring a workaround:
- If the expected week's folder is missing, the app checks the previous week instead (see the sketch below).
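As a rough illustration, here is a minimal Dart sketch of that fallback. It assumes an ISO-style `YEAR-WEEKNUMBER` name as the first guess; the helper names are mine, not the app's.

```dart
/// Naive week number: days since Jan 1 split into 7-day blocks.
/// WhatsApp's own numbering can differ, which is exactly why the
/// fallback below exists.
int weekOfYear(DateTime date) {
  final firstDay = DateTime(date.year, 1, 1);
  return date.difference(firstDay).inDays ~/ 7 + 1;
}

/// Candidate folder names to try, newest first: this week, then last week.
/// On Android 13+ the actual existence check has to go through SAF.
List<String> candidateWeekFolders(DateTime now) {
  return [now, now.subtract(const Duration(days: 7))]
      .map((d) => '${d.year}-${weekOfYear(d)}')
      .toList();
}
```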
Since these files are in another app's dedicated folder, access varies by Android version:
- Android 12 and below: requires the `READ_EXTERNAL_STORAGE` permission.
- Android 13+: three options, but only the Storage Access Framework (SAF) works:
  - Media Store: not possible, because a `.nomedia` file prevents indexing.
  - `MANAGE_EXTERNAL_STORAGE`: restricted to file manager apps.
  - SAF: requires the user to select the folder; files are then accessed via the content resolver.
SAF is not well-supported in Flutter. Existing packages failed, so I wrote custom method channel calls to access files.
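For illustration, the Dart side of such a channel could look roughly like this. The channel and method names are hypothetical, and the native side would be expected to list and read documents through the content resolver:

```dart
import 'dart:typed_data';

import 'package:flutter/services.dart';

// Hypothetical channel name; not necessarily what Trim Talk uses.
const MethodChannel _safChannel = MethodChannel('trimtalk/saf');

/// Lists voice note URIs inside the folder the user granted access to
/// via ACTION_OPEN_DOCUMENT_TREE.
Future<List<String>> listVoiceNotes(String treeUri) async {
  final uris = await _safChannel.invokeListMethod<String>(
    'listFiles',
    {'treeUri': treeUri, 'extension': 'opus'},
  );
  return uris ?? const [];
}

/// Reads a single document's bytes through the content resolver on the
/// native side.
Future<Uint8List?> readVoiceNote(String documentUri) {
  return _safChannel.invokeMethod<Uint8List>('readFile', {'uri': documentUri});
}
```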
Initially, I aimed for on-device transcription, testing multiple solutions:
- Android Speech-to-Text API (incompatible with audio files).
- Whisper (various implementations: TensorFlow, Mediapipe, method channels, etc.)
- None provided an optimal balance of performance and accuracy due to mobile hardware limitations.
Cloud-based APIs were tested next:
- Deepgram, Google, AssemblyAI, and OpenAI Whisper worked, but were slow, inaccurate, or expensive.
- Groq
  - Uses an LPU™ Inference Engine, accelerating open-source models like `whisper-large-v3`.
  - Fast, accurate, and cost-efficient.
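For reference, a transcription call could look roughly like this, using Groq's OpenAI-compatible audio endpoint. The URL and field names follow the OpenAI transcription API; treat them as assumptions and check Groq's docs before relying on them.

```dart
import 'dart:convert';
import 'dart:io';

import 'package:http/http.dart' as http;

/// Sends an audio file to Groq's whisper-large-v3 and returns the text.
Future<String> transcribe(File audio, String apiKey) async {
  final request = http.MultipartRequest(
    'POST',
    Uri.parse('https://api.groq.com/openai/v1/audio/transcriptions'),
  )
    ..headers['Authorization'] = 'Bearer $apiKey'
    ..fields['model'] = 'whisper-large-v3';
  request.files.add(await http.MultipartFile.fromPath('file', audio.path));

  final response = await http.Response.fromStream(await request.send());
  if (response.statusCode != 200) {
    throw Exception(
        'Transcription failed (${response.statusCode}): ${response.body}');
  }
  // The OpenAI-style response is a JSON object with a "text" field.
  return jsonDecode(response.body)['text'] as String;
}
```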
To avoid requiring manual checks:
- Used the workmanager package for background tasks (every 15 min), transcribing files and displaying notifications (see the sketch after this list).
- However, method channels do not work in background tasks due to separate isolates.
- Tried multiple alternatives without success (likely possible natively, but not in Flutter).
- Best workaround: Notification Listener Service to trigger processing (not ideal due to reliability and permissions required).
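A stripped-down sketch of the workmanager setup described above. The task names are made up, and the hard part (reaching plugins and method channels from the background isolate) is deliberately left out:

```dart
import 'package:workmanager/workmanager.dart';

@pragma('vm:entry-point')
void callbackDispatcher() {
  Workmanager().executeTask((task, inputData) async {
    // Look for new voice notes, transcribe them, show a notification.
    // In practice this is where the isolate/method-channel limitation bites.
    return true;
  });
}

Future<void> scheduleBackgroundCheck() async {
  await Workmanager().initialize(callbackDispatcher);
  await Workmanager().registerPeriodicTask(
    'trimtalk-voice-note-check', // unique name (hypothetical)
    'checkVoiceNotes', // task name (hypothetical)
    frequency: const Duration(minutes: 15),
  );
}
```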
- Flutter is powerful but has limitations with native/platform-specific features.
- Android's developer experience is not always fun.
- Groq is an excellent transcription solution.
- Deep understanding of isolates, method channels, and the Flutter engine.
- Experience in building, publishing, and maintaining Flutter packages and apps.
- Generate Hive adapters: `dart run build_runner build -d`
- Generate the Pigeon API: `dart run pigeon --input pigeon_api.dart` (see the sketch after this list)
- Check the `scripts` folder (fix Pods, publish, etc.)
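As an illustration of the Pigeon step, a `pigeon_api.dart` input file might look something like this. The project's actual API surface isn't shown in this README, so the class and method names are hypothetical:

```dart
import 'package:pigeon/pigeon.dart';

/// Host (Android/iOS) API implemented in native code and called from Dart.
@HostApi()
abstract class VoiceNoteApi {
  /// Lists voice note URIs inside the SAF tree the user granted access to.
  List<String> listVoiceNotes(String treeUri);

  /// Copies one voice note into the app's cache and returns the local path.
  String copyToCache(String uri);
}
```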
- Notifications always appear.
- WorkManager debug notifications are visible.
- Transcriber returns dummy data.
- The read-file permission check always returns true (Android).
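One plausible way the transcriber switch could be wired (the class names here are illustrative, not the app's real ones):

```dart
import 'package:flutter/foundation.dart' show kDebugMode;

abstract class Transcriber {
  Future<String> transcribe(String path);
}

/// Returns canned text so debug builds never hit the API.
class DummyTranscriber implements Transcriber {
  @override
  Future<String> transcribe(String path) async => 'dummy transcription';
}

class GroqTranscriber implements Transcriber {
  @override
  Future<String> transcribe(String path) async {
    throw UnimplementedError('real Groq call goes here');
  }
}

// Debug builds get the dummy implementation automatically.
final Transcriber transcriber =
    kDebugMode ? DummyTranscriber() : GroqTranscriber();
```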
Download on iOS and Android or share the Universal Link.
I built this project in my free time. If you'd like to support it, consider contributing here. Thank you! :)