Initial audio ducking implementation by tylxr59 · Pull Request #380 · Stypox/dicio-android

tylxr59 · 2025-12-17T14:08:02Z

First initial implementation of audio ducking via AudioFocusManager. Looking for feedback on how it is structured and functions.

This functions through AUDIOFOCUS_GAIN_TRANSIENT_MAY_DUCK which will lower any background audio during the user's interaction with Dicio. Audio ducking starts in SttInputDeviceWrapper when it detects a listening state and is held until TTS finishes in AndroidTtsSpeechDevice.onDone().

I've added some fallback releases in VoskInputDevice.stopListening() (when the user taps the mic button to cancel an interaction), MainActivity.onStop() (when the user leaves the app), and when a skill errors out.

I believe this covers all cases but could definitely use some help testing this implementation.

I've merged this into my test build - https://github.com/tylxr59/dicio-android/tree/tylxrs-build

Resolves #363

Stypox

Good idea, this looks simple enough, thanks!

I found this bug while playing music in background with NewPipe. If Dicio does not understand what I said, it says "Could you repeat", releases audio focus, and starts the stt again. When the STT restarts, there is no audio focus. Though note that NewPipe's reaction to ducking may be buggy itself soooo idk, which app would you suggest to test this with?

One thing I would suggest to reduce having to worry too much about when to release audio focus and when not to (e.g. because the next part of the workflow might need it) is to debounce by, say, 100ms the transition from focused to not focused. It might also help solve the bug above, and also avoid keeping the audio focus uselessly for a long time in case a skill takes long to produce output.

This can be achieved with a new variable shouldRequestFocus: Flow<Boolean> in AudioFocusManager, and then.

shouldRequestFocus.mapLatest { shouldBeFocused ->
    if (!shouldBeFocused) {
        delay(100ms);
    }
    return@mapLatest shouldBeFocused;
}.forEach { shouldBeFocused ->
    if (shouldBeFocused) requestFocus() else releaseFocus()
}

Stypox · 2026-02-20T13:25:42Z

app/src/main/kotlin/org/stypox/dicio/io/speech/AndroidTtsSpeechDevice.kt

+class AndroidTtsSpeechDevice(
+    private var context: Context,
+    locale: Locale,
+    private val audioFocusManager: AudioFocusManager


Can't you use runnablesWhenFinished instead? And I guess also call those in onError then

Stypox · 2026-02-20T13:31:15Z

app/src/main/kotlin/org/stypox/dicio/io/audio/AudioFocusManager.kt

+        if (!hasFocus) {
+            return
+        }


What if you remove this? Just in case it becomes out of sync with Android's state and then the audio focus never gets released anymore.

Stypox · 2026-02-20T13:32:07Z

app/src/main/kotlin/org/stypox/dicio/io/audio/AudioFocusManager.kt

+    @Synchronized
+    fun onTtsStarted() {
+        if (!hasFocus) {
+            Log.d(TAG, "TTS started without audio focus, requesting now")


Suggested change

Log.d(TAG, "TTS started without audio focus, requesting now")

Log.w(TAG, "TTS started without audio focus, requesting now")

Stypox · 2026-02-20T13:32:54Z

app/src/main/kotlin/org/stypox/dicio/io/audio/AudioFocusManager.kt

+            return
+        }
+
+        if (Build.VERSION.SDK_INT >= Build.VERSION_CODES.O) {


Where did you find this code? Can you add a comment with a link to documentation?

Initial audio ducking implementation

4b3e876

AlbatorLaho mentioned this pull request Jan 11, 2026

Stop playback while listening #204

Open

Stypox reviewed Feb 20, 2026

View reviewed changes

Stypox linked an issue Feb 23, 2026 that may be closed by this pull request

Stop playback while listening #204

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Initial audio ducking implementation#380

Initial audio ducking implementation#380
tylxr59 wants to merge 1 commit intoStypox:masterfrom
tylxr59:add-audio-ducking

tylxr59 commented Dec 17, 2025

Uh oh!

Stypox left a comment

Uh oh!

Stypox Feb 20, 2026

Uh oh!

Stypox Feb 20, 2026

Uh oh!

Stypox Feb 20, 2026

Uh oh!

Stypox Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	Log.d(TAG, "TTS started without audio focus, requesting now")
	Log.w(TAG, "TTS started without audio focus, requesting now")

Uh oh!

Conversation

tylxr59 commented Dec 17, 2025

Uh oh!

Stypox left a comment

Choose a reason for hiding this comment

Uh oh!

Stypox Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

Stypox Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

Stypox Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

Stypox Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants