hachyderm.io is one of the many independent Mastodon servers you can use to participate in the fediverse.
Hachyderm is a safe space, LGBTQIA+ and BLM, primarily comprised of tech industry professionals world wide. Note that many non-user account types have restrictions - please see our About page.

Administered by:

Server stats:

10K
active users

For the past couple of months I've been working on Pied (pied.mikeasoft.com), an application that makes it easy to use modern, natural sounding, text-to-speech voices on Linux. It does this by integrating the Piper neural text-to-speech engine with speech-dispatcher, so most existing software will work with it out of the box.

The first beta version is now available in the snap store: snapcraft.io/pied

And available as a Flatpak from pied.mikeasoft.com

(Other package formats will follow)

I'd appreciate any feedback if you're able to test it, thanks!

Musharraf :verified:

@mikesheldon

Nice work with Pied.
FWIW: I'm developing Sonata, a frontend for Piper (and other TTS models).
Recently, I implemented streaming support in Piper models:
PR: github.com/rhasspy/piper/pull/
This feature is currently supported in Sonata, and it improves latency considerably.
In my Win 11 PC, it takes less than 170 seconds to generate the first chunk of audio.

Sonata repo: github.com/mush42/sonata

GitHubONNX streaming support by mush42 · Pull Request #255 · rhasspy/piperBy mush42