Christian Lawson-Perfect @christianp

0 posts0 participants0 posts today

**Verfassungklage@troet.cafe** @Verfassungklage@troet.cafe · Apr 7

Verfassungklage@troet.cafe @Verfassungklage@troet.cafe

Bei der Recherche für einen Artikel über #Text2Speech und #Speech2Text unter #Linux bin ich auf die kleine App Speech Note gestoßen, nicht zu verwechseln mit dem proprietären SpeechNotes. Insofern ist der Name nicht wirklich clever gewählt. Clever ist dagegen das Konzept der noch jungen Anwendung.

Speech Note ist eine vielseitige Anwendung für Notizen, die durch ihre Funktionen und Datenschutzorientierung hervorsticht.

https://linuxnews.de/speech-note-notizen-und-mehr/

LinuxNews.de · Apr 7Speech Note – Notizen und mehr

More from

LinuxNews.de

**Parleur** @parleur@mastodon.parleur.net · Mar 8

Mar 8

Parleur @parleur@mastodon.parleur.net

Ça existe les applications libres de #speech2text ?

**utzer [Pleroma]** @utzer@soc.utzer.de · Jan 7

Jan 7

utzer [Pleroma] @utzer@soc.utzer.de

Wenn eins ein paar Audiodateien #transkribieren wollen würde, wie würde eins das auf #Debian machen? Ich hatte mal ein Tool irgendwo gefunden, dass das in CLI konnte und dann den Text meine ich sogar mit den Sprechern ausgeworfen hat. Vielleicht hab ich das Tool sogar noch installiert.

Offline wäre super.

#Speech2Text #VoiceRecognition

#voicerecognition

**Sharon Machlis** @smach@masto.machlis.com · Nov 7, 2024

Nov 7, 2024

Sharon Machlis @smach@masto.machlis.com

New open-source speech-to-text model Moonshine “returns results faster and more efficiently than the current state of the art, OpenAI’s Whisper, while matching or exceeding its accuracy” one of its creators says. “Key improvements are an architecture that offers an overall 1.7x speed boost compared to Whisper, and a flexibly-sized input window.”

Blog post by Pete Warden: https://petewarden.com/2024/10/21/introducing-moonshine-the-new-state-of-the-art-for-speech-to-text/

GitHub: https://github.com/usefulsensors/moonshine
Paper: https://arxiv.org/abs/2410.15608

Pete Warden's blog · Oct 21, 2024Introducing Moonshine, the new state of the art for speech to textCan you imagine using a keyboard where it took a key press two seconds to show up on screen? That’s the typical latency for most voice interfaces, so it’s no wonder they’ve failed…

#GenAI #speech2text

**Christopher Stark** @christopherstark@mastodon.social · Sep 1, 2024

Sep 1, 2024

Christopher Stark @christopherstark@mastodon.social

SpeechNote:
Best open-source Text-To-Speach-Program around:

https://flathub.org/apps/net.mkiol.SpeechNote

#opensource #linux #speech2text

**Alexis Perrier** @alexisperrier@sigmoid.social · Jun 25, 2024

Jun 25, 2024

Alexis Perrier @alexisperrier@sigmoid.social

I'm extracting speech from audio files in French using Wav2Vec2.
the result is really not great, barely readable
"nerla sene reste trop oulué pour les épreuves notiques des gios "

But adding a LLM layer to correct it works like a charm
"La Seine reste trop polluée pour les épreuves nautiques des JO."

So much time saved. No need to tinker with the models and audio anymore.
#speech2text #data #audio

**Dietmar** @dzb@mastodon.world · Mar 9, 2024

Mar 9, 2024

Dietmar @dzb@mastodon.world

The implementation of the new @deepgramai „nova-2“ speech recognition model in my self-developed #app "Anruf Fee" has now brought me the hoped for #speech2text improvements for the #German language in this app.

The attached example of an incoming spam call shows how well it works. It saves me having to answer annoying unknown callers, but at the same time ensures that I don't miss anything important by recognizing the caller's topic.
https://apps.apple.com/de/app/anruf-fee/id6443781534

**Kettwachsler** @phpmacher@sueden.social · Nov 7, 2023

Nov 7, 2023

Kettwachsler @phpmacher@sueden.social

So eine #Leuchtschrift hinten am Rücken, die während des #Radfahrens dort anzeigt, was ich gerade sage.

DAS fände ich cool.

Quasi #Speech2Text for Dödel behind!

**Tykayn** @tykayn@mastodon.cipherbliss.com · May 21, 2023

May 21, 2023

Tykayn @tykayn@mastodon.cipherbliss.com

vous auriez de bons moteurs de text to speech qui font des voix propres et sans fucking accent anglais ? (du texte transformé en voix, pas l'inverse hein)
payant ou non, tant que y'a pas de services de GAFAM dedans et que la voix est vraiment propre sans effet de voix de robot ça m'intéresse.
#a11y #accessibilité #vocalisation #stt #speech2text

**tobozo** @tobozo@mastodon.social · Mar 2, 2023 *

Mar 2, 2023 *

tobozo @tobozo@mastodon.social

what happens when you get whisper.cpp to listen to #chiptunes?

both speak at 16KHz so they should understand each other, right?

https://github.com/ggerganov/whisper.cpp

Track: Alpha by @lukhash

#C64 #MOS6581 #Speech2Text

**frin** @frin@mastodon.online · Feb 5, 2023

Feb 5, 2023

frin @frin@mastodon.online

There is a free #opensource tool called #Whisper, based on OpenAI. It can convert speech to text even in offline mode. it works with many languages and can even translate output to English, generate subtitles and more. It only works on WAV files, but you can convert to WAV with ffmpeg. Make sure to download and use the large model for best results, you will have to compile the program yourself: https://github.com/ggerganov/whisper.cpp #AI #speech2text

GitHubGitHub - ggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++Port of OpenAI's Whisper model in C/C++. Contribute to ggerganov/whisper.cpp development by creating an account on GitHub.

**ccchris** @ccchris@chaos.social · Jan 28, 2023

Jan 28, 2023

ccchris @ccchris@chaos.social

Gibt es datenschutzfreundliche #Speech2Text-Lösungen für #Android. Eigener Server mit #Whisper käme für mich in Frage. 100% Offline wäre das schönste. Fremde Server scheiden aus (#Datenschutz). #askfedi

**Dervishe the Grey** @dervishe@mastodon.sdf.org · Jan 26, 2023

Jan 26, 2023

Dervishe the Grey @dervishe@mastodon.sdf.org

Connaissez-vous un bon soft de transcription de parole #speech2text libre ?
Ce serait pour retranscrire des interviews

**amaz1ng** @amaz1ng@social.anon-groups.de · Jul 1, 2022

Jul 1, 2022

amaz1ng @amaz1ng@social.anon-groups.de

#Speech2Text

Eine Freundin hat fortschreitende Artrose in den Handgelenken & war früher eine fleißige Schreiberin.

Heute leidet sie alleine bei der Beantwortung einer einzigen E-Mail.

Welches Tool kennt ihr, Möglichst Open-Source & Offline, um Texte aus Sprache zu transkribieren?

#Fedipower

Continued thread

**Lapineige** @Lapineige@mamot.fr · Mar 9, 2022

Mar 9, 2022

Lapineige @Lapineige@mamot.fr

Sur le site de Vosk (https://alphacephei.com/vosk/models), je vois deux gros modèles, celui de Vosk, et un de #LINTO, et sur le site de LINTO (https://doc.linto.ai/#/services/linstt_download) il y a :
- des modèles v1, 4 différents
- des modèles v2 (a priori meilleurs ?), dont 2 "acoustic models" et 2 "decoding graphs".

Lesquels servent à quoi ?

À l'usage, vous en avez trouvé un meilleur ? (et en quoi ?)
Ou c'est pareil ?

Je ne m'y retrouve pas dans tous ces modèles de #Speech2Text

VOSK Offline Speech Recognition APIVOSK ModelsAccurate speech recognition for Android, iOS, Raspberry Pi and servers with Python, Java, C#, Swift and Node.

**Lapineige** @Lapineige@mamot.fr · Mar 9, 2022

Mar 9, 2022

Lapineige @Lapineige@mamot.fr

Question pour les gens qui ont testé la transcription automatique de la parole (#Speech2Text) intégrée à #Kdenlive : quel modèle #Vosk choisir ?

Je compte prendre un des "gros" modèle pour avoir la meilleure fidélité de la retranscription.

Mais pour retranscrire du français, quel modèle est le plus performant ?

Plus de détails

**Couscous** @couscous@mamot.fr · Jan 23, 2021

Jan 23, 2021

Couscous @couscous@mamot.fr

@Chocobozzz (et peut-être) @booteille

Hey hey, j'avais une petite question à propos de #peertube. En m'impliquant sur #commonvoice de Mozilla, je me suis demandé si des projets s'en servaient pour du #speech2text (j'ai dû mal cherché et rien trouvé d'utilisable pour un end user...), et donc je me demandais si peertube envisageait d'implémenter ça?

Même avec des trad approximatives, ce serait d'une grande aide pour gagner du temps sur les traductions en général je suppose?

Recent searches

Search options

Administered by:

Server stats:

#speech2text