{"id":50903,"date":"2026-04-07T13:29:12","date_gmt":"2026-04-07T13:29:12","guid":{"rendered":"https:\/\/ubuntuhandbook.org\/?p=50903"},"modified":"2026-04-07T13:29:12","modified_gmt":"2026-04-07T13:29:12","slug":"speed-of-sound-new-voice-typing-app-for-linux-desktop","status":"publish","type":"post","link":"https:\/\/ubuntuhandbook.org\/index.php\/2026\/04\/speed-of-sound-new-voice-typing-app-for-linux-desktop\/","title":{"rendered":"Speed of Sound &#8211; New Voice Typing App for Linux Desktop"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-thumbnail wp-image-50904\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound-icon-250x250.webp\" alt=\"\" width=\"250\" height=\"250\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound-icon-250x250.webp 250w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound-icon-300x300.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound-icon-700x700.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound-icon-768x768.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound-icon.webp 1200w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/p>\n<p>Looking a way to automatically type text on your PC by speaking? Here&#8217;s a new app to do the job in Linux desktop!<\/p>\n<p>It&#8217;s <a href=\"https:\/\/github.com\/zugaldia\/speedofsound\" target=\"_blank\" rel=\"noopener\">Speed of Sound<\/a>, a free open-source tool that uses local AI model to convert your speech to text and write anywhere that&#8217;s currently in focus.<\/p>\n<p><!--more--><\/p>\n<p>The app comes with a Whisper Tiny model (made by OpenAI). Just press a key to start recording, speak your text, stop recording, then it will use the model to convert the speech to text, and automatically type into any focused application.<\/p>\n<div style=\"width: 610px;\" class=\"wp-video\"><!--[if lt IE 9]><script>document.createElement('video');<\/script><![endif]-->\n<video class=\"wp-video-shortcode\" id=\"video-50903-1\" width=\"610\" height=\"359\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound.mp4?_=1\" \/><a href=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound.mp4\">https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/speedofsound.mp4<\/a><\/video><\/div>\n<p><b>The whole process works offline on user&#8217;s computer<\/b>. Though, it supports downloading more voice models, connecting to cloud services such as Anthropic, Google, or OpenAI, or using self-hosted Ollama, vLLM or llama.cpp services on your local server that require network permission.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-50906\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-prefs-model-700x700.webp\" alt=\"\" width=\"610\" height=\"610\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-prefs-model-700x700.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-prefs-model-300x300.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-prefs-model-250x250.webp 250w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-prefs-model-768x768.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-prefs-model.webp 922w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><\/p>\n<p><b>It supports multiple languages<\/b>. You can set the primary and secondary languages, then switch between them on the fly.<\/p>\n<p>As non-native speaker, it can accurately type out what I said in English. Though, probably due to language model, it somehow does not work as expected when speaking in my native language.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-50907\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-multilanguages-700x700.webp\" alt=\"\" width=\"610\" height=\"610\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-multilanguages-700x700.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-multilanguages-300x300.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-multilanguages-250x250.webp 250w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-multilanguages-768x768.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-multilanguages.webp 922w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><\/p>\n<h3>How to Install Speed of Sound<\/h3>\n<p>The app <strong>works on both X11 and Wayland<\/strong>, and it&#8217;s available to install in different package formats, choose any one of them that you prefer:<\/p>\n<ul>\n<li>Flatpak package, runs in sandbox for both <code>amd64<\/code> and <code>arm64\/aarch64<\/code> processors.<\/li>\n<li>Snap package, runs in sandbox (<code>amd64<\/code> only, meaning Intel\/AMD processors).<\/li>\n<li>AppImage package, non-install required (<code>amd64<\/code> only).<\/li>\n<li>DEB package for Debian, Ubuntu, Linux Mint, etc distros (<code>amd64<\/code> only).<\/li>\n<li>RPM package for Fedora\/RHEL, etc distros (<code>amd64<\/code> only).<\/li>\n<\/ul>\n<p><b>Flatpak package<\/b><\/p>\n<p>The <a href=\"https:\/\/flathub.org\/en\/apps\/io.speedofsound.SpeedOfSound\" target=\"_blank\" rel=\"noopener\">flatpak package<\/a> can be installed directly from Linux Mint&#8217;s Software Manager, Fedora Workstation GNOME Software, and other Linux&#8217;s package manager that support this package format out-of-the-box.<\/p>\n<p>For other distributions, do the following steps one by one to get it:<\/p>\n<ul>\n<li>First, <a href=\"https:\/\/flathub.org\/en\/setup\" target=\"_blank\" rel=\"noopener\">enable Flatpak support<\/a>. For Debian\/Ubuntu, open terminal (Ctrl+Alt+T) and run command:\n<pre>sudo apt install flatpak<\/pre>\n<\/li>\n<li>Then, install the package via command (log out and back in if you don&#8217;t see app icon afterwards):\n<pre>flatpak install https:\/\/dl.flathub.org\/repo\/appstream\/io.speedofsound.SpeedOfSound.flatpakref<\/pre>\n<\/li>\n<\/ul>\n<p><b>Snap package<\/b><\/p>\n<p>The snap package is available to install in Ubuntu through Ubuntu Software or App Center.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-50908\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-snap-700x473.webp\" alt=\"\" width=\"610\" height=\"412\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-snap-700x473.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-snap-300x203.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-snap-768x519.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-snap.webp 965w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><\/p>\n<p><b>AppImage, DEB, RPM<\/b><\/p>\n<p>The other 3 packages are available to download in the Github releases page via the link below:<\/p>\n<div class=\"wp-block-buttons aligncenter\">\n<div class=\"wp-block-button is-style-fill\"><a class=\"wp-block-button__link has-vivid-cyan-blue-to-vivid-purple-gradient-background has-text-color has-background\" href=\"https:\/\/github.com\/zugaldia\/speedofsound\/releases\" target=\"_blank\" rel=\"noreferrer noopener\">Download (under Assets)<\/a><\/div>\n<\/div>\n<p>For AppImage, simply add executable permission from its properties dialog, then run to launch it. <\/p>\n<p>While, modern Debian, Ubuntu, Fedora, etc distros support double-clicking open then install DEB\/RPM package.<\/p>\n<h3>Set up Speed of Sound<\/h3>\n<p>The first launch of the application will show you a welcome dialog, telling the basic use. You need to click Allow remote interaction, so it can type anything in other application windows.<\/p>\n<div id=\"attachment_50909\" style=\"width: 620px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-50909\" class=\"size-large wp-image-50909\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-injection-700x424.webp\" alt=\"\" width=\"610\" height=\"369\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-injection-700x424.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-injection-300x182.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-injection-768x466.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-injection.webp 1016w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><p id=\"caption-attachment-50909\" class=\"wp-caption-text\">grant permission to type in other applications<\/p><\/div>\n<p>It by default supports <code>Super+Z<\/code> keyboard shortcuts to start\/stop listening, though it only works when the Speed of Sound window is open and focused.<\/p>\n<p><b>To set a global shortcut that works when it&#8217;s in background<\/b>, open Preferences and click &#8220;Set up&#8221; button under Global Shortcut.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-50910\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-globalshortcut-700x416.webp\" alt=\"\" width=\"610\" height=\"363\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-globalshortcut-700x416.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-globalshortcut-300x178.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-globalshortcut-768x456.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/ss-globalshortcut.webp 922w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><\/p>\n<p>If that button is NOT available like me (see the last screen), then download the script &#8220;<b>trigger.sh<\/b>&#8221; from <a href=\"https:\/\/github.com\/zugaldia\/speedofsound\/releases\" target=\"_blank\">Github releases page<\/a> (under Assets section), put it anywhere for long time use, and add executable permission.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-script-700x451.webp\" alt=\"\" width=\"610\" height=\"393\" class=\"aligncenter size-large wp-image-50911\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-script-700x451.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-script-300x193.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-script-768x495.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-script.webp 1131w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><\/p>\n<p>Finally, create a custom keyboard shortcut bind to that script.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-shortcut-700x460.webp\" alt=\"\" width=\"610\" height=\"401\" class=\"aligncenter size-large wp-image-50912\" srcset=\"https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-shortcut-700x460.webp 700w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-shortcut-300x197.webp 300w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-shortcut-768x505.webp 768w, https:\/\/ubuntuhandbook.org\/wp-content\/uploads\/2026\/04\/trigger-shortcut.webp 1153w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><\/p>\n<p>After that, you may press that keyboard shortcut to start listening, and press again to stop. For more about the app, see <a href=\"https:\/\/www.speedofsound.io\/\" target=\"_blank\">www.speedofsound.io<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Looking a way to automatically type text on your PC by speaking? Here&#8217;s a new app to do the job in Linux desktop! It&#8217;s Speed of Sound, a free open-source tool that uses local AI model to convert your speech to text and write anywhere that&#8217;s currently in focus.<\/p>\n","protected":false},"author":1,"featured_media":50904,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[52],"tags":[2236],"class_list":["post-50903","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-app-review","tag-input"],"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/posts\/50903","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/comments?post=50903"}],"version-history":[{"count":0,"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/posts\/50903\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/media\/50904"}],"wp:attachment":[{"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/media?parent=50903"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/categories?post=50903"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ubuntuhandbook.org\/index.php\/wp-json\/wp\/v2\/tags?post=50903"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}