Vocalinux

Voice-to-text for Linux, finally done right!

A seamless, free, open-source, and private voice dictation system for Linux, comparable to the built-in solutions on macOS and Windows.

🎉 Alpha Release!

We're excited to share Vocalinux with the community. Try it out and let us know what you think!

✨ Features

🎤 Double-tap Ctrl to start/stop voice dictation
⚡ Real-time transcription with minimal latency
🌎 Universal compatibility across all Linux applications
🔒 Offline operation for privacy and reliability (with VOSK)
🤖 Optional Whisper AI support for enhanced accuracy
🎨 System tray integration with visual status indicators
🔊 Audio feedback for recording status
⚙️ Graphical settings dialog for easy configuration

🚀 Quick Install

One-liner Installation (Recommended)

curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/install.sh | bash -s -- --tag=v0.4.0-alpha

Note: This command installs v0.4.0-alpha. For the most recent version, check GitHub Releases.

This will:

- Clone the repository to ~/.local/share/vocalinux-install
- Install all system dependencies
- Set up a virtual environment in ~/.local/share/vocalinux/venv
- Install both the VOSK and Whisper AI speech engines:
  - VOSK: installs the vosk Python package from PyPI
  - Whisper: installs the openai-whisper package from PyPI, which also pulls in PyTorch (the ML framework Whisper requires)
- Create a symlink at ~/.local/bin/vocalinux
- Download the default Whisper tiny speech model (~75MB)

⏱️ Note: Installation takes ~5-10 minutes due to Whisper AI dependencies (PyTorch with CUDA support, ~2.3GB).

Whisper with CPU-only PyTorch (no NVIDIA GPU needed):

curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/install.sh | bash -s -- --tag=v0.4.0-alpha --whisper-cpu

This installs Whisper with CPU-only PyTorch (~200MB instead of ~2.3GB). It works well on systems without an NVIDIA GPU.

For low-RAM systems (8GB or less) - VOSK only:

curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/install.sh | bash -s -- --tag=v0.4.0-alpha --no-whisper

This skips Whisper installation entirely and configures VOSK as the default engine.

Alternative: Install from Source

# Clone the repository
git clone https://github.com/jatinkrmalik/vocalinux.git
cd vocalinux

# Run the installer (will prompt for Whisper)
./install.sh

# Or with Whisper support
./install.sh --with-whisper

The installer handles everything: system dependencies, Python environment, speech models, and desktop integration.

After Installation

# If ~/.local/bin is in your PATH (recommended):
vocalinux

# Or activate the virtual environment first:
source ~/.local/bin/activate-vocalinux.sh
vocalinux

# Or run directly:
~/.local/share/vocalinux/venv/bin/vocalinux

Or launch it from your application menu!
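
If the plain vocalinux command is not found, the usual cause is that ~/.local/bin is missing from PATH. The sketch below is one way to check this from Python. The helper name local_bin_on_path is made up for this example and is not part of Vocalinux.

```python
import os
from pathlib import Path


def local_bin_on_path(path_env: str, home: str) -> bool:
    """Return True if <home>/.local/bin is one of the PATH entries.

    Hypothetical helper for illustration; not shipped with Vocalinux.
    """
    target = Path(home) / ".local" / "bin"
    # Skip empty entries, which can appear when PATH has a stray separator.
    entries = [Path(p) for p in path_env.split(os.pathsep) if p]
    return target in entries


if __name__ == "__main__":
    found = local_bin_on_path(os.environ.get("PATH", ""), str(Path.home()))
    print("~/.local/bin is on PATH" if found else "~/.local/bin is NOT on PATH")
```

If the check reports that the directory is missing, either add it to PATH in your shell profile or use one of the other two launch methods above.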

Uninstall

# If installed via curl:
curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/uninstall.sh | bash

# If installed from source:
./uninstall.sh

📋 Requirements

OS: Ubuntu 22.04+ (other Linux distros may work)
Python: 3.8 or newer
Display: X11 or Wayland
Hardware: Microphone for voice input
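
Two of these requirements can be checked quickly before installing. The sketch below assumes your login manager sets the XDG_SESSION_TYPE environment variable (commonly "x11" or "wayland"); this is typical on desktop Linux but not guaranteed. The function names are made up for this example and are not part of Vocalinux.

```python
import os
import sys


def meets_python_requirement(version_info=sys.version_info) -> bool:
    """True if the interpreter is Python 3.8 or newer (per the Requirements list)."""
    return tuple(version_info[:2]) >= (3, 8)


def detect_session_type(environ=os.environ) -> str:
    """Return the display session type, or "unknown" if it is not advertised.

    Assumption: XDG_SESSION_TYPE is set by the login manager.
    """
    return environ.get("XDG_SESSION_TYPE", "unknown").lower()


if __name__ == "__main__":
    print("Python OK:", meets_python_requirement())
    print("Session type:", detect_session_type())
```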

🎙️ Usage

Voice Dictation

Double-tap Ctrl to start recording
Speak clearly into your microphone
Double-tap Ctrl again (or pause speaking) to stop
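
To illustrate the double-tap idea, here is a minimal sketch of how two quick key presses can be told apart from two unrelated ones. It is not Vocalinux's actual implementation: the class name DoubleTapDetector and the 0.3 second window are assumptions chosen for the example.

```python
class DoubleTapDetector:
    """Report a double tap when two taps arrive within max_gap seconds.

    Hypothetical sketch; the 0.3 s default is an assumed value.
    """

    def __init__(self, max_gap: float = 0.3):
        self.max_gap = max_gap
        self._last_tap = None  # timestamp of the previous unpaired tap

    def on_tap(self, timestamp: float) -> bool:
        """Feed one key-press timestamp; return True if it completes a double tap."""
        if self._last_tap is not None and 0 <= timestamp - self._last_tap <= self.max_gap:
            # Consume both taps so a third quick tap starts a new pair.
            self._last_tap = None
            return True
        self._last_tap = timestamp
        return False
```

Resetting the stored timestamp after a match means a rapid triple tap counts as one double tap plus one pending tap, which avoids toggling dictation on and off in quick succession.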

Voice Commands

Command	Action
"new line"	Inserts a line break
"period" / "full stop"	Types a period (.)
"comma"	Types a comma (,)
"question mark"	Types a question mark (?)
"exclamation mark"	Types an exclamation mark (!)
"delete that"	Deletes the last sentence
"capitalize"	Capitalizes the next word

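The punctuation commands in the table above amount to a phrase-to-symbol mapping applied to the transcribed text. The sketch below shows one simple way to do that with a regular expression. It is an illustration only, not the project's command processor: the function name is made up, and the stateful commands ("delete that", "capitalize") are left out.

```python
import re

# Spoken phrase -> inserted text, taken from the command table.
PUNCTUATION = {
    "new line": "\n",
    "period": ".",
    "full stop": ".",
    "comma": ",",
    "question mark": "?",
    "exclamation mark": "!",
}

# Try longer phrases first so multi-word commands match as a whole.
_PHRASES = sorted(PUNCTUATION, key=len, reverse=True)
_PATTERN = re.compile(
    r"\s*\b(" + "|".join(re.escape(p) for p in _PHRASES) + r")\b",
    re.IGNORECASE,
)


def apply_punctuation_commands(text: str) -> str:
    """Replace spoken punctuation phrases, attaching symbols to the previous word."""
    replaced = _PATTERN.sub(lambda m: PUNCTUATION[m.group(1).lower()], text)
    # Drop spaces left behind at the start of a new line.
    return re.sub(r"\n[ \t]+", "\n", replaced)
```

For example, apply_punctuation_commands("hello comma world period") returns "hello, world.". A real implementation also has to handle cases where the user means the word itself, such as "the Jurassic period".
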
Command Line Options

vocalinux --help              # Show all options
vocalinux --debug             # Enable debug logging
vocalinux --engine whisper    # Use Whisper AI engine
vocalinux --model medium      # Use medium-sized model
vocalinux --wayland           # Force Wayland mode

⚙️ Configuration

Configuration is stored in ~/.config/vocalinux/config.json:

{
  "speech_recognition": {
    "engine": "vosk",
    "model_size": "small",
    "vad_sensitivity": 3,
    "silence_timeout": 2.0
  }
}

You can also configure settings through the graphical Settings dialog (right-click the tray icon).
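
The config file can also be edited from a script. The sketch below reads the file, switches the engine, and writes it back, using only the keys shown in the example config above. The function names are made up for this example. Whether a running Vocalinux instance picks up external edits without a restart is not stated here, so assume a restart is needed.

```python
import json
from pathlib import Path

CONFIG_PATH = Path.home() / ".config" / "vocalinux" / "config.json"


def load_config(path: Path = CONFIG_PATH) -> dict:
    """Read the JSON config; return an empty dict if the file does not exist."""
    if not path.exists():
        return {}
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)


def set_engine(config: dict, engine: str) -> dict:
    """Return a copy of config with speech_recognition.engine set to engine."""
    # "vosk" and "whisper" are the two engine names this README mentions.
    if engine not in ("vosk", "whisper"):
        raise ValueError(f"unknown engine: {engine!r}")
    updated = dict(config)
    section = dict(updated.get("speech_recognition", {}))
    section["engine"] = engine
    updated["speech_recognition"] = section
    return updated


def save_config(config: dict, path: Path = CONFIG_PATH) -> None:
    """Write the config as indented JSON, creating the directory if needed."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")


if __name__ == "__main__":
    save_config(set_engine(load_config(), "whisper"))
```

set_engine returns a new dictionary rather than mutating its input, so other keys such as model_size and silence_timeout are preserved unchanged.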

🔧 Development Setup

# Clone and install in dev mode
git clone https://github.com/jatinkrmalik/vocalinux.git
cd vocalinux
./install.sh --dev

# Activate environment
source venv/bin/activate

# Run tests
pytest

# Run from source with debug
python -m vocalinux.main --debug

📁 Project Structure

vocalinux/
├── src/vocalinux/           # Main application code
│   ├── speech_recognition/  # Speech recognition engines
│   ├── text_injection/      # Text injection (X11/Wayland)
│   ├── ui/                  # GTK UI components
│   └── utils/               # Utility functions
├── tests/                   # Test suite
├── resources/               # Icons and sounds
├── docs/                    # Documentation
└── web/                     # Website source

📖 Documentation

Installation Guide - Detailed installation instructions
Update Guide - How to update Vocalinux
User Guide - Complete user documentation
Contributing - Development setup and contribution guidelines

🗺️ Roadmap

🤝 Contributing

We welcome contributions! Whether it's bug reports, feature requests, or code contributions, please check out our Contributing Guide.

Quick Links

⭐ Support

If you find Vocalinux useful, please consider:

⭐ Starring this repository
🐛 Reporting bugs you encounter
📖 Improving documentation
🔀 Contributing code

📜 License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.

Made with ❤️ for the Linux community
