Mistral TTS
pendingby Maria Cecilia Espina
Text-to-speech using Mistral Voxtral with voice cloning, streaming playback, and system voices fallback.
Obsidian Mistral TTS
Text-to-speech plugin for Obsidian with two engines: Mistral Voxtral (cloud, best quality) and system voices (local, instant).
Features
- Two TTS engines -- switch between Mistral cloud TTS and your OS system voices
- Read notes aloud -- entire note or selected text
- Streaming playback -- starts playing in ~0.8s using Web Audio API (Mistral)
- Voice cloning -- clone any voice from a 2-3 second audio sample (Mistral)
- Voice preview -- listen to any voice before selecting it
- Save audio to vault -- next to the note or in a dedicated folder
- Markdown-aware -- strips frontmatter, headings, code blocks, and formatting before speaking
- Multiple entry points -- command palette, right-click menus, ribbon icon, status bar controls
Installation
From Obsidian Community Plugins
- Open Obsidian Settings > Community Plugins > Browse
- Search for "Mistral TTS"
- Install and enable
Manual Installation
- Download
main.js,manifest.json, andstyles.cssfrom the latest release - Create a folder
mistral-ttsin your vault's.obsidian/plugins/directory - Copy the three files into that folder
- Restart Obsidian and enable "Mistral TTS" in Settings > Community Plugins
Setup
Mistral Voxtral (cloud)
- Get an API key from console.mistral.ai
- Open plugin settings, paste your API key
- Create a voice by uploading a short audio sample (2-3 seconds), or use a pre-existing one
- Select text and use "Read aloud" from the right-click menu or command palette
System Voices (local)
- Open plugin settings, switch the TTS engine dropdown to "System voices"
- Pick a voice from the dropdown (uses your OS built-in voices)
- No API key needed, works offline
Commands
| Command | Description |
|---|---|
| Read selection aloud | Speaks the currently selected text |
| Read entire note aloud | Speaks the full active note |
| Pause / resume playback | Toggle pause |
| Stop playback | Stop and reset |
Audio Formats (Mistral)
| Format | Use case |
|---|---|
| MP3 | General use (default) |
| WAV | Highest quality |
| FLAC | Lossless compression |
| Opus | Low bitrate |
Streaming mode uses PCM for lowest latency (~0.8s to first audio).
Supported Languages
English, French, Spanish, Portuguese, Italian, Dutch, German, Hindi, Arabic.
Development
git clone https://github.com/mcmespinaa/obsidian-mistral-tts.git
cd obsidian-mistral-tts
npm install
npm run dev # watch mode
npm run build # production build
Support
If you find this plugin useful, you can buy me a coffee.
License
For plugin developers
Search results and similarity scores are powered by semantic analysis of your plugin's README. If your plugin isn't appearing for searches you'd expect, try updating your README to clearly describe your plugin's purpose, features, and use cases.