The model, which lets enterprises build voice agents for sales and customer engagement, puts Mistral in direct competition with the likes of ElevenLabs, Deepgram, and OpenAI.
Mistral releases a new open source model for speech generation Ivan Mehta 4:30 AM PDT · March 26, 2026 French AI company Mistral released a new open source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support.
The new model, called Voxtral TTS, supports nine languages, including English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic.
“Our customers have been asking for a speech model.
So we built a small-sized speech model that can fit on a smartwatch, a smartphone, a laptop, or other edge devices.
The cost of it is a fraction of anything else on the market, but it offers state-of-the-art performance,” Pierre Stock, VP of science operations at Mistral AI, told TechCrunch during a phone interview.
Mistral said the new model can adapt a custom voice with a sample of less than five seconds and can capture characteristics like subtle accents, inflections, intonations, and irregularities in the flow of speech.