March 30, 2026

Smallest.ai launches Lightning V3, claims to beat ElevenLabs on conversational TTS

Smallest.ai released Lightning V3 on March 27, 2026, a text-to-speech model targeting real-time voice agents. The San Francisco-based startup claims the model beats OpenAI, Cartesia, and ElevenLabs on conversational quality benchmarks. The claims are self-reported, and the model weights are not publicly available.

Smallest.ai homepage showing the Lightning TTS platform with voice agent and text-to-speech features — Smallest.ai homepage as of March 29, 2026. Source: smallest.ai

What Lightning V3 is

Lightning V3 is a non-autoregressive TTS model that generates audio at 44,100 Hz. Smallest.ai says it requires under 1 GB of VRAM and achieves a real-time factor of 0.01, meaning it generates 10 seconds of speech in 100 milliseconds. Time-to-first-audio is under 100 milliseconds according to the company (Smallest.ai blog, Mar 27, 2026).

The release includes two sub-versions. V3.1 is the default model optimized for natural conversation, with a reported WVMOS score of 5.06. V3.2 adds instruction-following: it accepts directives for emotion, pitch, volume, and whispering.

Both versions support 15 languages as of March 27, 2026: English, Spanish, French, Italian, Dutch, Swedish, Portuguese, German, Hindi, Tamil, Kannada, Telugu, Malayalam, Marathi, and Gujarati (Smallest.ai blog, Mar 27, 2026). The model handles automatic language detection and mid-sentence code-switching.

Seven of those 15 languages are Indic. That is not a coincidence.

Listen: Lightning V3 vs ElevenLabs

Methodology

Both samples generated by VoiceRadar on March 30, 2026 via official APIs. Same text for both, only the last sentence differs (provider name). Smallest.ai: Lightning V3.1, voice Magnus, 44,100 Hz, mp3. ElevenLabs: Flash v2.5, voice Roger, 44,100 Hz, mp3. No post-processing applied. Note: Lightning V3.1 mispronounces the ".io" TLD in "voiceradar.io." ElevenLabs Flash v2.5 handles it correctly. The samples are published unmodified.

Introducing Lightning V3 — it beats every model we tested against. ElevenLabs, Cartesia, OpenAI. Lightning sets a new SOTA with V3 in conversational text-to-speech.

→ Highest MOS score for conversational TTS at 3.9
→ ~76% win rate vs gpt-4o-mini-tts on naturalness
→ 15 languages with mid-sentence code-switching
→ Built from scratch for voice agents, not read-aloud

View on X

The company behind it

Smallest.ai was founded in 2023 by Sudarshan Kamath and Akshat Mandloi, both former Bosch engineers and IIT Guwahati graduates. Kamath previously led AI perception for self-driving vehicles at Bosch and contributed to Toppr’s $150 million acquisition (Inc42, Aug 2025).

The company has raised $8.26 million across four rounds, with an $8 million seed led by Sierra Ventures in August 2025. Other investors include 3one4 Capital, Better Capital, AWS Startups, and individual angel Shyamal Hitesh Anadkat, who works at OpenAI (Crunchbase).

Beyond TTS, Smallest.ai operates two products: Waves (TTS, voice cloning, voice conversion) and Atoms (a real-time voice agent platform). The company reported handling over 1 million calls per month for enterprise customers and serving 5,000+ SMBs across India and the United States as of its August 2025 funding announcement (Inc42, Aug 2025).

The benchmark claims

Smallest.ai reports the following scores for Lightning V3 as of March 27, 2026:

MOS (conversational): 3.89
WVMOS (V3.1): 5.06
Intonation: 3.33 (highest among competitors tested)
Prosody: 3.07 (highest among competitors tested)
WER: 5.38%
Preference vs OpenAI gpt-4o-mini-tts: 76.2% in blind listening tests across 1,088 samples in English, Hindi, Spanish, and Tamil

The preference evaluation used an LLM-as-a-Judge framework on the Seed TTS dataset (Smallest.ai blog, Mar 27, 2026).

Editorial note

VoiceRadar has not independently verified these benchmark claims. All scores cited above are self-reported by Smallest.ai. The preference evaluation uses LLM-as-a-Judge rather than human evaluators. No independent third-party benchmark has been published as of March 29, 2026. Press coverage of the launch consists entirely of ANI/PTI wire syndication, with no coverage from independent tech publications such as TechCrunch, VentureBeat, or The Verge.

The closed-source question

The company name is “Smallest” and the pitch centers on efficiency: under 1 GB VRAM, non-autoregressive architecture, runs on small systems. But the model weights are not publicly available. You cannot download Lightning V3, inspect its architecture, measure its actual VRAM usage, or reproduce the benchmark results. Access is API-only, starting at $9 per month.

This stands in direct contrast to Mistral’s Voxtral TTS, which launched one day earlier on March 26, 2026 with open weights on Hugging Face. Voxtral is a 4B-parameter model that can be downloaded, run locally on a Mac with Apple Silicon via MLX, and independently benchmarked by anyone (Mistral AI, Mar 26, 2026). When Mistral claims performance numbers, you can verify them yourself.

When a company claims “smallest and fastest” but does not release the model, the claims remain marketing until proven otherwise.

Smallest.ai documentation page showing API reference for the Lightning TTS model — Smallest.ai API documentation as of March 29, 2026. The API is the only way to access Lightning V3. Source: smallest.ai/docs

Pricing and availability

Lightning V3 is available through Smallest.ai’s API on the Pro plan at $9 per month, with usage at approximately $0.025 per 1,000 characters for V3.1 as of March 2026. Enterprise pricing is custom and includes on-premise deployment, 99.99% uptime SLA, and HIPAA/SOC2/GDPR/ISO 27001 compliance (Smallest.ai).

Smallest.ai pricing page showing Pro plan at $9 per month and Enterprise custom pricing — Smallest.ai pricing as of March 29, 2026. Pro plan starts at $9/month. Source: smallest.ai

For comparison as of March 2026:

Provider	Model	Per 1K chars	Open source
OpenAI	gpt-4o-mini-tts	$0.015	No
Mistral	Voxtral TTS	$0.016	Yes (open weights)
Smallest.ai	Lightning V3.1	$0.025	No
ElevenLabs	Flash Turbo	$0.06	No
ElevenLabs	Multilingual v3	$0.12	No

OpenAI and Mistral are cheaper per character. OpenAI does not offer voice cloning. Mistral’s model launched days ago and has not been widely tested in production. ElevenLabs remains the most expensive per character but has the largest language support (70+), the broadest product suite, and the strongest enterprise traction as of March 2026.

What this tells us about the TTS market in March 2026

Two TTS models launched within 24 hours of each other in the last week of March 2026. One is open-source with verifiable claims. The other is closed-source with self-reported benchmarks and wire-syndicated press coverage.

Both are priced below ElevenLabs. Both target the voice agent market. Both claim to match or beat ElevenLabs on quality.

The TTS market is fragmenting. Pricing pressure is coming from every direction: open-source models that anyone can run, Indian startups operating with lower cost structures, and big tech integrating TTS into existing platforms. For developers evaluating a TTS provider in 2026, the question is shifting from “which one sounds best” to “which claims can I actually verify.”

VoiceRadar will be running independent benchmarks of open-source TTS models on consumer hardware in the coming weeks. Results will be published here.