Sarvam AI Launches Bulbul-v2 with Realistic Indian Accents

Sarvam AI has launched Bulbul-v2, an advanced text-to-speech model supporting 11 Indian languages with natural Indian accents.
Built for real-time speech synthesis, the model includes features like pitch, pace, and loudness control, smart text preprocessing, and multiple audio sample rates.
As the first Indian startup chosen to build a sovereign LLM under the IndiaAI mission, Sarvam AI is contributing significantly to linguistic accessibility and AI innovation in India.
Key Features of Bulbul-v2
- Supports 11 Indian languages with regional accent precision.
- Enables real-time synthesis and multi-language (including code-mixed) text support.
Fine-grained control over,
- Pitch
- Pace
- Loudness
- Multiple sample rates (8kHz to 24kHz).
- Smart text preprocessing: normalises numbers, dates, and mixed-language content.
Aim and Objectives
- To democratise AI voice technology for Indian users.
- Offer a customisable, natural-sounding voice model suitable for various business and branding applications.
- Promote linguistic inclusivity in India’s digital ecosystem.