1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
const axios = require('axios');
const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/veena-max-tts";
const data = {
"text": "Segmind lagao, model chalao, itna tez ki result aane se pehle chai bhi tthandi na ho.",
"speaker_id": "vinaya_assist",
"normalize": true
};
(async function() {
try {
const response = await axios.post(url, data, { headers: { 'x-api-key': api_key } });
console.log(response.data);
} catch (error) {
console.error('Error:', error.response.data);
}
})();
Provide the text to convert into speech. Use greetings or instructions, like 'Welcome to VeenaMAX, your TTS solution.'
Choose a voice for your text. For a calm tone, select 'soumya_calm'; for impact, select 'agastya_impact'.
Allowed values:
Enable text normalization for better pronunciation. Use this for complex texts or mixed languages.
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
Edited by Segmind Team on September 20, 2025.
VeenaMAX by Maya Research is a high-performance Text-to-Speech (TTS) model designed for Indian languages and multi-script content. It functions with high accuracy by transforming written text into natural, humanized speech enriched with expressive touch in tonality. It effectively supports Hindi (Devanagari and Roman), English, as well as more conversational Hinglish. The built-in emotional intelligence feature with 8 distinct voice personalities makes it an excellent model for several industries, especially with its super-fast processing power useful in real-time streaming for interactive applications, and audio output.
soumya_calm
for informational content and educational materialagastya_impact
for marketing and announcement contentvinaya_assist
(default) for customer service applicationscharu_soft
or mohini_whispers
for gentle, natural conversationsVeenaMAX features automatic script detection and seamless code-switching (for smooth transition between languages), which is effective in natural pronunciation of mixed language content without manual intervention.
Streaming mode enables real-time audio output useful for interactive applications, while non-streaming mode generates complete audio files for download or storage.
VeenaMAX includes a domain-specific terminology option that enables users to select appropriate voice personalities and enable text normalization for industry-specific content such as banking, healthcare, and other industries.
You can select the voice personality based on your use case: soumya_calm
for professional content, agastya_impact
for engaging announcements, vinaya_assist
for customer service, and other options for specific emotional tones.
Text normalization automatically customizes pronunciation, numbers, and special characters for a natural speech output. It is highly effective for multi-language content and complex terminology.