-
LoginSelect language Europe and America 🇺🇸 English (ALL) 🇫🇷 Français (French) 🇩🇪 Deutsch (Germany) 🇪🇸 Español (Spanish) 🇮🇹 Italiano (Italian) 🇧🇷 Português (Portuguese) 🇷🇺 Русский (Russian) 🇹🇷 Türkçe (Turkish) 🇫🇮 Suomi (Finnish) 🇸🇪 Svenska (Sweden) 🇳🇴 Norsk (Norwegian) 🇩🇰 Dansk (Danish) 🇷🇴 Română (Romanian) 🇭🇺 Magyar (Hungarian) 🇵🇱 Polski (Polish) 🇳🇱 Nederlands (Dutch) 🇨🇿 Čeština (Czech) 🇺🇦 Українська (Ukrainian) 🇭🇷 Hrvatski (Croatian) 🇬🇷 Ελληνικά (Greek) 🇸🇰 Slovenčina (Slovak) 🇧🇬 Български (Bulgarian) 🇸🇮 Slovenščina (Slovenian) ca Català (Catalan) 🇮🇪 Gaeilge (Irish) 🇱🇹 Lietuvių (Lithuanian) 🇲🇹 Malti (Maltese) 🇪🇪 Eesti (Estonian) 🇮🇸 Íslenska (Icelandic) 🇱🇻 Latviešu (Latvian) 🇬🇧 Welsh (Cymraeg) 🇲🇰 Macedonian (Македонски) 🇧🇦 Bosnian (Bosanski) 🇪🇸 Galician (Galego) 🇺🇿 Uzbek (Oʻzbekcha) 🇷🇸 Serbian (Српски) 🇨🇦 Inuktitut (ᐃᓄᒃᑎᑐᑦ) 🇨🇦 Inuktitut (Latin) Asia and Africa 🇨🇳 中文 (简体/Simplified Chinese) 🇭🇰 中文 (繁體/Traditional Chinese) 🇰🇷 한국 (Korean) 🇯🇵 日本語 (Japanese) 🇻🇳 Việt (Vietnam) 🇲🇾 Melayu (Malaysia) 🇮🇩 Indonesia (Indonesia) 🇹🇭 ภาษาไทย (Thai) 🇦🇪 Arabic (العربية) 🇮🇱 בעברית (Israel) 🇮🇳 हिंदी (Hindi) 🇮🇳 اردو (Urdu) 🇮🇳 தமிழ் (Tamil) 🇮🇳 मराठी (Marathi) 🇮🇳 తెలుగు (Telugu) 🇮🇳 ગુજરાતી (Gujarati) 🇮🇳 മലയാളം (Malayalam) 🇧🇩 বাংলা (Bengali) 🇮🇳 ಕನ್ನಡ (Kannada) sw Kiswahili (Swahili) 🇿🇦 Afrikaans 🇿🇦 Zulu (IsiZulu) 🇦🇿 Azerbaijani (Azərbaycanca) 🇮🇩 Javanese (ꦧꦱꦗꦮ) 🇬🇪 Georgian (ქართული) 🇰🇿 Kazakh (Қазақша) 🇰🇭 Khmer (ភាសាខ្មែរ) 🇱🇦 Lao (ລາວ) 🇲🇳 Mongolian (Монгол) 🇲🇲 Burmese (ဗမာ) 🇳🇵 Nepali (नेपाली) 🇦🇫 Pashto (پښتو) 🇱🇰 Sinhalese (සිංහල) 🇸🇴 Somali (Soomaali) 🇮🇳 Assamese (অসমীয়া) 🇦🇲 Armenian (Հայերեն) 🇮🇳 Odia (ଓଡ଼ିଆ) 🇮🇳 Punjabi (ਪੰਜਾਬੀ)
AI Voice Library
Filters
No AI voices found matching your criteria. Please reset your search or filter to try again.
Your favorite voices will appear here
Dialogue Block Conversion History Dialogue Block Conversion History Demo
No conversion history yet
No conversion records found for this Dialogue Block. Please convert the Dialogue Block to speech first.
#[[ index + 1 ]]
Latest [[ history_panel.current_block.ai_voice_config.name || 'Voice' ]] [[ format_history_time(record.create_time) ]]No conversion records found for this Dialogue Block. Please convert the Dialogue Block to speech first.
Generate All Dialogue Blocks
Status
[[generate_all.status_text]]
Current Failed Dialogue Blocks:
Audio Export History Demo
No conversion history yet
Click the Generate Dialogue Audio button to create new merged audio files
Generated Audio Files
Download or play your generated audio files.#[[index + 1]] Latest
Created: [[format_history_time(merge_record.create_time)]]Dialogue Blocks: [[merge_record.blocks_count]] files processed
Generate All Settings
Current Settings Preview:
- Audio Format: [[generate_all.settings.audio_format.toUpperCase()]]
- Audio Quality: [[generate_all.settings.voice_high_quality ? "High Quality (large size, slow synthesis)" : "Standard Quality (small size, fast synthesis)"]]
- Pause Interval Between Dialogue Blocks: [[get_current_pause_interval()]]
Send Feedback
Upgrade plan
Upgrade Required
Upgrade to Pro/Studio to unlock all premium features!
The AI Voice Dialogue Generator feature is exclusively available to users with the following subscription levels: Pro/Studio.
Explore the basic functionality here. To create, edit, rename, or delete projects, please upgrade to a Pro or Studio subscription (Lite plan not supported).
| Plan & Features | AI Voice Generator (Single Speaker) |
AI Voice Dialogue Generator (Multi-Speaker Dialogue) |
Multi-emotional Settings (Emotion) |
API support API for Developers |
|---|---|---|---|---|
| Free | Not Available | Not Available | Not Available | Not Available |
| Lite | Available | Not Available | Not Available | Not Available |
| Pro/Studio | Available | Available | Available | Available |
-
Usage tips:
- Characters (Pronounce as characters)
- Cardinal (Pronounce as a number)
- Ordinal (Pronounce as an ordinal)
- Digits (Pronounce as digits)
- Fraction (Pronounce as a fraction)
- Date (Pronounce as a date)
- Time (Pronounce as time)
- Telephone (Pronounce as a phone number)
- Currency (Pronounce as currency)
- Address (Pronounce as an address)
- Name (Pronounce as a name)
To use say-as, first select the text you want to mark, then choose a tag. Or, insert a tag and fill in the content between the tags. This helps the TTS engine interpret the text in a specific way (e.g., as a date, address, etc.).
Example:
<say-as interpret-as="telephone">123-456-7890</say-as>
-
Usage tips:
- Strong (Strong emphasis level)
- Moderate (Moderate emphasis level)
- Reduced (Reduced emphasis level)
After adding the emphasis tag, write the text you want to emphasize between the tags. You can choose the emphasis level: strong, moderate, or reduced. This will make the TTS engine stress the selected text accordingly.
Example:
<emphasis level="moderate">Hello</emphasis>
-
Usage tips:
- Insert phoneme tag
Step 1: Add the phoneme tag to your word or phrase. Step 2: Replace the value in ph= with the correct IPA pronunciation. You can use ChatGPT or other tools to generate the IPA for your word.
Example:
<phoneme alphabet="ipa" ph="tə.ˈmeɪ.toʊ">tomato</phoneme>