text-to-speech
updated
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper
• 2404.14700
• Published • 32
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper
• 2306.15687
• Published • 1
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and
Diffusion Models
Paper
• 2403.03100
• Published • 37
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through
Direct Preference Optimization
Paper
• 2404.09956
• Published • 11
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech
Prompts
Paper
• 2307.07218
• Published • 28
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive
Bias
Paper
• 2306.03509
• Published • 5
parler-tts/dac_44khZ_8kbps
76.7M • Updated • 586
• 19
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
• 0.6B • Updated • 2.99k
• 358
Wenetspeech4TTS/WenetSpeech4TTS
Updated • 2.49k
• 86
Text-to-Audio
• Updated • 9
Feature Extraction
• 96.2M • Updated • 2.32M
• • 302
Text-to-Speech
• Updated • 11.7M
• • 6.21k
Text-to-Speech
• 4B • Updated • 243
• 526
Text-to-Speech
• 2B • Updated • 2.3k
• 1.11k
stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
• 4B • Updated • 74
• 197
Text-to-Speech
• Updated • 143
• 417
Text-to-Speech
• 2B • Updated • 6.57k
• • 2.86k