Build a simplified inference wrapper for tokenizer-free TTS models. Focus on creating an easy-to-use API or UI that allows creators to perform voice cloning and creative manipulation.
Suggested repo: VoxFlow
"Diffusion-powered, tokenizer-free TTS in your own repo."
Estimated effort: 25h