https://github.com/coqui-ai/tts https://github.com/serp-ai/bark-with-voice-clone https://github.com/metavoiceio/metavoice-src https://github.com/myshell-ai/OpenVoice https://github.com/collabora/WhisperSpeech https://github.com/neonbjb/tortoise-tts
Has anyone else had success with these? Are there other projects I should look at?
https://www.ddmckinnon.com/2024/10/03/dans-weekly-ai-speech-...
I tried zero-shot voice cloning in all of the top OSS models in the Arena and performance was bad.
There is still a big gap between 11Labs and Character.ai and the VoiceCraft voices would not be confused for the real speaker, but this is much closer.