


Implementation of the neural network proposed in Natural Speech, a text-to-speech generator
Natural Speech – Pytorch (wip) Implementation of the neural network proposed in Natural Speech, a text-to-speech generator that is indistinguishable from human recordings for the first time. The novelty of the paper...
Natural sounding text-to-speech in the terminal (and more)
gosling Natural sounding text-to-speech in the terminal (and more). Pre-requisites This is NOT intended to be a completely-free, pick-up-and-use TTS solution. In fact, it is simply a wrapper around Google’s Cloud Text-to-Speech API. You will need: A GCP account with billing enabled. Google gives you 1 million characters free every month. That’s nearly 10 books a month. See pricing. Once you have a GCP account, enable the TTS...




HOVI: Creates Text-to-Speech messages from people's face movement to help people who have difficulty communicating with one's voice
🌱 2022 Solution Challenge: HOVI 🌱 Team Member: Kang Inyeong, Kim Yeonghyeon, Lee Seulbi, Park Jisoo from GDSC SeoulTech (2021.12.21-ing) 🌱 Index What is HOVI? What is HOVI’s SDGs? Who can be a HOVI’s...

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
DiffGAN-TTS – PyTorch Implementation PyTorch implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs Repository Status Naive Version of DiffGAN-TTS Active...


Are we on the verge of a learning system breakthrough?
I've been experimenting with a popular video-based training platform as I continue my journey as a lifelong learner. The platform is excellent and offers a variety of resources; in fact, there are so many options to choose from it's sometimes hard deciding where to go next. And that's the problem. The platform doesn't really know me. It's designed as a one-to-many approach where the content is extremely broad and it's up to the user to...

Austin or Boston? Making artificial speech more expressive, natural, and controllable
Did you say you wanted to book a flight to Austin… or Boston? Even a human would at times struggle to differentiate between the names of these two cities — they do sound quite similar. An AI in a dialog with a user...



TikTok adds auto captions to make videos accessible to hard of hearing and deaf
TikTok this morning announced the launch of a new feature designed to make its app accessible to people who are hard of hearing or deaf. The company is today debuting auto captions — a feature that, when enabled, will...

