daxophoneme

daxophoneme t1_itzpuki wrote

Many of those examples really showed off the fact that their dataset was built from a lot of badly recorded sound clips. Yikes! Seems like the quality of training is going to be very important.

Now, those examples at the bottom of the page where they map one sound onto the contour of another are what interest me. A friend of mine is working on something similar and more sophisticated.

7