Submitted by Apprehensive-Tax-214 t3_107u88d in MachineLearning

Improve Tortoise TTS by 30% inference speed, and packaged it up as a hosted API that charges per-second. All code is open-sourced: https://github.com/metavoicexyz/tortoise-tts-modal-api, https://github.com/metavoicexyz/tortoise-tts

It can be used via a UI on: https://tts.themetavoice.xyz

There are more details here: https://twitter.com/vatsal_aggarwal/status/1612536547248836608?s=20

17

Comments

You must log in or register to comment.

iaalaughlin t1_j3qcffy wrote

30% faster than what?

How much does it cost? I’m not signing in just to see the pricing.

Why not use replicate’s api?

$0.00055 per second or $0.033 per minute. Is that cheaper or more expensive than what you are charging?

https://replicate.com/afiaka87/tortoise-tts

8

Impossible-Bus-6729 t1_j3rl5zi wrote

You are absolutely right. However, despite being open source, it requires a lot of time/effort/resources to get the tortoise text-to-speech synthesis working. Therefore, it's very difficult for a non-technical person to use, and for most developers looking to build applications on top it's a waste of time since they will have to reinvent the wheel to create an ML endpoint.

That is why we built this managed API on top for anyone to get started with testing the model / integrating TTTS quickly. Hope that helps?

We open-sourced our ML infrastructure for TTTS deployment so that others can contribute to make it faster, scale better or add features like easy fine-tuning with data.

3

Impossible-Bus-6729 t1_j3rm2wg wrote

30% faster than the original TTTS!

We are providing the API at cost, i.e. $0.0005833/second.
We also provide $5 of free credits to try out the API. It currently costs $0.03/30words. Therefore, $5 ~ 160 API calls.

Based on your comment, its at par with replicate pricing. But, we will be faster with inference times & hence yielding micro-savings compounding over time

3

aledinuso t1_j3rx7pn wrote

A multilingual version of this would be nice

1

RuairiSpain t1_j3tj6zs wrote

Tried it, returns blank audio on my mobile

1

MrHumun t1_j3x4s1p wrote

Where are you hosting GPU for this?

1