Viewing a single comment thread. View all comments

rshah4 t1_jdy111n wrote

How about using embeddings from open-source models like those at Hugging Face. That would save your embedding costs.

2

darkbluetwilight OP t1_jdy6eu6 wrote

Nice suggestion thanks! llama-index currently uses an embedding version of Ada which has negligible pricing (0.0002/1000tokens I think) The once-off index creation (1.3million tokens) cost about 40c.
It was the AI text generation costs that was killing me.

2